Yuhua Yao

ORCID: 0000-0003-4811-646X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Machine Learning in Bioinformatics
  • Fractal and DNA sequence analysis
  • RNA and protein synthesis mechanisms
  • Genomics and Phylogenetic Studies
  • Cancer-related molecular mechanisms research
  • Computational Drug Discovery Methods
  • Astrophysics and Cosmic Phenomena
  • Influenza Virus Research Studies
  • RNA Research and Splicing
  • RNA modifications and cancer
  • vaccines and immunoinformatics approaches
  • Circular RNAs in diseases
  • Genomics and Chromatin Dynamics
  • Protein Structure and Dynamics
  • Dark Matter and Cosmic Phenomena
  • MicroRNA in disease regulation
  • AI in cancer detection
  • Radiomics and Machine Learning in Medical Imaging
  • Molecular Biology Techniques and Applications
  • Complex Network Analysis Techniques
  • Bioinformatics and Genomic Networks
  • Lung Cancer Diagnosis and Treatment
  • Gene expression and cancer classification
  • Metabolomics and Mass Spectrometry Studies
  • Gamma-ray bursts and supernovae

Hainan Normal University
2017-2025

University Health Network
2024

Chongqing University
2023-2024

Purple Mountain Observatory
2023

Hebei Normal University
2023

University of Science and Technology of China
2023

Institute of High Energy Physics
2023

Chinese Academy of Sciences
2023

Hainan University
2020-2023

University of Chinese Academy of Sciences
2023

Abstract Breast cancer patients often have recurrence and metastasis after surgery. Predicting the risk of for a breast patient is essential development precision treatment. In this study, we proposed novel multi-modal deep learning prediction model by integrating hematoxylin & eosin (H&E)-stained histopathological images, clinical information gene expression data. Specifically, segmented tumor regions in H&E into image blocks (256 × 256 pixels) encoded each block 1D feature...

10.1093/bib/bbac448 article EN Briefings in Bioinformatics 2022-10-14

Abstract Timely identification of emerging antigenic variants is critical to influenza vaccine design. The accuracy a sequence-based prediction method relies on the choice amino acids substitution matrices. In this study, we first compared comprehensive 95 matrices reflecting various properties in predicting antigenicity viruses by random forest model. We then proposed novel algorithm called joint regression (JRFR) jointly consider top applied JRFR human H3N2 seasonal data from 1968 2003. A...

10.1038/s41598-017-01699-z article EN cc-by Scientific Reports 2017-05-02

Genomic islands (GIs) that are associated with microbial adaptations and carry sequence patterns different from of the host sporadically distributed among closely related species. This bias can dominate signal interest in GI detection. However, variations still exist segments host, although no uniform standard exists regarding best methods discriminating GIs rest genome terms compositional bias. In present work, we proposed a robust software, MTGIpick, which used regions pattern showing...

10.1093/bib/bbw118 article EN Briefings in Bioinformatics 2016-11-01

Bioactive peptides are typically small functional with 2-20 amino acid residues and play versatile roles in metabolic biological processes. multi-functional, so it is vastly challenging to accurately detect all their functions simultaneously. We proposed a convolution neural network (CNN) bi-directional long short-term memory (Bi-LSTM)-based deep learning method (called MPMABP) for recognizing multi-activities of bioactive peptides. The MPMABP stacked five CNNs at different scales, used the...

10.3390/ph15060707 article EN cc-by Pharmaceuticals 2022-06-03

Abstract On the basis of a selected pair physicochemical properties amino acids, we introduce dynamic 2D graphical representation protein sequences. Then, and compare two numerical characterizations graphs as descriptors to analyze nine ND5 proteins. The approach is simple, convenient, fast. Proteins 2008. © 2008 Wiley‐Liss, Inc.

10.1002/prot.22110 article EN Proteins Structure Function and Bioinformatics 2008-06-05

In recent years, it has been increasingly clear that long noncoding RNAs (lncRNAs) play critical roles in many biological processes associated with human diseases. Inferring potential lncRNA-disease associations is essential to reveal the secrets behind diseases, develop novel drugs, and optimize personalized treatments. However, experiments validate are very time-consuming costly. Thus, effective computational models. this study, we have proposed a method called BPLLDA predict based on...

10.3389/fgene.2018.00411 article EN cc-by Frontiers in Genetics 2018-10-16

While the intratumoral microbiota has been discovered to have a close connection with tumor immunity, specific role played by in regulating immune microenvironment (TIME) of lung cancer remains largely unexplored. Here, we comprehensively investigated association between and TIME adenocarcinoma (LUAD) squamous cell carcinoma (LUSC). First, found that host transcriptome profile significantly differed LUAD LUSC. Moreover, there were strong associations abundance microbes expression genes both...

10.1152/physiolgenomics.00204.2024 article EN cc-by Physiological Genomics 2025-02-28

An increasing body of research indicates that the circulating microbiome plays a significant role in cancer initiation and progression treatment response. The genomic characteristics microorganisms may influence tumor immune microenvironment, thereby affecting therapeutic outcomes. However, whether can serve as prognostic biomarker for cervical patients its mechanistic microenvironment still requires further investigation. Univariate, Lasso, multivariate Cox regression analyses were utilized...

10.3390/ijms26094293 article EN International Journal of Molecular Sciences 2025-05-01

Abstract A (two‐dimensional) 2D graphical representation of protein sequences based on six physicochemical properties amino acids is outlined. The numerical characterization graphs given as descriptors sequences. It not only useful for comparative study proteins but also encoding innate information about the structure proteins. coefficient determination proposed a new similarity/dissimilarity measure. Finally, simple example taken to highlight behavior measure from ND6 (NADH dehydrogenase...

10.1002/jcc.21391 article EN Journal of Computational Chemistry 2009-09-23

Abstract Background Small non-coding RNAs (ncRNAs) are important regulators of gene expression in eukaryotes. Previously, only microRNAs (miRNAs) and piRNAs have been identified the silkworm, Bombyx mori . Furthermore, ncRNAs (50-500nt) intermediate size systematically silkworm. Results Here, we performed a systematic identification analysis small (18-50nt) associated with argonaute2 (BmAgo2) protein. Using RIP-seq, various types BmAGO2. These showed multimodal length distribution, three...

10.1186/1471-2164-14-661 article EN cc-by BMC Genomics 2013-09-28

Low-rank matrix completion has been demonstrated to be powerful in predicting antigenic distances among influenza viruses and vaccines from partially revealed hemagglutination inhibition table. Meanwhile, hemagglutinin (HA) protein sequences are also effective inferring distances. Thus, it is natural integrate HA sequence information into low-rank model help infer antigenicity, which critical vaccine development.We have proposed a novel algorithm called biological with side (BMCSI), first...

10.1093/bioinformatics/btx390 article EN Bioinformatics 2017-06-13

Residue-residue contact prediction has become an increasingly important tool for modeling the three-dimensional structure of a protein when no homologous is available. Ultradeep residual neural network (ResNet) most popular method making predictions because it captures contextual information between residues. In this paper, we propose novel deep framework which combines ResNet and DenseNet. This uses 1D to process sequential features, besides PSSM, SS3, solvent accessibility, have introduced...

10.1155/2020/7584968 article EN cc-by BioMed Research International 2020-04-06

Automatic detection of pulmonary nodules is great significance for early diagnosis and prevention lung cancer. Computed tomography (CT) an effective economical method. In CT images, the size shape are different, some very similar to surrounding tissues. Therefore, automatic localization in images a challenging task. this study, attention embedded three-dimensional convolutional neural network proposed nodule detection. Specifically, 1) channel-spatial guides 3D ResNet down sample input...

10.1109/access.2022.3182104 article EN cc-by IEEE Access 2022-01-01

Abstract Deep learning technology is changing the landscape of cybersecurity research, especially study large amounts data. With rapid growth in number malware, developing an efficient and reliable method for classifying malware has become one research priorities. In this paper, a new method, BIR-CNN, proposed to classify Android malware. It combines convolution neural network (CNN) with batch normalization inception-residual (BIR) modules by using 347-dim traffic features. CNN layer that...

10.1038/s41598-022-18402-6 article EN cc-by Scientific Reports 2022-08-17

Based on the chaos game representation, a 2D graphical representation of protein sequences was introduced in which 20 amino acids are rearranged cyclic order according to their physicochemical properties. The Euclidean distances between corresponding from 2-D representations computed find matching (or conserved) fragments two proteins. Again, cumulative distance 2D-graphical is defined compare similarity protein. And, examination among ND5 proteins nine species shows utility our approach.

10.1002/jcc.21501 article EN Journal of Computational Chemistry 2010-03-11

10.1016/j.physa.2011.08.015 article EN Physica A Statistical Mechanics and its Applications 2011-08-27

Although domesticated tomato is cultivated by wild tomato, there are a lot of differences between and such as shape, physiological function life history. Many studies show that has better salt resistance drought resistance. In addition to, tomato's fruit bigger more nutritious than tomato. The different features closely related to differentially expressed genes. We identified 126 up-regulated genes 87 down-regulated in RNA-Seq. These may be associated with resistance, nutrition. also further...

10.1371/journal.pone.0172411 article EN cc-by PLoS ONE 2017-03-09

Enhancers are short DNA segments that play a key role in biological processes, such as accelerating transcription of target genes. Since the enhancer resides anywhere genome sequence, it is difficult to precisely identify enhancers. We presented bi-directional long-short term memory (Bi-LSTM) and attention-based deep learning method (Enhancer-LSTMAtt) for recognition. Enhancer-LSTMAtt an end-to-end model consists mainly residual neural network, Bi-LSTM, feed-forward attention. extensively...

10.3390/biom12070995 article EN cc-by Biomolecules 2022-07-17

Based on the concepts of cell and system graphical representation, a class 2D representations RNA secondary structures are given in terms classifications bases nucleic acids. The can completely avoid loss information associated with crossing overlapping corresponding curve. As an application, we make quantitative comparisons for set at 3'-terminus different viruses based representations. examination similarities/dissimilarities illustrates utility approach.

10.1002/jcc.20271 article EN Journal of Computational Chemistry 2005-07-14

Complexes formed by proteins binding to RNAs are essential in biological processes, and can also be useful for identifying causal disease variants, gene expression regulation translation. Protein-RNA interactions identified vivo affected experimental condition, noise, some bias, while vitro experiments yield clearer signals. Therefore, accurately inferring RNA-protein models from data, predict bound unbound RNA transcripts vivo, has become a key challenge. We constructed RDense, novel deep...

10.1109/access.2019.2961260 article EN cc-by IEEE Access 2019-12-24
Coming Soon ...