- Cancer Genomics and Diagnostics
- Radiomics and Machine Learning in Medical Imaging
- Cancer-related molecular mechanisms research
- Genomics and Phylogenetic Studies
- Cancer Diagnosis and Treatment
- Computational Drug Discovery Methods
- Genetic factors in colorectal cancer
- Bioinformatics and Genomic Networks
- Single-cell and spatial transcriptomics
- Gut microbiota and health
- RNA modifications and cancer
- Lung Cancer Treatments and Mutations
- MicroRNA in disease regulation
- AI in cancer detection
- SARS-CoV-2 and COVID-19 Research
- RNA and protein synthesis mechanisms
- Machine Learning in Bioinformatics
- vaccines and immunoinformatics approaches
- Tumors and Oncological Cases
- Cancer Immunotherapy and Biomarkers
- Chromosomal and Genetic Variations
- Molecular Biology Techniques and Applications
- Gene expression and cancer classification
- Oral and Maxillofacial Pathology
- COVID-19 diagnosis using AI
Cipher Gene (China)
2018-2025
Binzhou University
2018-2025
Binzhou Medical University
2018-2025
Second Affiliated Hospital of Jilin University
2025
Sinopec (China)
2024
Research Institute of Petroleum Exploration and Development
2024
Beihang University
2024
Shanghai Electric (China)
2024
Beike Biotechnology (China)
2023
Shenzhen Second People's Hospital
2009-2022
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The contigs (2.25 gigabases (Gb)) cover approximately 94% whole genome, remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats tandem repeats. Comparisons with dog human showed that genome has lower divergence rate. assessment genes potentially underlying some its unique traits indicated bamboo diet might be more dependent on gut microbiome...
Here we present the first diploid genome sequence of an Asian individual. The was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned short reads onto NCBI human reference 99.97% coverage, and guided by genome, used uniquely mapped assemble a high-quality consensus for 92% individual’s genome. identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, which 13.6% were not in dbSNP database. Genotyping analysis...
High-throughput sequencing technology enables population-level surveys of human genomic variation. Here, we examine the joint allele frequency distributions across continental populations and present an approach for combining complementary aspects whole-genome, low-coverage data targeted high-coverage data. We apply this to generated by pilot phase Thousand Genomes Project, including whole-genome 2–4× coverage 179 samples from HapMap European, Asian, African panels as well target exons 800...
Particulate matter (PM) air pollution poses a formidable public health threat to the city of Beijing. Among various hazards PM pollutants, microorganisms in PM2.5 and PM10 are thought be responsible for allergies spread respiratory diseases. While physical chemical properties pollutants have been extensively studied, much less is known about inhalable microorganisms. Most existing data on airborne microbial communities using 16S or 18S rRNA gene sequencing categorize bacteria fungi into...
Worldwide, myopia is the leading cause of visual impairment. It results from inappropriate extension ocular axis and concomitant declines in scleral strength thickness caused by extracellular matrix (ECM) remodeling. However, identities initiators signaling pathways that induce ECM remodeling are unknown. Here, we used single-cell RNA-sequencing to identify activated sclera during development. We found hypoxia-signaling, eIF2-signaling, mTOR-signaling were murine myopic sclera. Consistent...
DNA methylation plays an important role in biological processes human health and disease. Recent technological advances allow unbiased whole-genome (methylome) analysis to be carried out on cells. Using bisulfite sequencing at 24.7-fold coverage (12.3-fold per strand), we report a comprehensive (92.62%) methylome of the unique sequences peripheral blood mononuclear cells (PBMC) from same Asian individual whose genome was deciphered YH project. PBMC constitute source for clinical tests...
HER2-positive breast cancer is a highly heterogeneous tumor, and about 30% of patients still suffer from recurrence metastasis after trastuzumab targeted therapy. Predicting individual prognosis great significance for the further development precise With continuous computer technology, more attention has been paid to computer-aided diagnosis prediction based on Hematoxylin Eosin (H&E) pathological images, which are available all undergone surgical treatment. In this study, we first enrolled...
Estimation of allele frequency is fundamental importance in population genetic analyses and association mapping. In most studies using next-generation sequencing, a cost effective approach to use medium or low-coverage data (e.g., < 15X). However, SNP calling estimation such associated with substantial statistical uncertainty because varying coverage high error rates. We evaluate new maximum likelihood method for estimating frequencies low sequencing data. The based on integrating over the...
A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work documented a reduction diversity regions genome with low recombination rates. However, it unclear whether other summaries variation, like allele frequencies, are also correlated rate and these correlations can be explained solely by negative against deleterious mutations or positive acting on favorable alleles required. Here we attempt to address...
Nonsense-mediated mRNA decay (NMD) affects the outcome of alternative splicing by degrading isoforms with premature termination codons. Splicing regulators constitute important NMD targets; however, extent to which loss causes extensive deregulation has not previously been assayed in a global, unbiased manner. Here, we combine mouse genetics and RNA-seq provide first vivo analysis global impact on patterns two primary tissues ablated for factor UPF2.We developed bioinformatic pipeline that...
Carcinoma of unknown primary (CUP) is a type metastatic cancer, the tumor site which cannot be identified. CUP occupies approximately 5% cancer incidences in United States with usually unfavorable prognosis, making it big threat to public health. Traditional methods identify tissue-of-origin (TOO) like immunohistochemistry can only deal around 20% patients. In recent years, more and studies suggest that promising solve problem by integrating machine learning techniques biomedical data...
Metastatic cancers require further diagnosis to determine their primary tumor sites. However, the tissue-of-origin for around 5% tumors could not be identified by routine medical according a statistics in United States. With development of machine learning techniques and accumulation big cancer data from The Cancer Genome Atlas (TCGA) Gene Expression Omnibus (GEO), it is now feasible predict computational tools. inherits characteristics its tissue-of-origin, both gene expression profile...
In this study, we proposed an ensemble learning method, simultaneously integrating a low-rank matrix completion model and ridge regression to predict anticancer drug response on cancer cell lines. The was applied two benchmark datasets, including the Cancer Cell Line Encyclopedia (CCLE) Genomics of Drug Sensitivity in (GDSC). As previous studies suggest, dual-layer integrated line-drug network one best models by far outperformed most state-of-the-art models. Thus, performed head-to-head...
Circulating tumor cells (CTCs) derived from primary tumors and/or metastatic are markers for prognosis, and can also be used to monitor therapeutic efficacy recurrence. CTCs enrichment screening automated, but the final counting of circulating currently requires manual intervention. This not only participation experienced pathologists, easily causes artificial misjudgment. Medical image recognition based on machine learning effectively reduce workload improve level automation. So we use...
Tumor mutational burden (TMB) is an indicator of the efficacy and prognosis immune checkpoint therapy in colorectal cancer (CRC). In general, patients with higher TMB values are more likely to benefit from immunotherapy. Though whole-exome sequencing considered gold standard for determining TMB, it difficult be applied clinical practice due its high cost. There also a few DNA panel-based methods estimate TMB; however, their detection cost high, associated wet-lab experiments usually take...
Abstract Drug repositioning, the strategy of redirecting existing drugs to new therapeutic purposes, is pivotal in accelerating drug discovery. While many studies have engaged modeling complex drug–disease associations, they often overlook relevance between different node embeddings. Consequently, we propose a novel weighted local information augmented graph neural network model, termed DRAGNN, for repositioning. Specifically, DRAGNN firstly incorporates attention mechanism dynamically...
Abstract Long noncoding RNAs (lncRNAs) participate in various biological processes and have close linkages with diseases. In vivo vitro experiments validated many associations between lncRNAs However, are time-consuming expensive. Here, we introduce LDA-VGHB, an lncRNA–disease association (LDA) identification framework, by incorporating feature extraction based on singular value decomposition variational graph autoencoder LDA classification heterogeneous Newton boosting machine. LDA-VGHB was...