NFDI4DS | UHH-SEMS - Publication Details

Chuming Chen

ORCID: 0000-0002-7287-9013

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5004336328

Research Areas

Bioinformatics and Genomic Networks
Genomics and Phylogenetic Studies
Advanced Proteomics Techniques and Applications
Biomedical Text Mining and Ontologies
COVID-19 Clinical Research Studies
Long-Term Effects of COVID-19
Genetic and phenotypic traits in livestock
Genetic Mapping and Diversity in Plants and Animals
Machine Learning in Bioinformatics
Semantic Web and Ontologies
Animal Genetics and Reproduction
COVID-19 diagnosis using AI
Hepatitis B Virus Studies
Genomics and Rare Diseases
Hepatitis C virus research
Hydrocarbon exploration and reservoir analysis
RNA and protein synthesis mechanisms
Liver Disease Diagnosis and Treatment
Computational Drug Discovery Methods
HIV, Drug Use, Sexual Risk
Gene expression and cancer classification
Geological and Geophysical Studies
Scientific Computing and Data Management
Opioid Use Disorder Treatment
Plant Virus Research Studies

University of Delaware
2015-2024

Shenzhen Third People’s Hospital
2017-2024

Southern University of Science and Technology
2019-2024

European Bioinformatics Institute
2023-2024

SIB Swiss Institute of Bioinformatics
2024

Hainan General Hospital
2022-2023

Hainan Medical University
2022-2023

Machine Science
2013-2021

South China University of Technology
2021

University of Macau
2019-2021

Software for pre-processing Illumina next-generation sequencing short read sequences

OPENALEX - Publications

Chuming Chen Sari Khaleel Hongzhan Huang Cathy Wu

When compared to Sanger sequencing technology, next-generation (NGS) technologies are hindered by shorter sequence read length, higher base-call error rate, non-uniform coverage, and platform-specific artifacts. These characteristics lower the quality of their downstream analyses, e.g. de novo reference-based assembly, introducing artifacts errors that may contribute incorrect interpretation data. Although many tools have been developed for control pre-processing NGS data, none them provide...

10.1186/1751-0473-9-8 article EN cc-by Source Code for Biology and Medicine 2014-05-03

UniProt: the Universal Protein Knowledgebase in 2025

OPENALEX - Publications

Alex Bateman María Martín Sandra Orchard Michele Magrane Aduragbemi S. Adesina and 94 more

The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set protein sequences annotated functional information. In this publication, we describe ongoing changes our production pipeline limit available in UniProtKB high-quality, non-redundant reference proteomes. We continue manually curate scientific literature add latest data use machine learning techniques. also encourage community curation...

10.1093/nar/gkae1010 article EN cc-by Nucleic Acids Research 2024-11-18

A crowdsourcing open platform for literature curation in UniProt

OPENALEX - Publications

Yuqi Wang Qinghua Wang Hongzhan Huang Wei Huang Chuming Chen and 3 more

The UniProt knowledgebase is a public database for protein sequence and function, covering the tree of life over 220 million entries. Now, whole community can use new crowdsourcing annotation system to help scale up curation receive proper attribution their biocuration work.

10.1371/journal.pbio.3001464 article EN cc-by PLoS Biology 2021-12-06

Closing history of the southern Tianshan oceanic basin, western China: an oblique collisional orogeny

OPENALEX - Publications

Chuming Chen Huafu Lu Dong Jia Dongsheng Cai Shimin Wu

10.1016/s0040-1951(98)00273-x article EN Tectonophysics 1999-02-01

PIRSF: family classification system at the Protein Information Resource

OPENALEX - Publications

Cathy Wu A. N. NIKOL'SKAYA Hongzhan Huang Lai-Su Yeh Darren A. Natale and 15 more

The Protein Information Resource (PIR) is an integrated public resource of protein informatics. To facilitate the sensible propagation and standardization annotation systematic detection errors, PIR has extended its superfamily concept developed SuperFamily (PIRSF) classification system. Based on evolutionary relationships whole proteins, this system allows both specific biological generic biochemical functions. adopts a network structure for from to subfamily levels. family members are...

10.1093/nar/gkh097 article EN Nucleic Acids Research 2003-12-17

The Protein Information Resource: an integrated public resource of functional annotation of proteins

OPENALEX - Publications

Cathy Wu Hongzhan Huang Leslie Arminski Jorge Castro-Alvear Chuming Chen and 11 more

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation protein data to support genomic/proteomic research and scientific discovery. PIR, in collaboration with the Munich Center for Sequences (MIPS) Japan International Database (JIPID), produces PIR-International Sequence (PSD), major annotated sequence database domain, containing about 250 000 proteins. To improve coverage experimentally validated data, a bibliography submission system is...

10.1093/nar/30.1.35 article EN Nucleic Acids Research 2002-01-01

A fast Peptide Match service for UniProt Knowledgebase

OPENALEX - Publications

Chuming Chen Zhiwen Li Hongzhan Huang Barış Ethem Süzek Cathy Wu

We have developed a new web application for peptide matching using Apache Lucene-based search engine. The Peptide Match service is designed to quickly retrieve all occurrences of given query from UniProt Knowledgebase (UniProtKB) with isoforms. matched proteins are shown in summary tables rich annotations, including sequence region(s) and links corresponding number proteomic/peptide spectral databases. results grouped by taxonomy can be browsed organism, taxonomic group or tree. supports...

10.1093/bioinformatics/btt484 article EN Bioinformatics 2013-08-19

Protein Ontology (PRO): enhancing and scaling up the representation of protein entities

OPENALEX - Publications

Darren A. Natale Cecilia Arighi Judith A. Blake Jonathan P. Bona Chuming Chen and 17 more

The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific taxon-neutral protein-related entities in three major areas: proteins related by evolution; produced from a given gene; protein-containing complexes. PRO thus serves as tool for referencing protein at any level of specificity. To enhance this ability, to facilitate the comparison such described different resources, we developed standardized representation proteoforms using UniProtKB...

10.1093/nar/gkw1075 article EN cc-by Nucleic Acids Research 2016-10-25

Pulmonary fibrosis and its related factors in discharged patients with new corona virus pneumonia: a cohort study

OPENALEX - Publications

Xiaohe Li Chenguang Shen Lifei Wang Sumit Majumder Die Zhang and 19 more

Abstract Background Thousands of Coronavirus Disease 2019 (COVID-19) patients have been discharged from hospitals Persistent follow-up studies are required to evaluate the prevalence post-COVID-19 fibrosis. Methods This study involves 462 laboratory-confirmed with COVID-19 who were admitted Shenzhen Third People’s Hospital January 11, 2020 April 26, 2020. A total 457 underwent thin-section chest CT scans during hospitalization or after discharge identify pulmonary lesion. 287 followed up 90...

10.1186/s12931-021-01798-6 article EN cc-by Respiratory Research 2021-07-09

Representative Proteomes: A Stable, Scalable and Unbiased Proteome Set for Sequence Analysis and Functional Annotation

OPENALEX - Publications

Chuming Chen Darren A. Natale ROBERT FINN Hongzhan Huang Jian Zhang and 2 more

The accelerating growth in the number of protein sequences taxes both computational and manual resources needed to analyze them. One approach dealing with this problem is minimize proteins subjected such analysis a way that minimizes loss information. To end we have developed set Representative Proteomes (RPs), each selected from Proteome Group (RPG) containing similar proteomes calculated based on co-membership UniRef50 clusters. A proteome can best represent all its group terms majority...

10.1371/journal.pone.0018910 article EN cc-by PLoS ONE 2011-04-27

A comprehensive protein-centric ID mapping service for molecular data integration

OPENALEX - Publications

Hongzhan Huang Peter B. McGarvey Barış Ethem Süzek Raja Mazumder Jian Zhang and 2 more

Identifier (ID) mapping establishes links between various biological databases and is an essential first step for molecular data integration functional annotation. ID allows diverse on genes proteins to be combined mapped pathways ontologies. We have developed comprehensive protein-centric services providing mappings 90 IDs derived from genes, proteins, pathways, diseases, structures, protein families, interaction, literature, ontologies, etc. The are widely used been regularly updated since...

10.1093/bioinformatics/btr101 article EN Bioinformatics 2011-04-06

SkateBase, an elasmobranch genome project and collection of molecular resources for chondrichthyan fishes

OPENALEX - Publications

Jennifer T. Wyffels Benjamin L. King James J. Vincent Chuming Chen Cathy Wu and 1 more

<ns4:p>Chondrichthyan fishes are a diverse class of gnathostomes that provide valuable perspective on fundamental characteristics shared by all jawed and limbed vertebrates. Studies phylogeny, species diversity, population structure, conservation, physiology accelerated genomic, transcriptomic protein sequence data. These data widely available for many sarcopterygii (coelacanth, lungfish tetrapods) actinoptergii (ray-finned fish including teleosts) taxa, but limited chondrichthyan fishes. In...

10.12688/f1000research.4996.1 preprint EN cc-by F1000Research 2014-08-12

UniRule: a unified rule resource for automatic annotation in the UniProt Knowledgebase

OPENALEX - Publications

Alistair MacDougall Vladimir Volynkin Rabie Saidi Diego Poggioli Hermann Zellner and 95 more

Abstract Motivation The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result genome sequencing and prediction protein-coding genes. Providing functional annotation for these proteins presents significant continuing challenge. Results In response this challenge, has developed method annotation, known UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) by members...

10.1093/bioinformatics/btaa485 article EN cc-by Bioinformatics 2020-05-05

Rejuvenation of the Kuqa Foreland Basin, Northern Flank of the Tarim Basin, Northwest China

OPENALEX - Publications

Lu Huafu David G. Howell Dong Jia Dongsheng Cai Shimin Wu and 3 more

The Kuqa depression along the northern flank of Tarim basin is filled with a thick sequence Neogene and Quaternary coarse elastic continental sediments. This structural part large foreland that lies south Tianshan—an orogenic belt intracontinental convergence resulting from northward propagation stress following collision India southern margin Eurasia.

10.1080/00206819409465509 article EN International Geology Review 1994-12-01

Elevated FGF21 secretion, PGC-1α and ketogenic enzyme expression are hallmarks of iron–sulfur cluster depletion in human skeletal muscle

OPENALEX - Publications

Daniel R. Crooks Thanemozhi G. Natarajan Suh Young Jeong Chuming Chen Sun Young Park and 6 more

Iron–sulfur (Fe-S) clusters are ancient enzyme cofactors found in virtually all life forms. We evaluated the physiological effects of chronic Fe-S cluster deficiency human skeletal muscle, a tissue that relies heavily on cluster-mediated aerobic energy metabolism. Despite greatly decreased oxidative capacity, muscle from patients deficient scaffold protein ISCU showed predominance type I fibers and higher capillary density, enhanced expression transcriptional co-activator PGC-1α increased...

10.1093/hmg/ddt393 article EN Human Molecular Genetics 2013-08-13

Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees

OPENALEX - Publications

Qi Wang Cecilia Arighi Benjamin L. King Shawn W. Polson James J. Vincent and 8 more

Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood sequence data has made it critically important to train the next generation scientists handle inherent bioinformatic challenges. North East Bioinformatics Collaborative (NEBC) is undertaking genome and annotation little skate (Leucoraja erinacea) promote advancement bioinformatics infrastructure our region, an emphasis on...

10.1093/database/bar064 article EN Database 2012-03-20

RNA-Seq Analysis of Abdominal Fat in Genetically Fat and Lean Chickens Highlights a Divergence in Expression of Genes Controlling Adiposity, Hemostasis, and Lipid Metabolism

OPENALEX - Publications

C. Resnyk Chuming Chen Hongzhan Huang Cathy Wu Jean Simon and 3 more

Genetic selection for enhanced growth rate in meat-type chickens (Gallus domesticus) is usually accompanied by excessive adiposity, which has negative impacts on both feed efficiency and carcass quality. Enhanced visceral fatness several unique features of avian metabolism (i.e., fasting hyperglycemia insulin insensitivity) mimic overt symptoms obesity related metabolic disorders humans. Elucidation the genetic endocrine factors that contribute to could also advance our understanding human...

10.1371/journal.pone.0139549 article EN cc-by PLoS ONE 2015-10-07

Homozygous p.Ser267Phe in SLC10A1 is associated with a new type of hypercholanemia and implications for personalized medicine

OPENALEX - Publications

Ruihong Liu Chuming Chen Xuefeng Xia Qijun Liao Qiong Wang and 22 more

SLC10A1 codes for the sodium-taurocholate cotransporting polypeptide (NTCP), which is a hepatocellular transporter bile acids (BAs) and receptor hepatitis B D viruses. NTCP also target of multiple drugs. We aimed to evaluate medical consequences loss function mutation p.Ser267Phe in SLC10A1. identified eight individuals with homozygous followed up 8-90 months. compared their total serum BAs 6 species 170 wild-type 107 heterozygous healthy individuals. performed in-depth examinations exome...

10.1038/s41598-017-07012-2 article EN cc-by Scientific Reports 2017-08-17

Nicotine promotes chronic obstructive pulmonary disease via inducing pyroptosis activation in bronchial epithelial cells

OPENALEX - Publications

Rubing Mo Jun Zhang Chuming Chen Yipeng Ding

Nicotine is one of the primary components in cigarettes, which responsible for addiction. Numerous studies have investigated effects nicotine on pulmonary disease. The health epithelial cells important development chronic obstructive disease (COPD). Accumulating evidence has suggested that cell death may initiate or contribute to progression a number lung diseases via airway remodeling. Pyroptosis unique form inflammatory mediated by activation caspase‑1 and NOD‑like receptor protein‑3...

10.3892/mmr.2022.12608 article EN cc-by-nc-nd Molecular Medicine Reports 2022-01-18

Transcriptional profiling of liver during the critical embryo-to-hatchling transition period in the chicken (Gallus gallus)

OPENALEX - Publications

Larry A. Cogburn Nares Trakooljul Chuming Chen Hongzhan Huang Cathy Wu and 3 more

Although hatching is perhaps the most abrupt and profound metabolic challenge that a chicken must undergo; there have been no attempts to functionally map pathways induced in liver during embryo-to-hatchling transition. Furthermore, we know very little about regulatory factors regulate lipid metabolism late embryos or newly-hatched chicks. In present study, examined hepatic transcriptomes of 12 hatchling chicks peri-hatch period-or switch from chorioallantoic pulmonary respiration.Initial...

10.1186/s12864-018-5080-4 article EN cc-by BMC Genomics 2018-09-21

Genome sequencing and transcript analysis of Hemileia vastatrix reveal expression dynamics of candidate effectors dependent on host compatibility

OPENALEX - Publications

Brenda Neves Porto Eveline Teixeira Caixeta Sandra M. Mathioni Pedro Marcus Pereira Vidigal Laércio Zambolim and 9 more

Coffee leaf rust caused by the fungus Hemileia vastatrix is one of most important diseases coffee plantations worldwide. Current knowledge H. genome limited and only a small fraction total fungal secretome has been identified. In order to obtain more comprehensive understanding its secretome, we aimed sequence assemble entire using two next-generation sequencing platforms hybrid assembly strategy. This resulted in 547 Mb race XXXIII (Hv33), with 13,364 predicted genes that encode 13,034...

10.1371/journal.pone.0215598 article EN public-domain PLoS ONE 2019-04-18

COVID-19 Knowledge Graph from semantic integration of biomedical literature and databases

OPENALEX - Publications

Chuming Chen Karen Ross Sachin Gavali Julie Cowart Cathy Wu

Abstract Summary The global response to the COVID-19 pandemic has led a rapid increase of scientific literature on this deadly disease. Extracting knowledge from biomedical and integrating it with relevant information curated biological databases is essential gain insight into etiology, diagnosis treatment. We used Semantic Web technology RDF integrate mined by iTextMine, PubTator SemRep formalized in standardized computable Knowledge Graph (KG). published KG via SPARQL endpoint support...

10.1093/bioinformatics/btab694 article EN cc-by Bioinformatics 2021-10-06

Predicting Illness Severity and Short-Term Outcomes of COVID-19: A Retrospective Cohort Study in China

OPENALEX - Publications

Chuming Chen Haihui Wang Zhichao Liang Ling Peng Fang Zhao and 14 more

Among 417 COVID-19 patients in Shenzhen, demographic characteristics, clinical manifestations and baseline laboratory tests showed significant differences between mild-moderate cohort severe-critical cohort.Based on these differences, a convenient mathematical model was established to predict the illness severity of COVID-19. The includes four parameters: age, BMI, CD4+ lymphocytes IL-6 levels. AUC is 0.911.The high risk factors for developing severe are: age ≥ 55 years, BMI > 27 kg / m2, 20...

10.1016/j.xinn.2020.04.007 article EN cc-by-nc-nd The Innovation 2020-05-01

Understanding ER+ Breast Cancer Dormancy Using Bioinspired Synthetic Matrices for Long‐Term 3D Culture and Insights into Late Recurrence

OPENALEX - Publications

Elisa M. Ovadia Lina Pradhan Lisa A. Sawicki Julie Cowart Rebecca E. Huber and 6 more

Abstract Late recurrences of breast cancer are hypothesized to originate from disseminated tumor cells that re‐activate after a long period dormancy, ≥5 years for estrogen‐receptor positive (ER+) tumors. An outstanding question remains as what the key microenvironment interactions regulate this complex process, and well‐defined human model systems needed probing this. Here, robust, bioinspired 3D ER+ dormancy culture is established utilized probe effects matrix properties common sites late...

10.1002/adbi.202000119 article EN Advanced Biosystems 2020-06-30

Coming Soon ...