Chuming Chen

ORCID: 0000-0002-7287-9013
Research Areas
  • Bioinformatics and Genomic Networks
  • Genomics and Phylogenetic Studies
  • Advanced Proteomics Techniques and Applications
  • Biomedical Text Mining and Ontologies
  • COVID-19 Clinical Research Studies
  • Long-Term Effects of COVID-19
  • Genetic and phenotypic traits in livestock
  • Genetic Mapping and Diversity in Plants and Animals
  • Machine Learning in Bioinformatics
  • Semantic Web and Ontologies
  • Animal Genetics and Reproduction
  • COVID-19 diagnosis using AI
  • Hepatitis B Virus Studies
  • Genomics and Rare Diseases
  • Hepatitis C virus research
  • Hydrocarbon exploration and reservoir analysis
  • RNA and protein synthesis mechanisms
  • Liver Disease Diagnosis and Treatment
  • Computational Drug Discovery Methods
  • HIV, Drug Use, Sexual Risk
  • Gene expression and cancer classification
  • Geological and Geophysical Studies
  • Scientific Computing and Data Management
  • Opioid Use Disorder Treatment
  • Plant Virus Research Studies

University of Delaware
2015-2024

Shenzhen Third People’s Hospital
2017-2024

Southern University of Science and Technology
2019-2024

European Bioinformatics Institute
2023-2024

SIB Swiss Institute of Bioinformatics
2024

Hainan General Hospital
2022-2023

Hainan Medical University
2022-2023

Machine Science
2013-2021

South China University of Technology
2021

University of Macau
2019-2021

When compared to Sanger sequencing technology, next-generation sequencing (NGS) technologies are hindered by shorter sequence read length, higher base-call error rate, non-uniform coverage, and platform-specific artifacts. These characteristics lower the quality of their downstream analyses, e.g. de novo and reference-based assembly, by introducing artifacts and errors that may contribute to incorrect interpretation of data. Although many tools have been developed for quality control and pre-processing of NGS data, none of them provide...

10.1186/1751-0473-9-8 article EN cc-by Source Code for Biology and Medicine 2014-05-03
Alex Bateman María Martín Sandra Orchard Michele Magrane Aduragbemi S. Adesina Shadab Ahmad Emily Bowler-Barnett Hema Bye‐A‐Jee David C. J. Carpentier Paul Denny Jun Fan Penelope Garmiri Leonardo Jose da Costa Gonzales Abdulrahman Hussein Alexandr Ignatchenko Giuseppe Insana Rizwan Ishtiaq Vishal Joshi Dushyanth Jyothi Swaathi Kandasaamy Antonia Lock Aurélien Luciani Jie Luo Yvonne Lussi Juan Marín Pedro Raposo Daniel L Rice Rafael Silva Santos Elena Speretta James Stephenson Prabhat Totoo Nidhi Tyagi Nadya Urakova Preethi Vasudev Kate Warner Supun Wijerathne C. Yu Rossana Zaru Alan Bridge Lucila Aimo Ghislaine Argoud‐Puy Andrea H Auchincloss Kristian B. Axelsen Parit Bansal Delphine Baratin Teresa M Batista Neto Marie-Claude Blatter Jerven Bolleman Emmanuel Boutet Lionel Breuza Blanca Cabrera Gil Cristina Casals‐Casas Kamal Chikh Echioukh Elisabeth Coudert Beatrice Cuche Edouard de Castro Anne Estreicher Maria Livia Famiglietti Marc Feuermann Elisabeth Gasteiger Pascale Gaudet Sébastien Géhant Vivienne Baillie Gerritsen Arnaud Gos Nadine Gruaz Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Arnaud Kerhornou Philippe Le Mercier Damien Lieberherr Patrick Masson Anne Morgat Salvo Paesano Ivo Pedruzzi Sandrine Pilbout Lucille Pourcel Sylvain Poux Monica Pozzato Manuela Pruess Nicole Redaschi Catherine Rivoire Christian Sigrist Karin Sonesson Shyamala Sundaram Anastasia Sveshnikova Cathy Wu Cecilia Arighi Chuming Chen Hongzhan Huang Kati Laiho Minna Lehväslaiho Peter B. McGarvey Darren A. Natale Karen Ross C R Vinayaka Yuqi Wang Jian Zhang

The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest data and use machine learning techniques. We also encourage community curation...

10.1093/nar/gkae1010 article EN cc-by Nucleic Acids Research 2024-11-18

The UniProt knowledgebase is a public database for protein sequence and function, covering the tree of life with over 220 million entries. Now, the whole community can use a new crowdsourcing annotation system to help scale up curation and receive proper attribution for their biocuration work.

10.1371/journal.pbio.3001464 article EN cc-by PLoS Biology 2021-12-06

The Protein Information Resource (PIR) is an integrated public resource of protein informatics. To facilitate the sensible propagation and standardization of annotation and the systematic detection of errors, PIR has extended its superfamily concept and developed the SuperFamily (PIRSF) classification system. Based on the evolutionary relationships of whole proteins, this system allows annotation of both specific biological and generic biochemical functions. The system adopts a network structure for classification from superfamily to subfamily levels. The family members are...

10.1093/nar/gkh097 article EN Nucleic Acids Research 2003-12-17

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve the coverage of experimentally validated data, a bibliography submission system is...

10.1093/nar/30.1.35 article EN Nucleic Acids Research 2002-01-01

We have developed a new web application for peptide matching using an Apache Lucene-based search engine. The Peptide Match service is designed to quickly retrieve all occurrences of a given query peptide from the UniProt Knowledgebase (UniProtKB) with isoforms. The matched proteins are shown in summary tables with rich annotations, including the matched sequence region(s) and links to corresponding proteins in a number of proteomic/peptide spectral databases. The results are grouped by taxonomy and can be browsed by organism, taxonomic group or taxonomy tree. The service supports...

10.1093/bioinformatics/btt484 article EN Bioinformatics 2013-08-19

The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB...

10.1093/nar/gkw1075 article EN cc-by Nucleic Acids Research 2016-10-25

Background: Thousands of Coronavirus Disease 2019 (COVID-19) patients have been discharged from hospitals. Persistent follow-up studies are required to evaluate the prevalence of post-COVID-19 fibrosis. Methods: This study involves 462 laboratory-confirmed patients with COVID-19 who were admitted to Shenzhen Third People’s Hospital from January 11, 2020 to April 26, 2020. A total of 457 patients underwent thin-section chest CT scans during hospitalization or after discharge to identify pulmonary lesions. A total of 287 patients were followed up 90...

10.1186/s12931-021-01798-6 article EN cc-by Respiratory Research 2021-07-09

The accelerating growth in the number of protein sequences taxes both the computational and manual resources needed to analyze them. One approach to dealing with this problem is to minimize the number of proteins subjected to such analysis in a way that minimizes the loss of information. To this end we have developed a set of Representative Proteomes (RPs), each selected from a Representative Proteome Group (RPG) containing similar proteomes calculated based on co-membership in UniRef50 clusters. A Representative Proteome is the proteome that can best represent all the proteomes in its group in terms of the majority...

10.1371/journal.pone.0018910 article EN cc-by PLoS ONE 2011-04-27

Identifier (ID) mapping establishes links between various biological databases and is an essential first step for molecular data integration and functional annotation. ID mapping allows diverse data on genes and proteins to be combined and mapped to pathways and ontologies. We have developed comprehensive protein-centric ID mapping services providing mappings for 90 IDs derived from databases on genes, proteins, pathways, diseases, structures, protein families, protein interaction, literature, ontologies, etc. The services are widely used and have been regularly updated since...

10.1093/bioinformatics/btr101 article EN Bioinformatics 2011-04-06

Chondrichthyan fishes are a diverse class of gnathostomes that provide a valuable perspective on fundamental characteristics shared by all jawed and limbed vertebrates. Studies of phylogeny, species diversity, population structure, conservation, and physiology are accelerated by genomic, transcriptomic and protein sequence data. These data are widely available for many sarcopterygii (coelacanth, lungfish and tetrapods) and actinopterygii (ray-finned fish including teleosts) taxa, but limited for chondrichthyan fishes. In...

10.12688/f1000research.4996.1 preprint EN cc-by F1000Research 2014-08-12
Alistair MacDougall Vladimir Volynkin Rabie Saidi Diego Poggioli Hermann Zellner Emma Hatton-Ellis Vishal Joshi Claire O’Donovan Sandra Orchard Andrea H Auchincloss Delphine Baratin Jerven Bolleman Elisabeth Coudert Leyla Jael Castro Chantal Hulo Patrick Masson Ivo Pedruzzi Catherine Rivoire Cecilia Arighi Qinghua Wang Chuming Chen Hongzhan Huang John S. Garavelli C R Vinayaka Lai-Su Yeh Darren A. Natale Kati Laiho María Martín Alexandre Renaux Klemens Pichler Alex Bateman Alan Bridge Cathy Wu Cecilia Arighi Lionel Breuza Elisabeth Coudert Hongzhan Huang Damien Lieberherr Michele Magrane María Martín Peter B. McGarvey Darren A. Natale Sandra Orchard Ivo Pedruzzi Sylvain Poux Manuela Pruess Shriya Raj Nicole Redaschi Lucila Aimo Ghislaine Argoud‐Puy Andrea H Auchincloss Kristian B. Axelsen Emmanuel Boutet Emily Bowler-Barnett Ramona Britto Hema Bye‐A‐Jee Cristina Casals‐Casas Paul Denny Anne Estreicher Maria Livia Famiglietti Marc Feuermann John S. Garavelli Penelope Garmiri Arnaud Gos Nadine Gruaz Emma Hatton-Ellis Chantal Hulo Nevila Hyka‐Nouspikel Florence Jungo Kati Laiho Philippe Le Mercier Antonia Lock Yvonne Lussi Alistair MacDougall Patrick Masson Anne Morgat Sandrine Pilbout Lucille Pourcel Catherine Rivoire Karen Ross Christian Sigrist Elena Speretta Shyamala Sundaram Nidhi Tyagi C R Vinayaka Qinghua Wang Kate Warner Lai-Su Yeh Rossana Zaru Shadab Ahmed Emanuele Alpi Leslie Arminski Parit Bansal Delphine Baratin Teresa Batista Neto Jerven Bolleman Chuming Chen Beatrice Cuche Austra Cukura

Motivation: The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result of genome sequencing and the prediction of protein-coding genes. Providing functional annotation for these proteins presents a significant and continuing challenge. Results: In response to this challenge, UniProt has developed a method of annotation, known as UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) developed by members...

10.1093/bioinformatics/btaa485 article EN cc-by Bioinformatics 2020-05-05

The Kuqa depression along the northern flank of the Tarim basin is filled with a thick sequence of Neogene and Quaternary coarse clastic continental sediments. This structural depression is part of a large foreland basin that lies south of the Tianshan, an orogenic belt of intracontinental convergence resulting from the northward propagation of stress following the collision of India with the southern margin of Eurasia.

10.1080/00206819409465509 article EN International Geology Review 1994-12-01

Iron–sulfur (Fe-S) clusters are ancient enzyme cofactors found in virtually all life forms. We evaluated the physiological effects of chronic Fe-S cluster deficiency in human skeletal muscle, a tissue that relies heavily on Fe-S cluster-mediated aerobic energy metabolism. Despite greatly decreased oxidative capacity, muscle from patients deficient in the Fe-S cluster scaffold protein ISCU showed a predominance of type I fibers and higher capillary density, enhanced expression of the transcriptional co-activator PGC-1α and increased...

10.1093/hmg/ddt393 article EN Human Molecular Genetics 2013-08-13

Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote the advancement of bioinformatics infrastructure in our region, with an emphasis on...

10.1093/database/bar064 article EN Database 2012-03-20

Genetic selection for enhanced growth rate in meat-type chickens (Gallus domesticus) is usually accompanied by excessive adiposity, which has negative impacts on both feed efficiency and carcass quality. Enhanced visceral fatness and several unique features of avian metabolism (i.e., fasting hyperglycemia and insulin insensitivity) mimic overt symptoms of obesity and related metabolic disorders in humans. Elucidation of the genetic and endocrine factors that contribute to visceral fatness in chickens could also advance our understanding of human...

10.1371/journal.pone.0139549 article EN cc-by PLoS ONE 2015-10-07

SLC10A1 codes for the sodium-taurocholate cotransporting polypeptide (NTCP), which is a hepatocellular transporter for bile acids (BAs) and the receptor for hepatitis B and D viruses. NTCP is also a target of multiple drugs. We aimed to evaluate the medical consequences of the loss-of-function mutation p.Ser267Phe in SLC10A1. We identified eight individuals with the homozygous p.Ser267Phe mutation and followed them up for 8-90 months. We compared their total serum BAs and 6 BA species with those of 170 wild-type and 107 heterozygous healthy individuals. We performed in-depth examinations and exome...

10.1038/s41598-017-07012-2 article EN cc-by Scientific Reports 2017-08-17

Nicotine is one of the primary components in cigarettes and is responsible for addiction. Numerous studies have investigated the effects of nicotine on pulmonary disease. The health of epithelial cells is important in the development of chronic obstructive pulmonary disease (COPD). Accumulating evidence has suggested that epithelial cell death may initiate or contribute to the progression of a number of lung diseases via airway remodeling. Pyroptosis is a unique form of inflammatory cell death mediated by the activation of caspase‑1 and the NOD‑like receptor protein‑3...

10.3892/mmr.2022.12608 article EN cc-by-nc-nd Molecular Medicine Reports 2022-01-18

Although hatching is perhaps the most abrupt and profound metabolic challenge that a chicken must undergo, there have been no attempts to functionally map the metabolic pathways induced in the liver during the embryo-to-hatchling transition. Furthermore, we know very little about the regulatory factors that regulate lipid metabolism in late embryos or newly-hatched chicks. In the present study, we examined hepatic transcriptomes of 12 hatchling chicks during the peri-hatch period, or the switch from chorioallantoic to pulmonary respiration. Initial...

10.1186/s12864-018-5080-4 article EN cc-by BMC Genomics 2018-09-21

Coffee leaf rust caused by the fungus Hemileia vastatrix is one of the most important diseases of coffee plantations worldwide. Current knowledge of the H. vastatrix genome is limited and only a small fraction of the total fungal secretome has been identified. In order to obtain a more comprehensive understanding of its secretome, we aimed to sequence and assemble the entire genome using two next-generation sequencing platforms and a hybrid assembly strategy. This resulted in a 547 Mb genome of H. vastatrix race XXXIII (Hv33), with 13,364 predicted genes that encode 13,034...

10.1371/journal.pone.0215598 article EN public-domain PLoS ONE 2019-04-18

Summary: The global response to the COVID-19 pandemic has led to a rapid increase of scientific literature on this deadly disease. Extracting knowledge from the biomedical literature and integrating it with relevant information from curated biological databases is essential to gain insight into COVID-19 etiology, diagnosis and treatment. We used the Semantic Web technology RDF to integrate knowledge mined by iTextMine, PubTator and SemRep and formalized it in a standardized and computable Knowledge Graph (KG). We published the KG via a SPARQL endpoint to support...

10.1093/bioinformatics/btab694 article EN cc-by Bioinformatics 2021-10-06

Among 417 COVID-19 patients in Shenzhen, demographic characteristics, clinical manifestations and baseline laboratory tests showed significant differences between the mild-moderate cohort and the severe-critical cohort. Based on these differences, a convenient mathematical model was established to predict the illness severity of COVID-19. The model includes four parameters: age, BMI, CD4+ lymphocytes and IL-6 levels. The AUC of the model is 0.911. The high risk factors for developing severe COVID-19 are: age ≥ 55 years, BMI > 27 kg/m2, 20...

10.1016/j.xinn.2020.04.007 article EN cc-by-nc-nd The Innovation 2020-05-01

Late recurrences of breast cancer are hypothesized to originate from disseminated tumor cells that re‐activate after a long period of dormancy, ≥5 years for estrogen‐receptor positive (ER+) tumors. An outstanding question remains as to what the key microenvironment interactions are that regulate this complex process, and well‐defined human model systems are needed for probing this. Here, a robust, bioinspired 3D ER+ dormancy culture is established and utilized to probe the effects of matrix properties for common sites of late...

10.1002/adbi.202000119 article EN Advanced Biosystems 2020-06-30