NFDI4DS | UHH-SEMS - Publication Details

Aron Marchler‐Bauer

ORCID: 0000-0003-1516-0712

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5026383671

Research Areas

Genomics and Phylogenetic Studies
RNA and protein synthesis mechanisms
Machine Learning in Bioinformatics
Protein Structure and Dynamics
Enzyme Structure and Function
Genetics, Bioinformatics, and Biomedical Research
Bioinformatics and Genomic Networks
Advanced Proteomics Techniques and Applications
Microbial Metabolic Engineering and Bioproduction
Biomedical Text Mining and Ontologies
Molecular Biology Techniques and Applications
Glycosylation and Glycoproteins Research
Gene expression and cancer classification
COVID-19 Clinical Research Studies
Heat shock proteins research
Receptor Mechanisms and Signaling
CRISPR and Genetic Engineering
Bacterial Genetics and Biotechnology
vaccines and immunoinformatics approaches
SARS-CoV-2 and COVID-19 Research
Chemical Synthesis and Analysis
Scientific Computing and Data Management
Manufacturing Process and Optimization
Natural Language Processing Techniques
Vector-borne infectious diseases

National Center for Biotechnology Information
2015-2024

National Institutes of Health
2015-2024

Medical University of Graz
2009-2011

Institut de génétique et de biologie moléculaire et cellulaire
2010

United States National Library of Medicine
1999

Bryant University
1997

Bauer Research Foundation
1997

Universitat de València
1996

University of Vienna
1996

Research Institute of Molecular Pathology
1994

CDD: NCBI's conserved domain database

OPENALEX - Publications

Aron Marchler‐Bauer Myra K. Derbyshire Noreen R. Gonzales Shennan Lu Farideh Chitsaz and 15 more

NCBI's CDD, the Conserved Domain Database, enters its 15th year as a public resource for annotation of proteins with location conserved domain footprints. Going forward, we strive to improve coverage and consistency provided by CDD. We maintain live search system well an archive pre-computed sequences tracked in Entrez protein database, which can be retrieved single or bulk. also import procedures so that CDD contains models definitions several collections available domain, those produced...

10.1093/nar/gku1221 article EN cc-by-nc Nucleic Acids Research 2014-11-20

CDD: a Conserved Domain Database for the functional annotation of proteins

OPENALEX - Publications

Aron Marchler‐Bauer Shuo Lu J. B. Anderson Farideh Chitsaz Myra K. Derbyshire and 22 more

NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with location conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated models that make use 3D structure to refine provide insights into sequence/structure/function relationships. Manually are organized hierarchically if they describe families clearly related by common descent. As also imports family variety external sources, it partially...

10.1093/nar/gkq1189 article EN cc-by-nc Nucleic Acids Research 2010-11-24

CDD/SPARCLE: the conserved domain database in 2020

OPENALEX - Publications

Shennan Lu Jiyao Wang Farideh Chitsaz Myra K. Derbyshire Renata C. Geer and 12 more

Abstract As NLM’s Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, CDD curation staff continues to develop hierarchical classifications widely distributed protein domain families, and record conserved sites associated with molecular function, so that they can be mapped onto user queries in support hypothesis-driven biomolecular research. offers both an archive pre-computed annotations well live search services for single or nucleotide...

10.1093/nar/gkz991 article EN cc-by-nc Nucleic Acids Research 2019-11-13

CDD/SPARCLE: functional classification of proteins via subfamily domain architectures

OPENALEX - Publications

Aron Marchler‐Bauer Bo Yu Lianyi Han Jane He Christopher J. Lanczycki and 17 more

NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive pre-computed annotation is maintained for proteins tracked by Entrez database, live search services are offered as well. CDD curation staff supplements a comprehensive collection family models, which have been imported external providers, representations selected families that...

10.1093/nar/gkw1129 article EN cc-by-nc Nucleic Acids Research 2016-11-29

CD-Search: protein domain annotations on the fly

OPENALEX - Publications

Aron Marchler‐Bauer Stephen H. Bryant

We describe the Conserved Domain Search service (CD-Search), a web-based tool for detection of structural and functional domains in protein sequences. CD-Search uses BLAST® heuristics to provide fast, interactive service, searches comprehensive collection domain models. results are displayed as architecture cartoons pairwise alignments between query domain-model consensus may be visualized further detail by embedding sequence into multiple alignment displays mapping onto three-dimensional...

10.1093/nar/gkh454 article EN Nucleic Acids Research 2004-07-01

Database resources of the national center for biotechnology information

OPENALEX - Publications

Eric W Sayers Evan Bolton J. Rodney Brister Kathi Canese Jessica Chan and 20 more

Abstract The National Center for Biotechnology Information (NCBI) produces a variety of online information resources biology, including the GenBank® nucleic acid sequence database and PubMed® citations abstracts published in life science journals. NCBI provides search retrieval operations most these data from 35 distinct databases. E-utilities serve as programming interface Resources receiving significant updates past year include PubMed, PMC, Bookshelf, RefSeq, SRA, Virus, dbSNP, dbVar,...

10.1093/nar/gkab1112 article EN cc-by-nc Nucleic Acids Research 2021-11-18

The InterPro protein families and domains database: 20 years on

OPENALEX - Publications

Matthias Blum Hsin-Yu Chang Sara Chuguransky Tiago Grego Swaathi Kandasaamy and 28 more

The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains conserved sites. InterProScan is the underlying software that allows nucleic acid to be searched against InterPro's signatures. Signatures are predictive models which describe or sites, provided by multiple databases. combines signatures representing equivalent additional information such as descriptions, literature...

10.1093/nar/gkaa977 article EN cc-by Nucleic Acids Research 2020-10-23

InterPro in 2022

OPENALEX - Publications

Typhaine Paysan-Lafosse Matthias Blum Sara Chuguransky Tiago Grego Beatriz Lázaro and 21 more

Abstract The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains conserved sites. Here, we report recent developments with (version 90.0) its associated software, including updates to data content the website. These extend enrich information provided by InterPro, provide a more user friendly access data. Additionally, have worked on adding Pfam website features website, as...

10.1093/nar/gkac993 article EN cc-by Nucleic Acids Research 2022-11-09

Database resources of the National Center for Biotechnology Information

OPENALEX - Publications

Richa Agarwala Tanya Barrett Jeffrey Beck D. A. Benson Colleen J. Bollin and 68 more

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources biological information and data, including the GenBank® nucleic acid sequence database PubMed citations abstracts published life science journals. Entrez system search retrieval operations most these data from 39 distinct databases. E-utilities serve as programming interface system. Augmenting many Web applications are custom implementations BLAST program optimized to specialized sets. New...

10.1093/nar/gkx1095 article EN cc-by-nc Nucleic Acids Research 2017-11-09

InterPro in 2017—beyond protein family and domain annotations

OPENALEX - Publications

ROBERT FINN Teresa K. Attwood Patricia C. Babbitt Alex Bateman Peer Bork and 42 more

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and predict the presence of important domains sites. InterProScan underlying software that allows both nucleic acid be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with associated software, including addition two new databases (SFLD CDD), functionality include residue-level annotation...

10.1093/nar/gkw1107 article EN cc-by Nucleic Acids Research 2016-10-27

InterPro in 2019: improving coverage, classification and access to protein sequence annotations

OPENALEX - Publications

Alex Mitchell Teresa K. Attwood Patricia C. Babbitt Matthias Blum Peer Bork and 37 more

The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains sites. Here, we report recent developments with (version 70.0) its associated software, including an 18% growth in size terms on new entries, updates to content, inclusion additional entry type, refined modelling discontinuous domains, development a programmatic interface website. These extend enrich information provided by InterPro,...

10.1093/nar/gky1100 article EN cc-by Nucleic Acids Research 2018-10-22

CDD: a Conserved Domain Database for protein classification

OPENALEX - Publications

Aron Marchler‐Bauer

The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD linked to other databases such as Proteins, Taxonomy PubMed, can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which available http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, a fast, interactive tool identify conserved domains in new sequences. CD-Search results for sequences are pre-computed provide links between proteins domain...

10.1093/nar/gki069 article EN Nucleic Acids Research 2004-12-17

CDD: specific functional annotation with the Conserved Domain Database

OPENALEX - Publications

Aron Marchler‐Bauer James B. Anderson Farideh Chitsaz Myra K. Derbyshire C. DeWeese-Scott and 23 more

NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, also part Entrez query retrieval system, cross-linked to numerous other resources. CDD provides annotation domain footprints functional sites on sequences. Precalculated retrieved for sequences tracked CDD's models queried with...

10.1093/nar/gkn845 article EN Nucleic Acids Research 2008-11-04

The Saccharomyces cerevisiae zinc finger proteins Msn2p and Msn4p are required for transcriptional induction through the stress response element (STRE).

OPENALEX - Publications

María Teresa Martínez‐Pastor Gabriele H. Marchler Christoph Schüller Aron Marchler‐Bauer Helmut Ruis and 1 more

10.1002/j.1460-2075.1996.tb00576.x article EN The EMBO Journal 1996-05-01

CDD: a conserved domain database for interactive domain family analysis

OPENALEX - Publications

Aron Marchler‐Bauer J. B. Anderson Myra K. Derbyshire C. DeWeese-Scott Nicole R. Gonzales and 20 more

The conserved domain database (CDD) is part of NCBI's Entrez system and serves as a primary resource for the annotation footprints on protein sequences in Entrez. Entrez's global query interface can be accessed at Author Webpage will search CDD many other databases. Domain proteins has been pre-computed readily available form 'Conserved Domain' links. Novel scanned against using CD-Search service; this service searches databases CDD-derived profile models with sequence queries BLAST...

10.1093/nar/gkl951 article EN Nucleic Acids Research 2006-11-30

CDD: conserved domains and protein three-dimensional structure

OPENALEX - Publications

Aron Marchler‐Bauer Chanjuan Zheng Farideh Chitsaz Myra K. Derbyshire Lewis Y. Geer and 13 more

CDD, the Conserved Domain Database, is part of NCBI's Entrez query and retrieval system also accessible via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. CDD provides annotation protein sequences with location conserved domain footprints functional sites inferred from these footprints. Pre-computed available Entrez, interactive search services accept single or nucleotide queries, as well batch submissions sequences, utilizing RPS-BLAST to rapidly identify putative matches....

10.1093/nar/gks1243 article EN cc-by-nc Nucleic Acids Research 2012-11-28

RefSeq: an update on prokaryotic genome annotation and curation

OPENALEX - Publications

Daniel H. Haft Michael DiCuccio Azat Badretdin Vyacheslav Brover Vyacheslav Chetvernin and 16 more

The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation over 95 000 prokaryotic genomes that meet standards sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with resource is as consistent accurate possible. Notable recent changes include development of hierarchical evidence scheme, new focus on curating sources,...

10.1093/nar/gkx1068 article EN cc-by-nc Nucleic Acids Research 2017-10-25

The NCBI BioSystems database

OPENALEX - Publications

Lewis Y. Geer Aron Marchler‐Bauer Renata C. Geer Lianyi Han Jane He and 4 more

The NCBI BioSystems database, found at http://www.ncbi.nlm.nih.gov/biosystems/, centralizes and cross-links existing biological systems databases, increasing their utility target audience by integrating pathways into resources. This integration allows users of NCBI's Entrez databases to quickly categorize proteins, genes small molecules metabolic pathway, disease state or other BioSystem type, without requiring time-consuming inference relationships from the literature multiple experimental datasets.

10.1093/nar/gkp858 article EN cc-by-nc Nucleic Acids Research 2009-10-23

RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation

OPENALEX - Publications

Wenjun Li Kathleen O’Neill Daniel H. Haft Michael DiCuccio Vyacheslav Chetvernin and 16 more

Abstract The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) contains nearly 200 000 bacterial and archaeal genomes 150 million proteins with up-to-date annotation. Changes in Prokaryotic Genome Annotation Pipeline (PGAP) since 2018 have resulted a substantial reduction spurious hierarchical collection of protein family models (PFMs) used by PGAP as evidence structural functional annotation was expanded to over 35 profile hidden Markov (HMMs),...

10.1093/nar/gkaa1105 article EN cc-by-nc Nucleic Acids Research 2020-11-03

CDD: a curated Entrez database of conserved domain alignments

OPENALEX - Publications

Aron Marchler‐Bauer

The Conserved Domain Database (CDD) is now indexed as a separate database within the Entrez system and linked to other databases such MEDLINE(R). This allows users search for domain types by name, example, or view architecture of any protein in Entrez's sequence database. CDD can be accessed on WorldWideWeb at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. Users may also employ CD-Search service identify conserved domains new sequences,...

10.1093/nar/gkg087 article EN Nucleic Acids Research 2003-01-01

CDD: a database of conserved domain alignments with links to domain three-dimensional structure

OPENALEX - Publications

Aron Marchler‐Bauer

The Conserved Domain Database (CDD) is a compilation of multiple sequence alignments representing protein domains conserved in molecular evolution. It has been populated with alignment data from the public collections Pfam and SMART, as well contributions colleagues at NCBI. current version CDD (v.1.54) contains 3693 such models. are linked to structure Entrez. viewer Cn3D serves tool interactively visualize three-dimensional structure, link residue coordinates descriptions evolutionary...

10.1093/nar/30.1.281 article EN Nucleic Acids Research 2002-01-01

Database resources of the National Center for Biotechnology Information

OPENALEX - Publications

Eric W Sayers Tanya Barrett D. A. Benson Evan Bolton Stephen H. Bryant and 37 more

In addition to maintaining the GenBank® nucleic acid sequence database, National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources data in GenBank other biological made available through NCBI Web site. include Entrez, Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP,...

10.1093/nar/gkq1172 article EN cc-by-nc Nucleic Acids Research 2010-11-21

Database resources of the National Center for Biotechnology Information

OPENALEX - Publications

Eric W Sayers Tanya Barrett D. A. Benson Evan Bolton Stephen H. Bryant and 35 more

In addition to maintaining the GenBank® nucleic acid sequence database, National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources data in GenBank other biological made available through NCBI web site. include Entrez, Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Gene, Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, Reference Sequence, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes,...

10.1093/nar/gkp967 article EN cc-by-nc Nucleic Acids Research 2009-11-12

Database resources of the National Center for Biotechnology Information

OPENALEX - Publications

Eric W Sayers Tanya Barrett D. A. Benson Evan Bolton Stephen H. Bryant and 38 more

In addition to maintaining the GenBank® nucleic acid sequence database, National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources data in GenBank other biological made available through NCBI Website. include Entrez, Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome related tools,...

10.1093/nar/gkr1184 article EN cc-by-nc Nucleic Acids Research 2011-12-02

Coming Soon ...