NFDI4DS | UHH-SEMS - Publication Details

William Stafford Noble

ORCID: 0000-0001-7283-4715

About

Contact & Profiles

A5057375933

Research Areas

Genomics and Chromatin Dynamics
Advanced Proteomics Techniques and Applications
Mass Spectrometry Techniques and Applications
Machine Learning in Bioinformatics
Genomics and Phylogenetic Studies
RNA and protein synthesis mechanisms
Metabolomics and Mass Spectrometry Studies
Gene expression and cancer classification
Single-cell and spatial transcriptomics
RNA Research and Splicing
Bioinformatics and Genomic Networks
Epigenetics and DNA Methylation
Chromosomal and Genetic Variations
RNA modifications and cancer
Protein Structure and Dynamics
Cancer Genomics and Diagnostics
Cell Image Analysis Techniques
Advanced Biosensing Techniques and Applications
CRISPR and Genetic Engineering
Scientific Computing and Data Management
Cancer-related molecular mechanisms research
Genetic and Clinical Aspects of Sex Determination and Chromosomal Abnormalities
Genetics, Bioinformatics, and Biomedical Research
Genomic variations and chromosomal abnormalities
Fungal and yeast genetics research

University of Washington
2016-2025

Seattle University
2015-2025

Human Genome Sciences (United States)
2018-2019

Center for Innovation
2015

École Nationale Supérieure des Mines de Paris
2007-2014

Institut Curie
2014

Inserm
2014

University of Massachusetts Chan Medical School
2013

The University of Queensland
2009-2011

The University of Sydney
2011

MEME SUITE: tools for motif discovery and searching

OPENALEX - Publications

Timothy L. Bailey, Mikael Bodén, Fabian A. Buske, Martin C. Frith, Charles E. Grant, and 4 more

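The MEME Suite web server provides a unified portal for online discovery and analysis of sequence motifs representing features such as DNA binding sites and protein interaction domains. The popular MEME motif discovery algorithm is now complemented by the GLAM2 algorithm, which allows discovery of motifs containing gaps. Three sequence scanning algorithms—MAST, FIMO and GLAM2SCAN—allow scanning numerous DNA and protein sequence databases for motifs discovered by MEME and GLAM2. Transcription factor motifs (including those discovered using MEME) can be compared with motifs in many popular motif databases using the motif database scanning algorithm Tomtom. Transcription factor motifs can be further analyzed for putative function...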

10.1093/nar/gkp335 article EN cc-by-nc Nucleic Acids Research 2009-05-20

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

OPENALEX - Publications

Ewan Birney, J Stamatoyannopoulos, Anindya Dutta, Roderic Guigó, T Gingeras, and 95 more

10.1038/nature05874 article EN Nature 2007-06-01

FIMO: scanning for occurrences of a given motif

OPENALEX - Publications

Charles E. Grant, Timothy L. Bailey, William Stafford Noble

Summary: A motif is a short DNA or protein sequence that contributes to the biological function of the sequence in which it resides. Over the past several decades, many computational methods have been described for identifying, characterizing and searching with sequence motifs. Critical to nearly any motif-based sequence analysis pipeline is the ability to scan a sequence database for occurrences of a given motif described by a position-specific frequency matrix. Results: We describe Find Individual Motif Occurrences (FIMO), a software tool for scanning DNA or protein sequences with motifs described as...

10.1093/bioinformatics/btr064 article EN cc-by-nc Bioinformatics 2011-02-16

The MEME Suite

OPENALEX - Publications

Timothy L. Bailey, James Johnson, Charles E. Grant, William Stafford Noble

The MEME Suite is a powerful, integrated set of web-based tools for studying sequence motifs in proteins, DNA and RNA. Such motifs encode many biological functions, and their detection and characterization is important in the study of molecular interactions in the cell, including the regulation of gene expression. Since the previous description of the MEME Suite in the 2009 Nucleic Acids Research Web Server Issue, we have added six new tools. Here we describe the capabilities of all the tools within the suite, give advice on their best use and provide several case studies to illustrate how...

10.1093/nar/gkv416 article EN Nucleic Acids Research 2015-05-07

Semi-supervised learning for peptide identification from shotgun proteomics datasets

OPENALEX - Publications

Lukas Käll, Jesse D. Canterbury, Jason Weston, William Stafford Noble, Michael J. MacCoss

10.1038/nmeth1113 article EN Nature Methods 2007-10-21

Quantifying similarity between motifs

OPENALEX - Publications

Shobhit Gupta, J Stamatoyannopoulos, Timothy L. Bailey, William Stafford Noble

A common question within the context of de novo motif discovery is whether a newly discovered, putative motif resembles any previously discovered motif in an existing database. To answer this question, we define a statistical measure of motif-motif similarity, and we describe an algorithm, called Tomtom, for searching a database of motifs with a given query motif. Experimental simulations demonstrate the accuracy of Tomtom's E values and its effectiveness in finding similar motifs.

10.1186/gb-2007-8-2-r24 article EN cc-by Genome biology 2007-02-26

Assessing computational tools for the discovery of transcription factor binding sites

OPENALEX - Publications

Martin Tompa, Nan Li, Timothy L. Bailey, George M. Church, Bart De Moor, and 20 more

10.1038/nbt1053 article EN Nature Biotechnology 2005-01-01

A three-dimensional model of the yeast genome

OPENALEX - Publications

Zhijun Duan, Mirela Andronescu, Kevin Schutz, Sean McIlwain, Yoo Jung Kim, and 5 more

10.1038/nature08973 article EN Nature 2010-05-01

The spectrum kernel: a string kernel for SVM protein classification

OPENALEX - Publications

Christina Leslie, Eleazar Eskin, William Stafford Noble

10.1142/9789812799623_0053 article EN Biocomputing 2001-12-01

Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors

OPENALEX - Publications

Jie Wang, Jiali Zhuang, Sowmya Iyer, Xin-Ying Lin, Troy W. Whitfield, and 11 more

Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are conserved evolutionarily and show distinct footprints upon DNase...

10.1101/gr.139105.112 article EN cc-by-nc Genome Research 2012-09-01

How does multiple testing correction work?

OPENALEX - Publications

William Stafford Noble

10.1038/nbt1209-1135 article EN Nature Biotechnology 2009-12-01

A statistical framework for genomic data fusion

OPENALEX - Publications

Gert Lanckriet, Tijl De Bie, Nello Cristianini, Michael I. Jordan, William Stafford Noble

During the past decade, the new focus on genomics has highlighted a particular challenge: to integrate the different views of the genome that are provided by various types of experimental data. This paper describes a computational framework for integrating and drawing inferences from a collection of genome-wide measurements. Each dataset is represented via a kernel function, which defines generalized similarity relationships between pairs of entities, such as genes or proteins. The kernel representation is both flexible...

10.1093/bioinformatics/bth294 article EN Bioinformatics 2004-05-06

Unsupervised pattern discovery in human chromatin structure through genomic segmentation

OPENALEX - Publications

Michael M. Hoffman, Orion J. Buske, Jie Wang, Zhiping Weng, Jeff Bilmes, and 1 more

10.1038/nmeth.1937 article EN Nature Methods 2012-03-18

Global mapping of protein-DNA interactions in vivo by digital genomic footprinting

OPENALEX - Publications

Jay R. Hesselberth, Xiaoyu Chen, Zhihong Zhang, Peter J. Sabo, Richard Sandstrom, and 7 more

10.1038/nmeth.1313 article EN Nature Methods 2009-03-22

Kernel methods for predicting protein-protein interactions

OPENALEX - Publications

Asa Ben‐Hur, William Stafford Noble

Motivation: Despite advances in high-throughput methods for discovering protein–protein interactions, the interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in the search for novel interactions.

10.1093/bioinformatics/bti1016 article EN Bioinformatics 2005-06-01

Integrative annotation of chromatin elements from ENCODE data

OPENALEX - Publications

Michael M. Hoffman, Jason Ernst, Steven P. Wilder, Anshul Kundaje, Robert S. Harris, and 9 more

The ENCODE Project has generated a wealth of experimental information mapping diverse chromatin properties in several human cell lines. Although each such data track is independently informative toward the annotation of regulatory elements, their interrelations contain much richer information for the systematic annotation of regulatory elements. To uncover these interrelations and to generate an interpretable summary of the massive datasets of the ENCODE Project, we apply unsupervised learning methodologies, converting dozens of chromatin datasets into discrete annotation maps of regulatory regions and other chromatin elements...

10.1093/nar/gks1284 article EN cc-by-nc Nucleic Acids Research 2012-12-05

Mismatch string kernels for discriminative protein classification

OPENALEX - Publications

Christina Leslie, Eleazar Eskin, Adiel Cohen, Jason Weston, William Stafford Noble

Motivation: Classification of protein sequences into functional and structural families based on sequence homology is a central problem in computational biology. Discriminative supervised machine learning approaches provide good performance, but simplicity and efficiency of training and prediction are also important concerns. Results: We introduce a class of string kernels, called mismatch kernels, for use with support vector machines (SVMs) in a discriminative approach to the problem of protein classification and remote...

10.1093/bioinformatics/btg431 article EN Bioinformatics 2004-01-22

Massively multiplex single-cell Hi-C

OPENALEX - Publications

Vijay Ramani, Xinxian Deng, Ruolan Qiu, Kevin L. Gunderson, Frank J. Steemers, and 4 more

10.1038/nmeth.4155 article EN Nature Methods 2017-01-30

Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts

OPENALEX - Publications

Ferhat Ay, Timothy L. Bailey, William Stafford Noble

Our current understanding of how DNA is packed in the nucleus is most accurate at the fine scale of individual nucleosomes and at the large scale of chromosome territories. However, accurate modeling of DNA architecture at the intermediate scale of ∼50 kb–10 Mb is crucial for identifying functional interactions among regulatory elements and their target promoters. We describe a method, Fit-Hi-C, that assigns statistical confidence estimates to mid-range intra-chromosomal contacts by jointly modeling the random polymer looping effect and previously observed technical...

10.1101/gr.160374.113 article EN cc-by-nc Genome Research 2014-02-05

A Genome-wide Framework for Mapping Gene Regulation via Cellular Genetic Screens

OPENALEX - Publications

Molly Gasperini, Andrew J. Hill, José L. McFaline‐Figueroa, Beth Martin, Seungsoo Kim, and 8 more

10.1016/j.cell.2018.11.029 article EN publisher-specific-oa Cell 2019-01-01

Unsupervised pattern discovery in human chromatin structure through genomic segmentation

OPENALEX - Publications

Michael M. Hoffman, Orion J. Buske, Jie Wang, Zhiping Weng, Jeff Bilmes, and 1 more

Sequence census methods like ChIP-seq now produce an unprecedented amount of genome-anchored data. We have developed an integrative method to identify patterns from multiple experiments simultaneously while taking full advantage of high-resolution data, discovering joint patterns across different assay types. We apply this method to ENCODE chromatin data for the human chronic myeloid leukemia cell line K562, including ChIP-seq data on covalent histone modifications and transcription factor binding, and DNase-seq and FAIRE-seq readouts of open...

10.1145/2506583.2506701 article EN 2013-09-22

HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient

OPENALEX - Publications

Tao Yang, Feipeng Zhang, Galip Gürkan Yardımcı, Fan Song, Ross C. Hardison, and 3 more

Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a framework for assessing the reproducibility of Hi-C data that systematically accounts for these features. In particular, we introduce a novel similarity measure, the stratum adjusted correlation coefficient (SCC), for quantifying the similarity between Hi-C interaction matrices. Not only...

10.1101/gr.220640.117 article EN cc-by-nc Genome Research 2017-08-30
