William Stafford Noble

ORCID: 0000-0001-7283-4715
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Genomics and Chromatin Dynamics
  • Advanced Proteomics Techniques and Applications
  • Mass Spectrometry Techniques and Applications
  • Machine Learning in Bioinformatics
  • Genomics and Phylogenetic Studies
  • RNA and protein synthesis mechanisms
  • Metabolomics and Mass Spectrometry Studies
  • Gene expression and cancer classification
  • Single-cell and spatial transcriptomics
  • RNA Research and Splicing
  • Bioinformatics and Genomic Networks
  • Epigenetics and DNA Methylation
  • Chromosomal and Genetic Variations
  • RNA modifications and cancer
  • Protein Structure and Dynamics
  • Cancer Genomics and Diagnostics
  • Cell Image Analysis Techniques
  • Advanced Biosensing Techniques and Applications
  • CRISPR and Genetic Engineering
  • Scientific Computing and Data Management
  • Cancer-related molecular mechanisms research
  • Genetic and Clinical Aspects of Sex Determination and Chromosomal Abnormalities
  • Genetics, Bioinformatics, and Biomedical Research
  • Genomic variations and chromosomal abnormalities
  • Fungal and yeast genetics research

University of Washington
2016-2025

Seattle University
2015-2025

Human Genome Sciences (United States)
2018-2019

Center for Innovation
2015

École Nationale Supérieure des Mines de Paris
2007-2014

Institut Curie
2014

Inserm
2014

University of Massachusetts Chan Medical School
2013

The University of Queensland
2009-2011

The University of Sydney
2011

The MEME Suite web server provides a unified portal for online discovery and analysis of sequence motifs representing features such as DNA binding sites protein interaction domains. popular motif algorithm is now complemented by the GLAM2 which allows containing gaps. Three scanning algorithms—MAST, FIMO GLAM2SCAN—allow numerous databases discovered GLAM2. Transcription factor (including those using MEME) can be compared with in many database Tomtom. further analyzed putative function...

10.1093/nar/gkp335 article EN cc-by-nc Nucleic Acids Research 2009-05-20
Ewan Birney J Stamatoyannopoulos Anindya Dutta Roderic Guigó T Gingeras and 95 more Elliott H. Margulies Zhiping Weng M Snyder Emmanouil T. Dermitzakis Robert E. Thurman Michael S. Kuehn Christopher M. Taylor Shane Neph Christof Koch Saurabh Asthana Ankit Malhotra Ivan Adzhubei Jason Greenbaum Robert Andrews Paul Flicek Patrick J. Boyle Hua Cao N. P. Carter Gayle K. Clelland Sean Davis Nathan Day Pawandeep Dhami Shane C. Dillon Michael O. Dorschner Heike Fiegler Paul G. Giresi Jeff Goldy Michael Hawrylycz Andrew Haydock Richard Humbert Keith D. James Brett Johnson Ericka M. Johnson Tristan Frum Elizabeth Rosenzweig Neerja Karnani Kirsten Lee Grégory Lefebvre Patrick A. Navas Fidencio Neri Stephen C. J. Parker Peter J. Sabo Richard Sandstrom Anthony Shafer David Vetrie Molly Weaver Sarah Wilcox Man Yu Francis S. Collins Job Dekker Jason D. Lieb Thomas D. Tullius Gregory E. Crawford Shamil Sunyaev William Stafford Noble Ian Dunham Alexandre Reymond Philipp Kapranov Joel Rozowsky Deyou Zheng Robert Castelo Adam Frankish Jennifer Harrow Srinka Ghosh Albin Sandelin Ivo L. Hofacker Robert Baertsch Damian Keefe Sujit Dike Jill Cheng Heather A. Hirsch Edward A. Sekinger Julien Lagarde Josep F. Abril Atif Shahab Christoph Flamm Claudia Fried Jörg Hackermüller Jana Hertel Manja Lindemeyer Kristin Missal Andrea Tanzer Stefan Washietl Jan O. Korbel Olof Emanuelsson Jakob Skou Pedersen Nancy Holroyd Ruth Taylor David Swarbreck Nicholas Matthews Mark Dickson Daryl J. Thomas Matthew T. Weirauch James Gilbert Jörg Drenkow

10.1038/nature05874 article EN Nature 2007-06-01

Abstract Summary: A motif is a short DNA or protein sequence that contributes to the biological function of in which it resides. Over past several decades, many computational methods have been described for identifying, characterizing and searching with motifs. Critical nearly any motif-based analysis pipeline ability scan database occurrences given by position-specific frequency matrix. Results: We describe Find Individual Motif Occurrences (FIMO), software tool scanning sequences motifs as...

10.1093/bioinformatics/btr064 article EN cc-by-nc Bioinformatics 2011-02-16

The MEME Suite is a powerful, integrated set of web-based tools for studying sequence motifs in proteins, DNA and RNA. Such encode many biological functions, their detection characterization important the study molecular interactions cell, including regulation gene expression. Since previous description 2009 Nucleic Acids Research Web Server Issue, we have added six new tools. Here describe capabilities all within suite, give advice on best use provide several case studies to illustrate how...

10.1093/nar/gkv416 article EN Nucleic Acids Research 2015-05-07

A common question within the context of de novo motif discovery is whether a newly discovered, putative resembles any previously discovered in an existing database. To answer this question, we define statistical measure motif-motif similarity, and describe algorithm, called Tomtom, for searching database motifs with given query motif. Experimental simulations demonstrate accuracy Tomtom's E values its effectiveness finding similar motifs.

10.1186/gb-2007-8-2-r24 article EN cc-by Genome biology 2007-02-26

Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by ENCODE Consortium. identified highly enriched sequence motifs in most sets, revealing new and validating known ones. The motif sites (TF sites) are conserved evolutionarily show distinct footprints upon DNase...

10.1101/gr.139105.112 article EN cc-by-nc Genome Research 2012-09-01

10.1038/nbt1209-1135 article EN Nature Biotechnology 2009-12-01

During the past decade, new focus on genomics has highlighted a particular challenge: to integrate different views of genome that are provided by various types experimental data.This paper describes computational framework for integrating and drawing inferences from collection genome-wide measurements. Each dataset is represented via kernel function, which defines generalized similarity relationships between pairs entities, such as genes or proteins. The representation both flexible...

10.1093/bioinformatics/bth294 article EN Bioinformatics 2004-05-06

Motivation: Despite advances in high-throughput methods for discovering protein–protein interactions, the interaction networks of even well-studied model organisms are sketchy at best, highlighting continued need computational to help direct experimentalists search novel interactions.

10.1093/bioinformatics/bti1016 article EN Bioinformatics 2005-06-01

The ENCODE Project has generated a wealth of experimental information mapping diverse chromatin properties in several human cell lines. Although each such data track is independently informative toward the annotation regulatory elements, their interrelations contain much richer for systematic elements. To uncover these and to generate an interpretable summary massive datasets Project, we apply unsupervised learning methodologies, converting dozens into discrete maps regions other elements...

10.1093/nar/gks1284 article EN cc-by-nc Nucleic Acids Research 2012-12-05

Abstract Motivation: Classification of proteins sequences into functional and structural families based on sequence homology is a central problem in computational biology. Discriminative supervised machine learning approaches provide good performance, but simplicity efficiency training prediction are also important concerns. Results: We introduce class string kernels, called mismatch for use with support vector machines (SVMs) discriminative approach to the protein classification remote...

10.1093/bioinformatics/btg431 article EN Bioinformatics 2004-01-22

Our current understanding of how DNA is packed in the nucleus most accurate at fine scale individual nucleosomes and large chromosome territories. However, modeling architecture intermediate ∼50 kb–10 Mb crucial for identifying functional interactions among regulatory elements their target promoters. We describe a method, Fit-Hi-C , that assigns statistical confidence estimates to mid-range intra-chromosomal contacts by jointly random polymer looping effect previously observed technical...

10.1101/gr.160374.113 article EN cc-by-nc Genome Research 2014-02-05

Sequence census methods like ChIP-seq now produce an unprecedented amount of genome-anchored data. We have developed integrative method to identify patterns from multiple experiments simultaneously while taking full advantage high-resolution data, discovering joint across different assay types. apply this ENCODE chromatin data for the human chronic myeloid leukemia cell line K562, including on covalent histone modifications and transcription factor binding, DNase-seq FAIRE-seq readouts open...

10.1145/2506583.2506701 article EN 2013-09-22

Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods assessing data reproducibility can produce misleading results because they ignore spatial features in data, such as domain structure and distance dependence. We present HiCRep, framework the of that systematically accounts these features. In particular, we introduce novel similarity measure, stratum adjusted correlation coefficient (SCC), quantifying between interaction matrices. Not only...

10.1101/gr.220640.117 article EN cc-by-nc Genome Research 2017-08-30
Coming Soon ...