iRegulon: From a Gene List to a Gene Regulatory Network Using Large Motif and Track Collections
0301 basic medicine
Chromatin Immunoprecipitation
QH301-705.5
Wide Identification
Breast Neoplasms
03 medical and health sciences
Expression Analysis
Cell Line, Tumor
Databases, Genetic
Humans
Gene Regulatory Networks
Nf-Y
Biology (General)
Models, Genetic
Sequence Analysis, RNA
Target Genes
Computational Biology
Protein-Dna Interactions
006
Multiple Sequence Alignment
Genes, p53
3. Good health
Gene Expression Regulation
Chip-Seq Datasets
Transcription Factor-Binding
Over-Representation
Integrative Genomics Viewer
Research Article
Transcription Factors
DOI:
10.1371/journal.pcbi.1003731
Publication Date:
2014-07-24T14:44:34Z
AUTHORS (15)
ABSTRACT
Identifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (137)
CITATIONS (747)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....