A database that uncovers the molecular basis of TF binding in the human genome based on regulatory motif analysis of all Transcription Factors (TFs) grouped by family. This allows browsing of all known motifs for each factor, curated from TRANSFAC, Jaspar, and Protein Binding Microarray (PBM) experiments, and their enrichment and instances within corresponding TF binding experiments. It also provides a list of novel regulatory motifs discovered by systematic application of several motif discovery tools (including MEME, MDscan, Weeder, AlignACE) and evaluated based on their enrichment relative to control motifs within TF-bound regions. ENCODE-motifs also provides a genome-wide map of regulatory motif instances in the human genome for both known and novel motifs.
Kheradpour P, Kellis M. ENCODE-motifs: systematic analysis of regulatory motifs associated with transcription factor binding in the human genome. (Submitted)
A wiki-style resource that organizes all the information associated with each transcription factor (TF), including the ChIP-seq peaks, discovered motifs, TF-TF interactions, and the chromatin features (histone modification patterns, DNase I cleavage, and nucleosome positioning) around the ChIP-seq peaks. Will be updated as the project proceeds. The Factorbook display of this information is transcription factor anchored and dynamic.
Wang J, Zhuang J, Iyer S, Lin X, Whitfield TW, Greven MC, Pierce BG, Dong X, Kundaje A, Cheng Y et al. Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors. Genome Res. 2012 Sep;22(9):1798-812.
Explores annotations of the noncoding genome at variants on haplotype blocks, such as candidate regulatory SNPs at disease-associated loci.
Under Set Options tab, set Browse ENCODE button to on and select an LD threshold.
Under Build Query Tab, enter SNP (rsXXXXX)
HaploReg returns SNPs in LD, their frequency in 3 populations, and also tells you what evidence ENCODE has found for regulatory protein binding (mouse over to see the protein names), chromatin structure (mouse over to see the cell types), and the chromatin state of the region (the chromatin state can predict an enhancer or promoter), and putative transcription factor binding motifs that are altered by the variant. Clicking on the SNP name hyperlink reveals chromosomal position, sequence, allele frequencies in three populations, and ENCODE chromatin state in 9 cell types.
Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012 Jan;40(Database issue):D930-4.
Identifies DNA features and regulatory elements in non-coding regions of the human genome. One can enter dbSNP IDs, BED files, VCF files, or GFF3 files. A score is returned assessing the evidence for regulatory potential. Clicking on the score reveals the data supporting the inference, by data type and cell type. One can also click on hyperlinks to see the SNP or the region in the UCSC browser, ENSEMBL browser, and dbSNP.
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, Karczewski KJ, Park J, Hitz BC, Weng S et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012 Sep;22(9):1790-7.
Tutorial on functional annotation of non-coding sequences
Briefly describes how to display the basic ENCODE tracks, using the UCSC genome browser, at SNPs identified from GWAS studies.
Mortlock DP, Pregizer S. Identifying functional annotation for noncoding genomic sequences. Curr Protoc Hum Genet. 2012 Jan;Chapter 1:Unit1.10.
Tutorials on using ENCODE data and the ENCODE portal at UCSC
A suite of tutorials and PowerPoint handouts that include how to identify ENCODE data on the UCSC browser, what types of ENCODE data are available, what is the data use policy, how to display the data, and how to download the data. There are also links to other tutorials on the use of the UCSC genome browser.