Introduction to common functional genomics data | Epigenomic annotations | DNase-seq | ChIP-seq | ATAC-seq | eQTL | ENCODE | ROADMAP

functional genomics | epigenomic annotations

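Newcomers to the field tend to be lost here: the epigenomic marks seem uninteresting, there are many kinds of them, and they are hard to keep straight. This post is a common-sense summary that keeps the jargon to a minimum.

There is not that much you need to know. Only a handful of data types are common, and understanding what ENCODE and ROADMAP provide is enough. There are a lot of details, though, so a little patience helps.

All of these data types can be browsed directly in this genome browser: http://genomebrowser.wustl.edu/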

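Notes:

  • All epigenomic and transcriptomic data are highly tissue (cell type) specific
  • The most distinctive requirement of ChIP-seq is an input sample, which serves as the control
  • ChIP-seq can identify both direct and indirect protein-DNA interactions
  • ChIP-seq is preferred when functional information is needed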
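
To make the role of the input control concrete, here is a minimal sketch of how a ChIP signal can be compared against input in a single genomic window. The function name, the pseudocount, and the reads-per-million scaling are illustrative assumptions, not the method of any particular peak caller; real peak callers use more elaborate statistical models.

```python
import math


def fold_enrichment(chip_count, input_count, chip_total, input_total, pseudocount=1.0):
    """log2 enrichment of ChIP over input for one genomic window.

    chip_count / input_count: reads falling in the window.
    chip_total / input_total: total mapped reads in each library.
    pseudocount: hypothetical smoothing value to avoid dividing by zero.
    """
    # Scale to reads per million so that sequencing depth cancels out.
    chip_rpm = (chip_count + pseudocount) / chip_total * 1e6
    input_rpm = (input_count + pseudocount) / input_total * 1e6
    return math.log2(chip_rpm / input_rpm)


# Toy numbers: the ChIP library is twice as deep as the input library.
print(fold_enrichment(39, 9, 2_000_000, 1_000_000))
```

A value near 0 means the window looks the same in ChIP and input, so any pile-up there is more likely background than real binding.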

Raw data:

  • DHSs
  • H3K4me3
  • H3K9ac
  • H3K27ac
  • H3K4me1

Processed data:

  • Enhancer
  • TFBSs

ChIP-seq (immunological assays) accounts for a large share of these data, so understanding it gets you most of the way.

The other class is non-immunological assays: ATAC-seq, MNase-seq, DNase-seq, and FAIRE-seq.

DHSs

DNase I hypersensitive site

DNase-seq

FAIRE-seq is a successor to DNase-seq

genome-wide DNA footprints

Deoxyribonuclease (DNase): an enzyme that cleaves DNA

DNase I hypersensitive sites (DHSs) are regions of chromatin that are sensitive to cleavage by the DNase I enzyme. In these specific regions of the genome, chromatin has lost its condensed structure, exposing the DNA and making it accessible. This raises the availability of DNA to degradation by enzymes, such as DNase I. These accessible chromatin zones are functionally related to transcriptional activity, since this remodeled state is necessary for the binding of proteins such as transcription factors.

ChIP-seq

Basically,
- "encc-enhancer.bed" is enhancers defined with H3K27ac & H3K4me1 activity
- "encc-enhancer-atac.bed" is enhancers defined with H3K27ac & H3K4me1 activity as well as open chromatin (ATAC-seq) signal summits.
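
As a rough illustration of what "defined with H3K27ac & H3K4me1 activity" plus ATAC-seq summits could look like in code, here is a small pure-Python sketch. It assumes that "&" means overlap between the two peak sets and that a region is kept when it contains at least one ATAC-seq summit; the notes above do not say exactly how the two files were built, so treat both rules, the function names, and the toy coordinates as assumptions.

```python
from collections import defaultdict


def intersect(a, b):
    """Return the overlapping parts of two interval lists (BED-style, half-open)."""
    by_chrom = defaultdict(list)
    for chrom, start, end in b:
        by_chrom[chrom].append((start, end))
    out = []
    for chrom, start, end in a:
        for b_start, b_end in by_chrom[chrom]:
            lo, hi = max(start, b_start), min(end, b_end)
            if lo < hi:  # non-empty overlap
                out.append((chrom, lo, hi))
    return sorted(out)


def keep_with_summit(regions, summits):
    """Keep regions that contain at least one summit position (0-based)."""
    return [
        (chrom, start, end)
        for chrom, start, end in regions
        if any(c == chrom and start <= pos < end for c, pos in summits)
    ]


# Toy peak sets standing in for real peak files.
h3k27ac = [("chr1", 100, 500), ("chr1", 900, 1200)]
h3k4me1 = [("chr1", 300, 700), ("chr1", 1000, 1100), ("chr2", 50, 80)]
atac_summits = [("chr1", 1050)]

enhancers = intersect(h3k27ac, h3k4me1)
enhancers_atac = keep_with_summit(enhancers, atac_summits)
print(enhancers)
print(enhancers_atac)
```

The nested loops are fine for toy data; for genome-scale peak files a dedicated interval tool would be the more practical choice.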

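What the different ChIP-seq marks tell you; a picture is worth a thousand words: [we used the first row and the last row, which was the most efficient choice]

Comparison of the different epigenomic annotations:

To be continued.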


Getting started quickly with epigenomic annotation data:

There is a file set called baseline_v1.1 that bundles a wide range of curated epigenomic annotations.

https://data.broadinstitute.org/alkesgroup/LDSCORE/baseline_v1.1_bedfiles.tgz
~/project2/CPloci/Evo/ENCODE/

  

Data types included:

  • Coding
  • Intron
  • Transcribed
  • Conserved
  • DGF
  • DHS
  • H3K9ac
  • H3K27ac
  • H3K4me1
  • H3K4me3
  • CTCF
  • TFBS
  • TSS
  • Promoter
  • Enhancer
  • SuperEnhancer
  • WeakEnhancer
  • Repressed
  • UTR_5
  • UTR_3

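That is quite a wide range of categories. If you do not need high precision, you can use them as they are; all of them are in BED format.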
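
Since everything is in BED format, a quick way to use the files is to ask which annotation categories cover a given position. The sketch below assumes one BED file per category in a directory and uses made-up toy file names and coordinates; the real file names inside the archive may differ, so check them after extracting.

```python
import os
import tempfile


def read_bed(path):
    """Read the first three BED columns into (chrom, start, end) tuples."""
    intervals = []
    with open(path) as handle:
        for line in handle:
            fields = line.split()
            # Skip blank lines, comments, and browser/track header lines.
            if len(fields) < 3 or line.startswith(("#", "track", "browser")):
                continue
            intervals.append((fields[0], int(fields[1]), int(fields[2])))
    return intervals


def annotate(chrom, pos, bed_dir):
    """Names of the BED files in bed_dir that cover a 0-based position."""
    hits = []
    for name in sorted(os.listdir(bed_dir)):
        if not name.endswith(".bed"):
            continue
        for c, start, end in read_bed(os.path.join(bed_dir, name)):
            if c == chrom and start <= pos < end:  # BED intervals are half-open
                hits.append(name[:-4])
                break
    return hits


# Toy directory standing in for the extracted archive (hypothetical file names).
with tempfile.TemporaryDirectory() as demo_dir:
    toy = {
        "DHS.bed": "chr1\t100\t200\n",
        "Enhancer.bed": "chr1\t150\t400\n",
        "Coding.bed": "chr2\t10\t90\n",
    }
    for name, text in toy.items():
        with open(os.path.join(demo_dir, name), "w") as handle:
            handle.write(text)
    demo_hits = annotate("chr1", 160, demo_dir)
    print(demo_hits)
```

Note that BED start coordinates are 0-based and end coordinates are exclusive, so a 1-based SNP position needs 1 subtracted before calling `annotate`.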

  

References:

Chromatin accessibility and the regulatory epigenome

Identifying and mitigating bias in next-generation sequencing methods for chromatin biology - Xiaole Liu (刘小乐)

Chromatin Structure Research Methods

Introduction to ChIP-seq and ATAC-seq - excellent

Mapping DNA-protein interactions via ChIP-seq - very detailed

How to identify gene promoters and enhancers through ChIP-seq analysis - an in-depth ChIP-seq explainer

ChIP-seq in practice (H3K27ac, enhancer selection, and GO analysis of enhancer-associated genes) - hands-on walkthrough

Original post: https://www.cnblogs.com/leezx/p/14336365.html