Bioinformatics Asked on December 2, 2021
I have several sets of chip-seq data. I called the peaks using Macs2. I am pretty new to the field and I will appreciate any help. I wanted to annotate the peaks and see which peaks are shared between them and which ones are different. What is the best way to do this? which package should I use?
Many thanks
It depends on how many samples you have. If you only compare one sample to the second, you can pool and call peaks and then directly calculate RPKM ratios for each peak. It you have more samples, you may want to use IDR to score differential peaks. You may also use packages such as Fseq or HOMER to treat one group as ChIP experiment while the other group as control to call peaks. Alternatively, you could pool all the samples to call peaks, then calculate read counts in each peak for each sample, and then use DESeq2 or other differential tools to calculate significantly different peaks’ significance and effect size.
Answered by Dr_Hope on December 2, 2021
In a similar question, I recommended that directly comparing called peaks can be somewhat misleading. This topic is addressed in the documentation of the DiffBind
R package on Bioconductor, in the "Comparison of occupancy and affinity based analyses" section.
But if you're just looking for genome annotation software, the annotatr
package on Bioconductor is pretty useful
Answered by James Hawley on December 2, 2021
bedtools intersect
or the find.overlap
function in GenomicRanges
packages in R.bedtools merge
, and then count the number of reads/fragments in each peak for each sample using featureCounts
. The output of this is a matrix, which is very similar to gene expression results. Then you can do some differential tests on this data.Answered by Phoenix Mu on December 2, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP