Bioinformatics Asked by musinn on November 29, 2020
Is there any user friendly way to find rare mutations in the individual human whole genome sequencing raw data? (from Dante, 30x coverage).
To be more specific, I want to find mutations from this paper:
https://docs.google.com/document/d/1EkRMuD6J0-zyMY3MegKhz7Th0hzhzVXg6DWg2Hm6l2Y/edit?usp=sharing
(it’s very short, less than one page).
But I’m confused. In their paper they just talk about genes (RAD21, B3GAT2, SMC3, SCN11A , SCN5A, SCN9A, SCN10A, SCN11A, TRPA1), but not mutations? Can we find that diseased-mutations which they talking about in genes, which their list? Or is there not enough data in the paper/study for this?
And if there is not enough data, which data I need to request from authors?
Or those genes have kind a “gold standard sequences” and if that sequence different from standard – there is “diseased” gene?
(I have only very basic genetics and bioinformatics knowledges)
You need to run a SNP finding pipeline, filter for SNPs in those genes and then run a SNP annotation pipeline. (You could annotate before you filter but it will take longer).
SNP finding usually has three steps, 1) read alignment to reference, 2) SNP calling and 3) SNP filtering (by quality).
You then bring in gene annotation data to filter SNPs to just those genes.
You then run SNP annotation using tools like VEP, SIFT, PolyPhen, etc.
People use tools like GATK to do this.
Hopefully that's enough information to get you started.
Answered by Dan Bolser on November 29, 2020
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP