gatk过滤_GATK Hard-filter 过滤变异结果推荐阈值

Hard-filter阈值探究

GATK4官网给出的推荐阈值:For SNPs:

QD 

MQ 

FS > 60.0

SOR > 3.0

MQRankSum 

ReadPosRankSum 

For indels:

QD 

ReadPosRankSum 

InbreedingCoeff 

FS > 200.0

SOR > 10.0

查看GATK4原始网页:https://software.broadinstitute.org/gatk/documentation/article?id=11097该阈值选择来自于GATK4官网的推荐,阈值依据于比较真 vs. 假 snp的特征值(annotation values)统计分布

One of the most helpful ways to approach hard-filtering is to visualize the distribution of annotation values for a truth set called using a particular pipeline. These distributions are sharped by both the pipeline methodology and the underlying physical properties of the sequence data; so for a given pairing of data generation technology + analysis pipeline, you can derive filtering thresholds based on what the distributions look like for the truth set

评估数据来源:1000Genomes 中的 whole genome trio

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值