1. Running infer_experiment.py with Snakemake
rule check_ss:
    input:
        bed = config["REF"]["genome_bed12"],
        bam = rules.bam_index.output.bam,
    output:
        txt = "result/QC/check_ss/{sample}.txt",
    params:
        name = "{sample}"
    shell:
        """
        # Write the sample name first, then append so that line is not overwritten.
        echo "sample name: {params.name}" > {output.txt}
        infer_experiment.py -r {input.bed} -i {input.bam} \
            -s 1000000 >> {output.txt} 2>&1
        """
2. Interpreting the results
This is PairEnd Data
Fraction of reads failed to determine: 0.0072
Fraction of reads explained by "1++,1--,2+-,2-+": 0.9441
Fraction of reads explained by "1+-,1-+,2++,2--": 0.0487
For how to read these numbers, see the official RSeQC documentation for infer_experiment.py.
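To work with these numbers programmatically, the report can be parsed into a small dictionary. The following is only a minimal sketch: the function name `parse_infer_experiment` and the keys `forward` / `reverse` are hypothetical names chosen for illustration, and it assumes the report lines look exactly like the ones shown above (or the single-end variants "++,--" and "+-,-+").

```python
import re

# Strand patterns as printed in the report (paired-end and single-end variants).
FORWARD_PATTERNS = {"1++,1--,2+-,2-+", "++,--"}
REVERSE_PATTERNS = {"1+-,1-+,2++,2--", "+-,-+"}


def parse_infer_experiment(text):
    """Return the layout line and the three fractions found in the report text.

    The function name and the dictionary keys are illustrative, not part of RSeQC.
    """
    result = {"layout": None, "failed": None, "forward": None, "reverse": None}
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("This is"):
            result["layout"] = line
        elif line.startswith("Fraction of reads failed to determine"):
            result["failed"] = float(line.rsplit(":", 1)[1])
        elif line.startswith("Fraction of reads explained by"):
            # Capture the quoted pattern and the number after the colon.
            match = re.search(r'"([^"]+)"\s*:\s*([0-9.eE+-]+)', line)
            if match is None:
                continue
            pattern, value = match.group(1), float(match.group(2))
            if pattern in FORWARD_PATTERNS:
                result["forward"] = value
            elif pattern in REVERSE_PATTERNS:
                result["reverse"] = value
    return result


# The example report from this section.
report = """This is PairEnd Data
Fraction of reads failed to determine: 0.0072
Fraction of reads explained by "1++,1--,2+-,2-+": 0.9441
Fraction of reads explained by "1+-,1-+,2++,2--": 0.0487
"""
print(parse_infer_experiment(report))
```

Lines that do not match any of the three prefixes (for example a leading "sample name:" line) are simply ignored.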
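3. Determining the library type
For paired-end sequencing, the ratio between the "1++,1--,2+-,2-+" fraction and the "1+-,1-+,2++,2--" fraction indicates the library type; for single-end sequencing, the same holds for "++,--" versus "+-,-+":
When the two fractions are roughly equal (ratio close to 1), the library is unstranded.
When the former is clearly larger than the latter, the library is FR/fr-secondstrand stranded.
When the latter is clearly larger than the former, the library is RF/fr-firststrand stranded.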
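The three rules above can be sketched as a small function. The note only says "close to 1" and "clearly larger", so the numeric cut-offs below (`strand_cutoff=0.8`, `unstranded_tolerance=0.1`) are assumptions picked for illustration, not values taken from RSeQC. The function name is hypothetical as well.

```python
def classify_library(forward, reverse, strand_cutoff=0.8, unstranded_tolerance=0.1):
    """Classify a library from the two 'explained by' fractions.

    forward: fraction for "1++,1--,2+-,2-+" (or "++,--" for single-end)
    reverse: fraction for "1+-,1-+,2++,2--" (or "+-,-+" for single-end)
    The cut-off values are illustrative assumptions, not official thresholds.
    """
    total = forward + reverse
    if total <= 0:
        return "undetermined"
    # Share of strand-assignable reads that follow the forward pattern.
    share = forward / total
    if share >= strand_cutoff:
        return "FR/fr-secondstrand"
    if share <= 1 - strand_cutoff:
        return "RF/fr-firststrand"
    if abs(share - 0.5) <= unstranded_tolerance:
        return "unstranded"
    # Neither clearly stranded nor clearly balanced: leave it to manual review.
    return "undetermined"


print(classify_library(0.9441, 0.0487))
print(classify_library(0.0267, 0.9410))
```

Returning "undetermined" for in-between values is a deliberate choice here: a mixed result usually deserves a manual look rather than an automatic label.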
4. Tool parameters for each library type
Tool | RF/fr-firststrand stranded | FR/fr-secondstrand stranded | Unstranded |
---|---|---|---|
StringTie (parameter) | --rf | --fr | none |
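The StringTie row of the table can be expressed as a lookup, which is handy when the flag has to be filled in by a script. This is a sketch only: the dictionary, the function name `stringtie_strand_args`, and the file names `sample.bam` / `sample.gtf` are hypothetical placeholders.

```python
# Library type -> StringTie strand flag, following the table above.
STRINGTIE_STRAND_FLAG = {
    "RF/fr-firststrand": "--rf",
    "FR/fr-secondstrand": "--fr",
    "unstranded": None,  # no strand flag is passed for unstranded libraries
}


def stringtie_strand_args(library_type):
    """Return the strand flag as an argument list (empty for unstranded)."""
    if library_type not in STRINGTIE_STRAND_FLAG:
        raise ValueError(f"unknown library type: {library_type!r}")
    flag = STRINGTIE_STRAND_FLAG[library_type]
    return [flag] if flag else []


# Hypothetical input/output file names, for illustration only.
command = ["stringtie", "sample.bam", "-o", "sample.gtf"]
command += stringtie_strand_args("RF/fr-firststrand")
print(" ".join(command))
```

Returning a list rather than a string means the unstranded case adds nothing to the command instead of an empty argument.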
Interpreting the check_ss output for this experiment's data:
This is PairEnd Data
Fraction of reads failed to determine: 0.0322
Fraction of reads explained by "1++,1--,2+-,2-+": 0.0267
Fraction of reads explained by "1+-,1-+,2++,2--": 0.9410
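The "1+-,1-+,2++,2--" fraction dominates, so the library is RF/fr-firststrand stranded.
StringTie should therefore be run with the --rf parameter.
Jiayi (家益), my senior labmate, did indeed use --rf!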