RPKM vs. FPKM vs. TPM

JasonKQLin

Last modified 2024-03-27 15:52:05

First published 2019-08-09 01:00:44

Article link: https://blog.csdn.net/linkequa/article/details/98901976

1. Full names

RPKM: Reads Per Kilobase of transcript per Million mapped reads
FPKM: Fragments Per Kilobase of transcript per Million mapped reads
TPM: Transcripts Per Million

RPKM vs. FPKM

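The two are similar. RPKM is meant for single-end sequencing, where each sequenced molecule yields one read. FPKM is meant for paired-end sequencing, where Read 1 and Read 2 together count as one fragment. Strictly speaking, for paired-end data in which both mates of every fragment map, a gene's raw read count is twice its fragment count; whether RPKM then comes out at twice FPKM depends on whether the "per million" scaling factor counts reads or fragments.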

The main difference between RPKM/FPKM and TPM is the order in which sequencing depth and gene length are normalized.

Calculating RPKM:
1. Count up the total reads in a sample and divide that number by 1,000,000. This is the "per million" scaling factor.
2. Divide each gene's read count by the "per million" scaling factor. This normalizes for sequencing depth and gives reads per million (RPM).
3. Divide the RPM values by the length of the gene, in kilobases. This gives RPKM.
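
The three RPKM steps can be sketched in a few lines of plain Python. This is only an illustration: the function name `rpkm`, the toy counts, and the gene lengths are made up for the example, and gene length is assumed to be given in base pairs.

```python
def rpkm(counts, lengths_bp):
    """Return RPKM values for one sample.

    counts     -- read counts per gene (hypothetical toy input)
    lengths_bp -- gene lengths in base pairs, same order as counts
    """
    # Step 1: "per million" scaling factor from the sample's total reads.
    per_million = sum(counts) / 1_000_000
    # Step 2: normalize for sequencing depth (reads per million, RPM).
    rpm = [c / per_million for c in counts]
    # Step 3: normalize for gene length in kilobases.
    return [r / (length / 1000) for r, length in zip(rpm, lengths_bp)]


# Toy sample with three genes (made-up numbers).
counts = [10, 20, 30]
lengths_bp = [2000, 4000, 1000]
rpkm_values = rpkm(counts, lengths_bp)
print(rpkm_values)
```

For paired-end data, the same function gives FPKM if `counts` holds fragment counts instead of read counts.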

Calculating TPM:
1. Divide each gene's read count by the length of the gene in kilobases. This gives reads per kilobase (RPK).
2. Count up all the RPK values in a sample and divide this number by 1,000,000. This is the "per million" scaling factor.
3. Divide the RPK values by the "per million" scaling factor. This gives TPM.
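
The TPM steps can be sketched the same way. Again this is a minimal illustration under assumptions: the function name `tpm` and the toy numbers are hypothetical, and gene length is assumed to be in base pairs.

```python
def tpm(counts, lengths_bp):
    """Return TPM values for one sample.

    counts     -- read counts per gene (hypothetical toy input)
    lengths_bp -- gene lengths in base pairs, same order as counts
    """
    # Step 1: normalize for gene length first (reads per kilobase, RPK).
    rpk = [c / (length / 1000) for c, length in zip(counts, lengths_bp)]
    # Step 2: "per million" scaling factor from the sum of RPK values.
    per_million = sum(rpk) / 1_000_000
    # Step 3: normalize for sequencing depth.
    return [r / per_million for r in rpk]


# Toy sample with three genes (made-up numbers).
counts = [10, 20, 30]
lengths_bp = [2000, 4000, 1000]
tpm_values = tpm(counts, lengths_bp)
print(tpm_values)
print(sum(tpm_values))
```

Because the depth normalization comes last, the TPM values of a sample always add up to one million (up to floating-point rounding), so a gene's TPM can be read as its share of the sample. RPKM values do not have a fixed sum across samples.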