这是另外一题,也是cg含量,
但是是求一个fastq文件中每个read位点的cg分布情况
这个解法是逐行读取文件,比较优雅
number = {}
buffer = 200
for i in range(buffer):
kkk = i
number[kkk] = 0
with open("Test1.fastq","r") as f:
line_number = 0
while True:
line = f.readline()
line_number += 1
if line_number % 4 == 2 :
for i in range(len(line)):
kkk = i
if line[kkk] == "C" or line[kkk] == "G" :
number[kkk] = 1 +