我正在使用python创建一个程序,该程序将一组DNA序列转换为氨基酸(蛋白质)序列.然后,我需要找到一个特定的子序列,并计算存在该特定子序列的序列数.这是我到目前为止的代码:
#Open cDNA_sequences file and read in line by line
with open('cDNA_sequences.csv', 'r') as results:
for line in results:
columns = line.rstrip("
").split(",") #remove end of line characters and split commas to produce a list
ensemblID = columns[0] #ensemblID is first element in our list
dna_seq = columns[1] #dna_seq is second element in our list
genetic code = {
"UUU":"F", "UUC":"F", "UUA":"L", "UUG":"L",
"UCU":"S", "UCC":"s", "UCA":"S", "UCG":"S",
"UAU":"Y", "UAC":"Y", "UAA":"STOP", "UAG":"STOP",
"UGU&#