phyloP46way_placental: PhyloP预测结果(dbNSFP version3.0),基于46个哺乳动物物种的多序列比对得到位点的保守性分值,分值越大,位点越保守。该分值考虑的是变异位点的保守性,而非考虑该位点上的碱基,所以无论该位点上是同义突变还是非同义突变,score都相同。该分值用来寻找具有功能重要性的位点,利用这些score值,能够推断出疾病易感性位点
http://www.360doc.cn/mip/966143054.html
python3.8
代码1
with open("C:\\Users\\admin\\Desktop\\wufeizhen\\circularRNA.bed_fasta",'r') as f:
length = []
for value1 in f.readlines():
if value1.find('>') != 0:
length.append(len(value1))
print (min(length)) #返回最小值
print (max(length)) #返回最大值
代码2
with open("C:\\Users\\admin\\Desktop\\wufeizhen\\circularRNA.bed_fasta",'r') as f:
length = []
circularRNA = []
num = 0
for value1 in f.readlines():
if value1.find('>') != 0:
length.append(len(value1))
if value1.find('>') == 0:
num +=1
circularRNA.append(value1[1:17]) #打印的是下标为1到16(不包含17)的元素
print (min(length))
print (max(length))
print (num)
print (circularRNA)
with open("C:\\Users\\admin\\Desktop\\wufeizhen\\features\\circularRNA.bed_fasta_ORF",'r') as f2:
orf = []
for value2 in f2.readlines():
if value2.find('hsa') == 0:
num +=1
orf.append(value2[0:16])
print (orf)
orf_set = set(orf)
circularRNA_set = set(circularRNA)
a_alone = circularRNA_set - orf_set #计算两个列表集合之间的差集
print(len(a_alone)) #计算列表中的元素个数
print(a_alone)