Original title: How to extract gene annotation information from a GFF file
GFF3 is one of the most common formats for gene annotation files (https://archive.broadinstitute.org/annotation/argo/help/gff3.html).
Simply put, a GFF3 file is a tab-delimited text file with 9 columns, which hold the following information:
1、seqname
The name of the sequence. Typically a chromosome or a contig. Argo does not care what you put here. It will superimpose gff features on any sequence you like.
2、source
The program that generated this feature. Argo displays the value of this field in the inspector but does not do anything special with it.
3、feature
The name of this type of feature. The official GFF3 spec states that this should be a term from the SOFA ontology, but Argo does not do anything with this value except display it.
4、start
The starting position of the feature in the sequence. The first base is numbered 1.
5、end
The ending position of the feature (inclusive).
6、score