1. Download the hg38 reference sequence and annotation data from UCSC
# Download into the current directory
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/md5sum.txt #for checking
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz
# GTF annotation files
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/genes/hg38.ncbiRefSeq.gtf.gz
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/genes/hg38.refGene.gtf.gz
Other annotation data can be selected and downloaded through the UCSC Table Browser: http://genome.ucsc.edu/cgi-bin/hgTables
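The md5sum.txt file fetched above is only useful if the downloads are actually compared against it. Because that list covers more files than the ones downloaded here, running `md5sum -c md5sum.txt` directly would likely report the absent files as missing. The following is a minimal sketch of a per-file check; `verify_md5` is a hypothetical helper name, and it assumes the list uses the usual `<hash>  <filename>` layout.

```shell
# verify_md5 FILE LIST: compare FILE against its entry in an md5sum-style LIST.
# Hypothetical helper for illustration, not part of any UCSC tooling.
verify_md5() {
    file="$1"
    list="$2"
    name=$(basename "$file")
    # Find the hash whose file-name field matches exactly,
    # ignoring a leading "*" (binary-mode marker) or "./".
    expected=$(awk -v n="$name" '{
        f = $2
        sub(/^\*/, "", f)
        sub(/^\.\//, "", f)
        if (f == n) { print $1; exit }
    }' "$list")
    if [ -z "$expected" ]; then
        echo "no checksum entry for $name in $list" >&2
        return 2
    fi
    actual=$(md5sum "$file" | awk '{ print $1 }')
    if [ "$expected" = "$actual" ]; then
        echo "$name: OK"
    else
        echo "$name: FAILED" >&2
        return 1
    fi
}

# Usage, once the wget commands have finished:
#   verify_md5 hg38.fa.gz md5sum.txt
```

A non-zero return value means the file is either corrupted (1) or not listed (2), so in both cases the download should not be trusted without a further check.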
2. Download from Ensembl
File location: http://ftp.ensembl.org/pub/
# Download the GTF annotation file
wget http://ftp.ensembl.org/pub/release-104/gtf/homo_sapiens/Homo_sapiens.GRCh38.104.gtf.gz
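The GTF file name embeds the Ensembl release number (104 here), while the current_gtf directory always follows the latest release, so a fixed file name under current_gtf stops resolving once a newer release is published. A small sketch that builds the URL from the release number instead; `ensembl_gtf_url` is a hypothetical helper, and it assumes the release-N/gtf/homo_sapiens/ layout of the Ensembl FTP site and a release that uses the GRCh38 assembly.

```shell
# ensembl_gtf_url RELEASE: print the URL of the human GRCh38 GTF for one Ensembl release.
# Hypothetical helper; the directory layout is an assumption about the Ensembl FTP site.
ensembl_gtf_url() {
    release="$1"
    echo "http://ftp.ensembl.org/pub/release-${release}/gtf/homo_sapiens/Homo_sapiens.GRCh38.${release}.gtf.gz"
}

# Usage (requires network access):
#   wget "$(ensembl_gtf_url 104)"
```

Pinning the release in the path also makes the download reproducible, since the same command fetches the same annotation version later on.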
3. Download from NCBI
Select and download the relevant files from:
https://www.ncbi.nlm.nih.gov/genome/?term=homo+sapiens
References: