1. 参考基因组下载和准备
进入UCSC网站(http://genome.ucsc.edu/),Downloads >> Genome Data >> Human >> Feb. 2009 (GRCh37/hg19) >> Genome sequence files and select annotations (2bit, GTF, GC-content, etc) >> chromFa.tar.gz,右键复制链接地址如下:
http://hgdownload.soe.ucsc.edu/goldenPath/hg19/bigZips/chromFa.tar.gz
可以在windows里用下载工具快速下载,传送到服务器上,使用下面命令解压缩。
tar xvzf chromFa.tar.gz
解压出来的新文件都是同一类型:chr*.fa。
chr1.fa chr5.fa chrUn_gl000223.fa
chr10.fa chr6.fa chrUn_gl000224.fa
chr11.fa chr6_apd_hap1.fa chrUn_gl000225.fa
chr11_gl000202_random.fa chr6_cox_hap2.fa chrUn_gl000226.fa
chr12.fa chr6_dbb_hap3.fa chrUn_gl000227.fa
chr13.fa chr6_mann_hap4.fa chrUn_gl000228.fa
chr14.fa chr6_mcf_hap5.fa chrUn_gl000229.fa
chr15.fa chr6_qbl_hap6.fa chrUn_gl000230.fa
chr16.fa chr6_ssto_hap7.fa chrUn_gl000231.fa
chr17.fa chr7.fa chrUn_gl000232.fa
chr17_ctg5_hap1.fa chr7_gl000195_random.fa chrUn_gl000233.fa
chr17_gl000203_ran