使用fastq-dump下载SRA数据
环境和配置请见系列博文
1.下载:
fastq-dump -Z DRR047093
然后会显示信息:如果文件过大会有很多
可以显示制定条数
fastq-dump -X 5 -Z DRR047093文件位置:自己安装sratoolkit时配置的位置
hadoop@Mcnode1:~/cloud/adam/xubo/data/down-sratool/sra$ ll
total 7532
drwxrwxr-x 2 hadoop hadoop 4096 1月 13 17:17 ./
drwxrwxr-x 7 hadoop hadoop 4096 1月 13 16:16 ../
-rw-rw-r-- 1 hadoop hadoop 5270322 1月 13 17:17 DRR047093.fastq
-rw-rw-r-- 1 hadoop hadoop 1043468 1月 13 17:16 DRR047093.sra
-rw-rw-r-- 1 hadoop hadoop 1701387928 1月 13 17:12 SRR003161.sra.cache
2.prefetch
hadoop@Mcnode1:~/.aspera/connect/bin$ prefetch -c DRR047084
Maximum file size download limit is 20,971,520KB
2016-01-13T13:42:20 prefetch.2.5.5: 1) Downloading 'DRR047084'...
2016-01-13T13:42:20 prefetch.2.5.5: Downloading via fasp...
2016-01-13T13:42:41 prefetch.2.5.5 err: process failed while waiting process - ascp failed with 1
2016-01-13T13:43:13 prefetch.2.5.5 err: process failed while waiting process - ascp failed with 1
2016-01-13T13:43:13 prefetch.2.5.5: fasp download failed
2016-01-13T13:43:13 prefetch.2.5.5: Downloading via http...
2016-01-13T13:43:41 prefetch.2.5.5: 1) 'DRR047084' was downloaded successfully
2016-01-13T13:43:41 prefetch.2.5.5: 'DRR047084' has 0 dependencies
查看效果:成功
hadoop@Mcnode1:~/cloud/adam/xubo/data/down-sratool/sra$ ll -h
total 252M
drwxrwxr-x 2 hadoop hadoop 4.0K 1月 13 21:43 ./
drwxrwxr-x 7 hadoop hadoop 4.0K 1月 13 16:16 ../
-rw-rw-r-- 1 hadoop hadoop 877K 1月 13 21:40 DRR047083.sra
-rw-rw-r-- 1 hadoop hadoop 1007K 1月 13 21:43 DRR047084.sra
-rw-rw-r-- 1 hadoop hadoop 1.1M 1月 13 21:33 DRR047091.sra
-rw-rw-r-- 1 hadoop hadoop 1.2M 1月 13 17:22 DRR047092.sra
-rw-rw-r-- 1 hadoop hadoop 5.1M 1月 13 20:31 DRR047093_1.fastq
-rw-rw-r-- 1 hadoop hadoop 5.1M 1月 13 17:17 DRR047093.fastq
-rw-rw-r-- 1 hadoop hadoop 1020K 1月 13 17:16 DRR047093.sra
-rw-rw-r-- 1 hadoop hadoop 180K 1月 13 20:32 RAL357_1.sai
-rw-rw-r-- 1 hadoop hadoop 5.2M 1月 13 20:36 RAL357_1.sam
-rw-rw-r-- 1 hadoop hadoop 271M 1月 13 21:28 SRR002664.sra.cache
-rw-rw-r-- 1 hadoop hadoop 1.6G 1月 13 20:26 SRR003161.sra.cache
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:19 SRR003162.sra.lock
-rw-rw-r-- 1 hadoop hadoop 15M 1月 13 21:34 SRR003162.sra.tmp.98592.tmp
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:43 SRR1482462.sra.lock
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:40 --user=anonftp
链接到的下载地址是:http://sra-download.ncbi.nlm.nih.gov/srapub/SRR003162
文件大概1.6G
之前运行这两个语句都不行,不知道是不是网络的原因??
3.prefetch -v
hadoop@Mcnode1:~/.aspera/connect/bin$ prefetch -v DRR047083
Maximum file size download limit is 20,971,520KB
2016-01-13T13:38:40 prefetch.2.5.5: Using 'ascp'
2016-01-13T13:38:40 prefetch.2.5.5: Using 'ascp'
2016-01-13T13:38:40 prefetch.2.5.5: Using '/home/hadoop/.aspera/connect/bin/ascp'
2016-01-13T13:39:00 prefetch.2.5.5: 1) Downloading 'DRR047083'...
2016-01-13T13:39:00 prefetch.2.5.5: Downloading via fasp...
/home/hadoop/.aspera/connect/bin/ascp /home/hadoop/.aspera/connect/bin/ascp -i /home/hadoop/.aspera/connect/etc/asperaweb_id_dsa.openssh -pQTk1 -l 1000m dbtest@sra-download.ncbi.nlm.nih.gov:data/sracloud/srapub/DRR047083 /home/hadoop/cloud/adam/xubo/data/down-sratool/sra/DRR047083.sra.tmp.96547.tmp
2016-01-13T13:39:15 prefetch.2.5.5 err: process failed while waiting process - ascp failed with 1
/home/hadoop/.aspera/connect/bin/ascp /home/hadoop/.aspera/connect/bin/ascp -i /home/hadoop/.aspera/connect/etc/asperaweb_id_dsa.openssh -pQTk1 -l 1000m dbtest@sra-download.ncbi.nlm.nih.gov:data/sracloud/srapub/DRR047083 /home/hadoop/cloud/adam/xubo/data/down-sratool/sra/DRR047083.sra.tmp.96547.tmp
2016-01-13T13:39:30 prefetch.2.5.5 err: process failed while waiting process - ascp failed with 1
2016-01-13T13:39:30 prefetch.2.5.5: fasp download failed
2016-01-13T13:39:30 prefetch.2.5.5: Downloading via http...
2016-01-13T13:40:06 prefetch.2.5.5: http://sra-download.ncbi.nlm.nih.gov/srapub/DRR047083 -> /home/hadoop/cloud/adam/xubo/data/down-sratool/sra/DRR047083.sra.tmp.96547.tmp
2016-01-13T13:40:10 prefetch.2.5.5: /home/hadoop/cloud/adam/xubo/data/down-sratool/sra/DRR047083.sra.tmp.96547.tmp (897071)
2016-01-13T13:40:10 prefetch.2.5.5: 1) 'DRR047083' was downloaded successfully
2016-01-13T13:40:10 prefetch.2.5.5: 'DRR047083' has 0 unresolved dependencies
2016-01-13T13:40:10 prefetch.2.5.5: 'DRR047083' is not cSRA
成功:
hadoop@Mcnode1:~/cloud/adam/xubo/data/down-sratool/sra$ ll -h
total 251M
drwxrwxr-x 2 hadoop hadoop 4.0K 1月 13 21:42 ./
drwxrwxr-x 7 hadoop hadoop 4.0K 1月 13 16:16 ../
-rw-rw-r-- 1 hadoop hadoop 877K 1月 13 21:40 DRR047083.sra
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:42 DRR047084.sra.lock
-rw-rw-r-- 1 hadoop hadoop 1.1M 1月 13 21:33 DRR047091.sra
-rw-rw-r-- 1 hadoop hadoop 1.2M 1月 13 17:22 DRR047092.sra
-rw-rw-r-- 1 hadoop hadoop 5.1M 1月 13 20:31 DRR047093_1.fastq
-rw-rw-r-- 1 hadoop hadoop 5.1M 1月 13 17:17 DRR047093.fastq
-rw-rw-r-- 1 hadoop hadoop 1020K 1月 13 17:16 DRR047093.sra
-rw-rw-r-- 1 hadoop hadoop 180K 1月 13 20:32 RAL357_1.sai
-rw-rw-r-- 1 hadoop hadoop 5.2M 1月 13 20:36 RAL357_1.sam
-rw-rw-r-- 1 hadoop hadoop 271M 1月 13 21:28 SRR002664.sra.cache
-rw-rw-r-- 1 hadoop hadoop 1.6G 1月 13 20:26 SRR003161.sra.cache
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:19 SRR003162.sra.lock
-rw-rw-r-- 1 hadoop hadoop 15M 1月 13 21:34 SRR003162.sra.tmp.98592.tmp
-rw-rw-r-- 1 hadoop hadoop 0 1月 13 21:40 --user=anonftp
本文详细介绍了如何使用fastq-dump命令下载SRA数据,并通过prefetch命令进行预取,包括文件下载过程及检查下载效果。
2612

被折叠的 条评论
为什么被折叠?



