今天第一次尝试处理ATAC_seq数据,希望能尽快做完吧。
先放个找好的参考文章:ATAC-seq/ChIP-seq分析方法
1.建立相应目录
对新数据建立对应实验人员(zhaoyingying)、测序类型(ATAC_seq)和日期(2021_05_03)的目录。
# 建立后如下:
(base) zexing@DNA:~/projects/zhaoyingying/ATAC_seq/2021_05_03$
# 新建对应的目录
mkdir raw_data clean_data bam bam_bw bam_sort sam macs2_bdgdiff macs2_callpeak matrix_reference_point matrix_scale_regions fastqc_report MD5_txt scripts_log
2.检查数据完整性
(base) zexing@DNA:~/projects/zhaoyingying/ATAC_seq/AJV5-ATAC_FKDL210049869-1a$ cat MD5_AJV5_FKDL210049869-1a.txt > check_md5sum_AJV5_FKDL210049869-1a.txt && md5sum -c check_md5sum_AJV5_FKDL210049869-1a.txt
AJV5_FKDL210049869-1a_1.clean.fq.gz: OK
AJV5_FKDL210049869-1a_2.clean.fq.gz: OK
(base) zexing@DNA:~/projects/zhaoyingying/ATAC_seq/AJV93-ATAC_FKDL210049870-1a$ cat MD5_AJV93_FKDL210049870-1a.txt > check_MD5_AJV93_FKDL210049870-1a.txt && md5sum -c check_MD5_AJV93_FKDL210049870-1a.txt
AJV93_FKDL210049870-1a_1.clean.fq.gz: OK
AJV93_FKDL210049870-1a_2.clean.fq.gz: OK
(base) zexing@DNA:~/projects/zhaoyingying/ATAC_seq/JV84-ATAC_FKDL210049867-1a$ cat MD5_JV84_FKDL210049867-1a.txt > check_MD5_JV84_FKDL210049867-1a.txt && md5sum -c check_MD5_JV84_FKDL210049867-1a.txt
JV84_FKDL210049867-1a_1.clean.fq.gz: OK
JV84_FKDL210049867-1a_2.clean.fq.gz: OK
(base) zexing@DNA:~/projects/zhaoyingying/ATAC_seq/JV85-ATAC_FKDL210049868-1a$ cat MD5_JV85_FKDL210049868-1a.txt > check_MD5_JV85_FKDL210049868-1a.txt && md5sum -c check_MD5_JV85_FKDL210049868-1a.txt
JV85_FKDL210049868-1a_1.clean.fq.gz: OK
JV85_FKDL210049868-1a_2.clean.fq.gz: OK