Downloading GTF and related annotation files

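  1. Download the GTF file
    The file can be downloaded directly from the URL below (mouse is used as the example). To find other species or releases, browse up through the parent directories of this URL.
https://ftp.ensembl.org/pub/release-109/gtf/mus_musculus/Mus_musculus.GRCm39.109.gtf.gz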
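Once the GTF is downloaded, a common next step is to pull the gene records out as BED. The following is only a minimal sketch: the two-gene `sample.gtf` and the output name `sample.genes.bed` are made up for illustration, and the real file would be streamed in with `zcat Mus_musculus.GRCm39.109.gtf.gz` instead. Note that GTF coordinates are 1-based and inclusive while BED is 0-based and half-open, so the start is shifted by one.

```shell
# Made-up GTF records (tab-separated) standing in for the real file.
# Ensembl GTFs name chromosomes without the "chr" prefix.
printf '1\tensembl\tgene\t1000\t2000\t.\t+\t.\tgene_id "G1"; gene_name "Abc";\n' > sample.gtf
printf '1\tensembl\texon\t1000\t1200\t.\t+\t.\tgene_id "G1"; gene_name "Abc";\n' >> sample.gtf
printf '2\tensembl\tgene\t500\t900\t.\t-\t.\tgene_id "G2"; gene_name "Xyz";\n' >> sample.gtf

# Keep only "gene" features and write chrom, start-1, end, gene_id, ".", strand.
# Header lines starting with "#" have no third field, so they are skipped.
awk -F'\t' -v OFS='\t' '$3 == "gene" {
    match($9, /gene_id "[^"]+"/)
    id = substr($9, RSTART + 9, RLENGTH - 10)   # strip the gene_id " prefix and the closing quote
    print $1, $4 - 1, $5, id, ".", $7
}' sample.gtf > sample.genes.bed

cat sample.genes.bed
```

If the downstream tool expects UCSC-style names, the chromosome column would also need a `chr` prefix added.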
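  2. Download the tss.bed file
hg38
# Download refFlat.txt.gz from UCSC
wget http://hgdownload.soe.ucsc.edu/goldenPath/hg38/database/refFlat.txt.gz
# Convert to BED columns: chrom, txStart, txEnd, geneName, strand (the interval is the full transcript span)
zcat refFlat.txt.gz | awk '{print $3"\t"$5"\t"$6"\t"$1"\t"$4}' > hg38.tss.bed
# Keep only records on assembled chromosomes (chr1-22, chrX, chrY); drops random, unplaced and alt contigs
# The match is anchored on column 1 and avoids {1,2}, which some awk builds do not support
awk '$1 ~ /^chr([0-9]+|X|Y)$/' /Users/guoyin/hg38.tss.bed > /Users/guoyin/hg38_filter.tss.bed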

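hg19
# Download refFlat.txt.gz from UCSC
wget http://hgdownload.soe.ucsc.edu/goldenPath/hg19/database/refFlat.txt.gz
# Convert to BED columns: chrom, txStart, txEnd, geneName, strand (the interval is the full transcript span)
zcat refFlat.txt.gz | awk '{print $3"\t"$5"\t"$6"\t"$1"\t"$4}' > hg19.tss.bed
# Keep only records on assembled chromosomes (chr1-22, chrX, chrY)
awk '$1 ~ /^chr([0-9]+|X|Y)$/' /Users/guoyin/hg19.tss.bed > /Users/guoyin/hg19_filter.tss.bed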
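The BED files produced above carry txStart and txEnd, that is, the whole transcript. If a tool needs a single-base TSS instead, it can be derived from the strand: txStart for `+` transcripts and the last base before txEnd for `-` transcripts. This is a sketch under that assumption, not part of the original workflow; the two refFlat rows and the file names `sample.refFlat.txt` and `sample.tss1bp.bed` are invented for the example.

```shell
# Made-up refFlat rows: geneName, name, chrom, strand, txStart, txEnd, cdsStart, cdsEnd, exonCount, exonStarts, exonEnds
printf 'GENEA\tNM_0001\tchr1\t+\t1000\t5000\t1100\t4900\t1\t1000,\t5000,\n' > sample.refFlat.txt
printf 'GENEB\tNM_0002\tchr2\t-\t2000\t8000\t2100\t7900\t1\t2000,\t8000,\n' >> sample.refFlat.txt

# refFlat coordinates are 0-based half-open, the same convention as BED.
# "+" strand: TSS is the first base, [txStart, txStart+1)
# "-" strand: TSS is the last base,  [txEnd-1, txEnd)
awk -F'\t' -v OFS='\t' '{
    if ($4 == "+") { tss_start = $5;     tss_end = $5 + 1 }
    else           { tss_start = $6 - 1; tss_end = $6     }
    print $3, tss_start, tss_end, $1, ".", $4
}' sample.refFlat.txt > sample.tss1bp.bed

cat sample.tss1bp.bed
```

On the real data the same awk program would read from `zcat refFlat.txt.gz`.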
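  3. Convert hg38 coordinates to hg19 coordinates
    Download page: https://hgdownload.soe.ucsc.edu/downloads.html#human
    On that page, go to Other downloads, then Utilities.
# First cd into the directory that contains atac_fragments.tsv.gz
# Take the first 1000 lines (gzcat is the macOS command; use zcat on Linux)
gzcat atac_fragments.tsv.gz | head -1000 > atac_hg38.bed
head -10 atac_hg38.bed
# Download the hg38-to-hg19 chain file and take a look at it
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/liftOver/hg38ToHg19.over.chain.gz
gzcat hg38ToHg19.over.chain.gz | less
# Download the liftOver binary for your platform (uncomment the matching line)
# wget http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/liftOver
# wget https://hgdownload.soe.ucsc.edu/admin/exe/macOSX.x86_64/liftOver
chmod +x liftOver
# Arguments: input BED, chain file, converted output, records that could not be converted
./liftOver atac_hg38.bed hg38ToHg19.over.chain.gz atac_hg19_translated.bed atac_hg19_lost.bed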
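After liftOver finishes, it is worth checking how many records were converted and how many were dropped. The sketch below counts both. It assumes, to the best of my understanding of the liftOver output, that the unmapped file puts a `#` reason line (for example `#Deleted in new`) before each failed record. The `demo_*` files are fabricated stand-ins; on real output, point the two `grep` calls at `atac_hg19_translated.bed` and `atac_hg19_lost.bed`.

```shell
# Fabricated stand-ins for the two liftOver output files.
printf 'chr1\t100\t200\tAAAC-1\t2\nchr1\t300\t400\tAAAG-1\t1\nchr2\t50\t90\tAAAT-1\t1\n' > demo_translated.bed
printf '#Deleted in new\nchr3\t10\t20\tAAAA-1\t1\n' > demo_lost.bed

# Count data lines only; lines starting with "#" are reason comments, not records.
mapped=$(grep -vc '^#' demo_translated.bed)
lost=$(grep -vc '^#' demo_lost.bed)
total=$((mapped + lost))

echo "mapped=$mapped lost=$lost total=$total" > demo_summary.txt
cat demo_summary.txt
```

A large lost fraction usually means the input was not on the assembly the chain file expects.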
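  4. Create the fragments.tsv.gz.tbi index for fragments.tsv.gz
# If the converted file is gzip-compressed, decompress it first
# gzip -d atac_hg19_translated.bed.gz 
# Sort by chromosome, then start, then end (tabix requires position-sorted input)
sort -k1,1 -k2,2n -k3,3n atac_hg19_translated.bed > atac_hg19_translated.sorted.bed
# Compress with bgzip (tabix cannot index plain gzip output)
bgzip atac_hg19_translated.sorted.bed
# Build the index; this writes atac_hg19_translated.sorted.bed.gz.tbi
tabix -p bed atac_hg19_translated.sorted.bed.gz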
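tabix refuses input that is not position-sorted, so it can help to confirm the sort order before compressing. This is a small sketch using `sort -c`, which only checks the order and writes nothing; the three-line `demo_unsorted.bed` is made up, and `LC_ALL=C` is my own addition to keep the chromosome ordering independent of the locale.

```shell
# Made-up, deliberately unsorted BED lines.
printf 'chr2\t50\t90\nchr1\t300\t400\nchr1\t100\t200\n' > demo_unsorted.bed

# Same keys as in the step above: chromosome, then numeric start, then numeric end.
LC_ALL=C sort -k1,1 -k2,2n -k3,3n demo_unsorted.bed > demo_sorted.bed

# sort -c exits 0 when the file is already in the requested order.
if LC_ALL=C sort -c -k1,1 -k2,2n -k3,3n demo_sorted.bed; then
    echo sorted > demo_check.txt
else
    echo unsorted > demo_check.txt
fi
cat demo_check.txt
```

Once the real index exists, a region query such as `tabix atac_hg19_translated.sorted.bed.gz chr1:1-100000` is a quick way to confirm it works.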