minimap2和yacrd 去16S nanopore的嵌合体 【学习笔记】

github官网

在这里插入图片描述

WARNING:
Minimap2 v2.19 introduce some change in selection of seed and chaining of this seed to generate overlap. This change could have an important impacte on yacrd behavior, I can’t guarantee the quality of yacrd results with this version of minimap2, I recommend you not to use a minimap2 version higher than 2.18.

在这里插入图片描述

cd到安装的目录,make安装软件,要打开软件cd到安装目录,在安装目录才可以操作

cd xxx/xxx/minimap2.18/     【打开软件方式】
./minimap2 -x ava-ont reads.fa reads.fa > overlaps.paf   【运行软件方式】
minimap2 -x {corresponding preset} reads.fq reads.fq > mapping.paf
yacrd -i mapping.paf -o reads.yacrd filter -i reads.fasta -o reads.filter.fasta
yacrd -i mapping.paf -o reads.yacrd extract -i reads.fasta -o reads.extract.fasta
yacrd -i mapping.paf -o reads.yacrd split -i reads.fasta -o reads.split.fasta
yacrd -i mapping.paf -o reads.yacrd scrubb -i reads.fasta -o reads.scrubb.fasta

在这里插入图片描述

minimap2 -x ava-pb -g 5000 reads.fasta reads.fasta > overlap.paf
yacrd -i overlap.paf -o report.yacrd -c 4 -n 0.4 scrubb -i reads.fasta -o reads.scrubb.fasta
$yacrd -h
yacrd 0.6.2 Ivysaur
Pierre Marijon <pmarijon@mpi-inf.mpg.de>

Yacrd use overlap between reads, to detect 'good' and 'bad' region,
a region with coverage over the threshold is 'good' others are 'bad'.
If read has a 'bad' region in middle this reads is mark as 'Chimeric'.
If the ratio of 'bad' region length on total read length is larger than threshold this reads is mark
as 'Not_covered'.

Yacrd can make some other actions:
- filter: for sequence or overlap file, record with reads marked as Chimeric or NotCovered isn't
written in the output
- extract: for sequence or overlap file, record contains reads marked as Chimeric or NotCovered is
written in the output
- split: for sequence file bad region in the middle of reads are removed, NotCovered read is removed
- scrubb: for sequence file all bad region are removed, NotCovered read is removed

USAGE:
    yacrd [OPTIONS] --input <input> --output <output> [SUBCOMMAND]

FLAGS:
    -h, --help       Prints help information
    -V, --version    Prints version information

OPTIONS:
    -c, --coverage <coverage>
            if coverage reach this value region is mark as bad [default: 0]

    -i, --input <input>
            path to input file overlap (.paf|.m4|.mhap) or yacrd report (.yacrd), format is
            autodetect and compression input is allowed (gz|bzip2|lzma)

    -n, --not-coverage <not-coverage>
            if the ratio of bad region length on total length is lower than this value, read is
            marked as NotCovered [default: 0.8]

    -d, --ondisk <ondisk>
            yacrd switches to 'ondisk' mode which will reduce memory usage but increase computation
            time. The value passed as a parameter is used as a prefix for the temporary files
            created by yacrd. Be careful if the prefix contains path separators (`/` for unix or `\`
            for windows) this folder will be deleted

        --ondisk-buffer-size <ondisk-buffer-size>
            with the default value yacrd in 'ondisk' mode use around 1 GBytes, you can increase to
            reduce runtime but increase memory usage [default: 64000000]

    -o, --output <output>                            path output file

SUBCOMMANDS:
    extract    Record mark as chimeric or NotCovered is extract
    filter     Record mark as chimeric or NotCovered is filter
    help       Prints this message or the help of the given subcommand(s)
    scrubb     All bad region of read is removed
    split      Record mark as chimeric or NotCovered is split
  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值