MEGAHIT
MEGAHIT is a single node assembler for large and complex metagenomics NGS reads, such as soil. Compare to SOAPdenovo, it generates longer contigs and consumes less memory.
https://github.com/voutcn/megahit
在了解组装的软件前,先了解几个概念:
组装:将基因测序测得的短序列拼接连续完整的长序列(what);因为目前二代测序的序列读长比较短最长只有300bp,长序列能提高物种注释分析的准确性(why)
contig/scaffoldN50:将contig/scaffold长度从长到短进行排序并累加,当累加和达到contig/scaffold总长度的50%的时候,最后参与加和的那一条contig/scaffold长度即为contig/ scaffoldN50的长度。一般来说,contig/scaffoldN50越长,表示组装结果越好
组装的软件:
SOAPdenovo:中复杂度