Overview of PacBio SMRT sequencing: principles, workflow, and applications

Overview of PacBio SMRT sequencing: principles, workflow, and applications

PacBio's SMRT (single molecule real time) sequencing is one of the most commonly used third-generation sequencing technologies. Compared with the previous two generations, PacBio long-read sequencing enabled by SMRT Sequencing technology requires no PCR amplification and the read length is 100 times longer than that of NGS.

PacBio SMRT sequencing applications

PacBio SMRT sequencing can be used for genomic de novo sequencing to get high quality genome sequences, obtaining full transcriptome information and detecting alternative splicing isoforms, diverse mutations in target regions, and epigenetic modifications and more.

The principle of PacBio SMRT sequencing

Zero-mode waveguides (ZMWs), subwavelength optical nanostructures fabricated in a thin metallic film, are powerful analytical tools that are capable of confining an excitation volume to the range of attoliters, which allows individual molecules to be isolated for optical analysis at physiologically relevant concentrations of fluorescently labeled biomolecules. Arrays of such nanostructures can also be engineered into systems for real-time analysis of a mass of single-molecule reactions or binding events, which is the principle of PacBio SMRT sequencing.

1-s2
Figure 2. A single SMRT Cell. Each SMRT Cell contains 150,000 ZMWs. Approximately 35,000-75,000 of these wells produce a read in a run lasting 0.5-4 h, resulting in 0.5-1 Gb of sequence.

PacBio SMRT Sequencing uses the innovation of ZMW to distinguish the ideal fluorescent signal from the strong fluorescent backgrounds caused by unincorporated free-floating nucleotides. The binding of a DNA polymerase and the template DNA strand is anchored to the bottom glass surface of a ZMW. Laser light travels through the bottom surface of a ZMW and not completely penetrates it, since the ZMW dimensions are smaller than the wavelength of the light. Therefore, it allows selective excitation and identification of light emitted from nucleotides recruited for base elongation.

Library construction

The workflow for library construction involves the following steps:

  • Determine the quality of genomic DNA (gDNA)
  • Shear gDNA using a g-TUBE (Covaris)
  • Select size and adjust concentration
  • Repair DNA damage and ends of fragmented DNA
  • Conduct DNA purification
  • Blunt-end ligation using blunt adapters
  • Purify template for submission to a sequencer

The template, called a SMRTbell, is a closed single-stranded circular DNA, which is created by ligating hairpin adapters to both ends of target double-stranded DNA (dsDNA) molecules.

temolate preparation
Figure 1. Template Preparation Workflow for PacBio RS II system.

Sequencing

As in Figure 3, a SMRTbell (grey) diffuses into a ZMW, and the adaptor binds to a polymerase immobilized at the bottom. Four types of nucleotides are labeled with a different fluorescent dye (indicated in red, yellow, green, and blue, respectively for G, C, T, and A) so that they have distinct emission spectrums. As a nucleotide is held in the detection volume by the polymerase, a light pulse that identifies the base is produced. (1) A fluorescently-labeled nucleotide binds to the template in the active site of the polymerase. (2) The fluorescence output of the color corresponding to the incorporated base (yellow for base C as an example shows here) is elevated. (3) The dye linker-pyrophosphate product is cleaved from the nucleotide and diffuses out of the ZMW to end the fluorescence pulse. (4) The polymerase is translocated to the next position. (5) The next nucleotide binds to the template in the active site of the polymerase and initiates the next fluorescence pulse, which corresponds to base A here.

sequencing
Figure 3. Sequencing via light pulses.

Bioinformatics Analysis

Bioinformatics analysis, such as de novo assembly, reference genome mapping, genome annotation (pathogenic and susceptibility genes prediction, non-coding RNA prediction, CRISPRs prediction), gene function annotation (COG/ GO/ KEGG), SNP/InDel identification and comparative genomics analysis, evolutionary analysis and estimation of divergence time are viable.

A comparison of RS II and Sequel sequencing platform

Third-generation sequencing has been widely used in genome research since the successful launch of commercial sequencing instrument PacBio RS II in 2013. After continuous improvement and upgrading, PacBio launched its new and upgraded third-generation sequencer PacBio Sequel sequencing system in October 2015. A comparison of RS II and Sequel sequencing platform is outlined below.

Table 1. The comparison of RS II and Sequel sequencing platform

 RS IISequel
Average read length10~15kb8~12kb
ZMWs150,0001,000,000
Data size/SMRT Cell500Mb~1Gb5~10Gb
SMRT Cell No./Run1~161~16
Run time/SMRT Cell0.5~6 hours0.5~6 hours
Multiplex Amplicons3841536

Sequel platform has great advantages over RS II platform, since it enables higher-throughput sequencing within a shorter timeline and at a lower cost.

Features of PacBio SMRT sequencing

  • Single-molecule resolution

PacBio SMRT sequencing requires no PCR amplification, can easily cover high-GC and high-repeat regions, and is more accurate in quantifying low-frequency mutation.

  • Long reads

PacBio SMRT sequencing provides very long reads. Average read length is 8-15kb and up to 40-70kb.

  • Speediness

PacBio SMRT sequencing is time-effective at the rate of 10 nt per second.

  • High accuracy

The rapid sequencing has also brought about some obvious drawbacks. For example, the relatively high error rate of PacBio SMRT sequencing (which is almost a common fault of current single-molecule sequencing technology) can reach 10%-15%. But unlike next-generation sequencing, the errors are random without bias. Therefore, the base deviation can be effectively corrected through multiple sequencing, and the consensus accuracy of PacBio SMRT sequencing can be greater than 99.999% (Q50).

  • Direct identification of base modification

The base modifications can be directly detected when the genome is sequenced.
CD Genomics can provide integrated PacBio SMRT sequencing services, including long-read metagenomic sequencingbacterial whole genome de novo Sequencingfungal whole genome de novo sequencingfull-Length transcripts sequencing (Iso-Seq)human whole genome PacBio SMRT sequencing, and full-Length 16S/18S/ITS amplicon sequencing. If you are interested in our services, please feel free to contact us.

References:

  1. Kong, N., Ng, W., Thao, K., Agulto, R., Weis, A., & Kim, K. S., et al. (2017) ‘Automation of pacbio smrtbell ngs library preparation for bacterial genome sequencing’, Standards in Genomic Sciences, 12(1), 27.
  2. Rhoads, A., & Au, K. F. (2015) ‘Pacbio sequencing and its applications’, Genomics,Proteomics & Bioinformatics, 13(5), 278-289.
  3. PacBio's website.

* For Research Use Only. Not for use in diagnostic procedures.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 1
    评论
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

wangchuang2017

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值