Since the Fall of 2016, when the UMGC acquired a PacBio Sequel instrument, we have been providing long-read sequencing to our clients. We currently offer the following PacBio sequencing applications:
de novo genome assembly –Our standard library prep offers an approximately 20 kb, size-selected library from high-quality genomic DNA. For particularly large or difficult genomes, we can work with you to develop ultra-long insert libraries that can maximize genome contiguity.
Genome re-sequencing –Re-sequencing with long PacBio reads can provide information that short-read sequencing technologies have missed. As little as 5X or 10X genome coverage of long reads can recover information on structural variants, including insertions, deletions, re-arrangements.
Multiplexed microbial sequencing –Recent improvements in protocols and tools for microbial sequencing on the Sequel platform have enabled increases in efficiency and reductions in cost. We can generally combine microbial genomes of approximately 5Mb in an 8-plex to produce single-digit contig assemblies.
RNA isoform sequencing – Isoform sequencing on the PacBio platform, termed IsoSeq, generates full-length, highly accurate consensus isoforms from high quality RNA. Recent experience with IsoSeq libraries shows we can expect >250,000 full-length reads per SMRT cell, generating 10,000-30,000 high quality transcripts.
de novo transcriptome sequencing and genome annotation – Sharing the same underlying methodology as IsoSeq, full-length transcript sequencing on the PacBio platform can serve as an invaluable resource for genome annotation projects, adding confidence to gene models and illuminating alternative transcripts.
Targeted sequencing –We are a PacBio focus site for targeted sequencing applications, especially target capture. We can work with you to plan, design, and execute target capture of long libraries (up to 10 kb) using probes from IDT, Nimblegen, or Agilent.
Epigenetic modifications –PacBio sequencing can natively detect certain types of epigenetic modifications in DNA with no additional modifications to sample preparation.
Long amplicon sequencing –A maximum of 96 amplicons up to 10 kb in size can be barcoded separately and multiplexed on a single SMRT cell. Expect around 200,000 reads-300,000 reads per SMRTCell using this method after demultiplexing.
Whereas the Illumina platform detects unique fluorescence from clonally amplified DNA templates, the PacBio Sequel utilizes Single Molecule Real-Time (SMRT) sequencing to provide a direct real-time readout of each base from single DNA molecules. The SMRT technology uses one million zero-mode waveguides (ZMW) - microwells bound with a DNA polymerase - to capture signal from the incorporation of fluorescent-labeled nucleotides during DNA replication of SMRTbell templates. The detection system records a movie of light pulses that are converted to base calls to create a continuous long read from each ZMW.
Compared to the Illumina platform, that generates many millions of uniform-length short reads, the Sequel provides lower output in terms of read number, but yields a distribution of read lengths averaging tens of kb in length, with some reads ≥ 50 kb.
Additional information about the SMRT cell technology can be found on the PacBio website.
Data output on the Sequel is highly dependent on the type of library and data is also more variable compared to other platforms. Given this, we do not guarantee any specific data output per SMRT Cell, but with the SMRT Cell 1M v2, we generally expect mean polymerase read lengths >10kb, with >250,000 reads under proper loading. This would result in total data output at about 4-6 Gb. For projects running multiple SMRT Cells, the loading concentration can often be titrated to achieve higher per-cell output.
PacBio libraries are prepared with hairpin loops on each end that results in a molecule that is structurally linear, but topologically circular. For moderately-sized libraries, multiple reads can be generated for a single template as the polymerase loops around the hairpin for a subsequent pass to give a high-quality consensus sequence.
The UMGC provides high quality template preparation starting with rigorous sample QC to ensure only the highest quality DNA is used in SMRTbell creation. Since the SMRT technology sequences intact single DNA molecules, the quality and quantity of the DNA submitted for SMRTbell template preparation has direct influence on the length and quality of sequencing performance.
For projects looking for the longest possible reads, our PacBio service can also provide high-pass size-selection using the PippinHT. When a variety of size-ranges are of interest, the Sage ELF can be used to fractionate a single sample into multiple size bins, which are then sequenced on separate SMRT Cells.
Visit the guidelines tab for more details on sample requirements.
To ensure our clients are getting the highest quality data and the longest reads possible, we undergo annual standardized PacBio certification, and also attend and speak at national PacBio User Group Meetings to stay on top of application updates and to exchange workflow best practices. Please contact us at email@example.com for comprehensive support on experimental design.
|SMRTbell library creation.
Includes fragmentation and Sage ELF gel cut
|cDNA library creation for Iso-Seq.||1-any||sample||inquire||inquire|
|Sequel SMRT Cell 1M v2.
Researchers can download the appropriate submission form for submitting samples. Once the form has been filled, please complete the submission by emailing firstname.lastname@example.org.
1-202 Nils Hasselmo Hall (Minneapolis campus)
1-210 Cancer & Cardiovascular Research Building (Minneapolis campus)
20 Snyder Hall (St. Paul campus)
Please send the tracking information to email@example.com.
University of Minnesota Genomics Center
1475 Gortner Ave.
28 Snyder Hall
St. Paul, MN 55108
There are three options for transferring data from the UMGC to clients: delivery to the Minnesota Supercomputing Institute’s (MSI) high-performance file system, download from a secure website, or shipment on an external hard drive. Please indicate your data delivery preference when placing a order for sequencing.
Internal clients and external clients that opt-in to MSI storage have their data released to the client’s MSI group directory on the MSI high-performance filesystem (/home/GROUP/data_release/umgc), from where it can be analyzed using MSI’s high-performance computers or Galaxy system. The data is stored for five years in uncompressed format with read-only permissions to avoid accidental modification, and secure off-site tape backups of the data are maintained for disaster recovery. MSI will charge the client a storage fee for this service.
Internal clients that opt-out of MSI storage and external clients can download their data from a secure website using either a web browser or a command-line download tool, complete instructions are provided in an email from the UMGC. The client’s data is available for download for 30 days, after which the data will be removed from the data download website and the client takes responsibility for storing the data.
External clients may have data shipped on a hard drive purchased by the UMGC and invoiced to the client.
MSI guarantees storage of client data for five years, after which the data can be transferred to tape for long-term storage, contact MSI (firstname.lastname@example.org) for more information.
Data is typically recoverable for up to 5 years. If the external researcher should ever need the FASTQ files re-posted to the data download website or shipped on a hard drive within 5 years from data release, email email@example.com to initiate data recovery. The client will be charged for the cost of recovering the data from tape archive. The UMGC does not provide any guarantee that data can be successfully recovered from tape archive.
Yes we will accept client-made libraries for PacBio. Please contact Ben Auch for details at firstname.lastname@example.org.
University of Minnesota Genomics Center
1475 Gortner Ave.
28 Snyder Hall
St. Paul, MN 55108
The UMGC has teamed up with PacBio on a SMRT Grant to explore structural variation in humans. As the host site for the SMRT Grant, our experienced long-read sequencing group will provide free high-quality template preparation; low-coverage, whole genome SMRT sequencing; and bioinformatic support for structural variant analysis to one exceptional research proposal.
The SMRT Grant program connects research projects that would benefit from single molecule sequencing with a chance to win comprehensive project support and sequencing from a skilled, certified PacBio service provider. The UMGC is pleased to be selected as the site for the 2018 Structural Variation Grant.
The increased ability to identify structural variants that have been difficult to detect with short-read sequencing is a key benefit to using low-coverage, long-read PacBio sequencing.
With improved sequencing over longer and more complex regions of the human genome, researchers will be able to discover novel disease genes, identify causative mutations, and explain the basis of complex disease.
One SMRT grant will be awarded to an oustanding proposal, and the award will consist of:
Monday, October 15th, 2018 (9:00 AM PST): Competition opens.
Friday, November 30th, 2018 (5:00 PM PST): Competition closes.
To win free SMRT sequencing, submit a 250-word proposal to the PacBio web-site describing how your human genome project would benefit from low-coverage, whole genome sequencing. One winner will be selected from all qualified submissions by PacBio and UMGC scientists. To submit your proposal by Friday, Novemeber 30th, 2018 and to read the official rules (including eligibility), see the PacBio web-site.