Library pooling and quality control for Illumina sequencing
Katherine Smollett, Lily Tong, Jenna Nichols, Kirsty Kwok, Kyriaki Nomikou, Ma. Jowina Galarion, Daniel Mair, Ana Filipe
Disclaimer
Abstract
The ability to multiplex many samples on the same run makes Illumina sequencing a powerful and affordable tool for many researchers. When pooling samples for sequencing it is important that they are pooled at equal molarity to prevent over or under representation of any individual sample in the same run. Care also has to be taken in deciding how many samples can be combined to achieve the required sequencing depth, a factor that is further complicated in viral sequencing where the majority of the reads may originate from the host and only a small proportion corresponding to viral reads.
Also of major importance is the final pool quality control (QC) to ensure that all contaminating adapter dimers are removed, generate accurate size distribution and quantification of the final pool. Accurate QC ensures accurate loading on the sequencer, overestimation of the pool molarity can cause under clustering resulting in fewer reads and can even cause run failure. Underestimation of pool molarity causes over clustering, lowering the read quality and risking run failure.
Adapter dimers are small fragments containing full length adapter sequences which can bind to the flow cell and cluster. Due to their small size, they cluster more efficiently than the longer library fragments and so reduce the library-specific read depth as well as causing over clustering which reduces the data quality possible run failure. In addition free adapter can be incorporated into the library clusters resulting in index hopping and incorrect assignment of library barcodes.
Here we describe our standard workflow for determining the appropriate sequencing depth and pooling by equal molarity of Illumina sequencing libraries, along with our procedure for pool QC.
Before start
It is recommended that individual libraries are cleaned up and QC'd following the protocol Library clean up and quality control for Illumina sequencing. Each Illumina library should have an accurate size and quantification and be free from contaminants.
Steps
Pooling
Before pooling the samples determine the number of samples per pool. The estimated reads per sample will depend on the required depth, genome length and expected percentage of on-target reads. The number of samples per pool will also depend on the type of Illumina sequencer/cartridge used and the read length.
You can use the attached calculator to determine the samples/cartridge and how many pools are required for your experiment.
For each pool calculate the volume required so that each library is present in equal quantities.
Library molar concentration can be calculated from the library fragment size and mass concentration as follows:
You can use the attached calculator to determine the pooling volumes.
You may be required to pool sub-pools instead of individual libraries. If so an additional weighting by the number of samples per sub-pool is required. See the attached calculator.
Using a fresh 1.5 mL DNA LoBind tube carefully add the required volume of each library.
Optional: Concentrate pool using 1.4X Ampure XP.
Equilibrate Ampure XP beads to Room temperature
and vortex well to mix.
Measure the total volume of the pool and add 1.4X volume of Ampure XP beads and mix well.
Keeping on the magnet air dry for up to 0h 3m 0s
.
Add the required volume of 10 mM Tris pH 8.0, remove from the magnet and mix well to fully suspend the beads.
Incubate at Room temperature
for at least 0h 2m 0s
to elute the DNA.
Place back on magnet until beads and solution have fully separated.
Transfer the supernatant containing the pool DNA to a fresh 1.5 mL DNA LoBind tube.
Incubate at Room temperature
room temperature for 0h 5m 0s
Place on a magnetic rack for 0h 5m 0s
until beads and solution have fully separated.
Carefully add 200µL
without disturbing the beads.
Incubate at Room temperature
for 0h 0m 30s
.
Place on magnetic rack for 0h 1m 0s
until the beads and solution have fully separated.
Keeping on the magnet and carefully remove supernatant without disturbing the beads.
Repeat wash with 200µL
.
Remove all traces of ethanol.
Pool QC
It is recommended that library quantification is performed using a fluorometric method (e.g. Qubit) rather than an absorbance based method (e.g. Nanodrop) as it is more specific, sensitive and accurate.
Equipment
Value | Label |
---|---|
Qubit | NAME |
Flurometer | TYPE |
Invitrogen | BRAND |
Q33228 | SKU |
Prepare 0.5 mL thin-walled PCR tubes, including 2 tubes for standard solutions and 3 for each pool.
Label the tube lids and not the sides.
Select "Run samples" and select the sample volume as 1µL
.
Insert a sample tube into the sample chamber, close the lid and press "Read tube".
Record the concentration of the Qubit sample in ng/μL.
For each pool ensure all readings are within the same range.
Calculate the average pool concentration from the replicates ensuring any outliers are removed.
Prepare the Qubit dsDNA High Sensitivity master mix for the total number of samples and standards with an excess.
A | B |
---|---|
Component | Volume (μl) |
Qubit dsDNA HS Buffer | 199 |
Qubit dsDNA HS Reagent | 1 |
Total | 200 |
Aliquot the Qubit dsDNA High Sensitivity master mix into the assay tubes as follows:
Standards 190µL
Samples 199µL
Add 10µL
to each standard assay tube.
Add 1µL
to each sample assay tube.
Vortex assay tubes and briefly centrifuge.
Incubate at Room temperature
for 0h 2m 0s
.
Select dsDNA high Sensitivity assay on the Qubit Fluorimeter and press "Read Standards".
Insert Standard 1 and 2 into the sample chamber when prompted, close the lid and and press "Read Standard".
It is recommended that the libraries are visualised using capillary electrophoresis, we describe visualisation with a TapeStation and High Sensitivity D5000 ScreenTape. Alternatives such as the BioAnalyzer or Fragment Analyzer can also be used.
The purpose is to provide a size for the library fragments to give accurate molar quantification and to determine the quality of the library.
Equipment
Value | Label |
---|---|
4200 TapeStation System | NAME |
Electrophoresis tool for DNA and RNA sample quality control. | TYPE |
TapeStation Instruments | BRAND |
G2991AA | SKU |
Ensure that the D5000 ScreenTape and Reagents are equilibrated to Room temperature
at least 0h 30m 0s
before use, vortex and briefly centrifuge.
Calculate the pool molar concentration using the fragment size (Step 6.9) and mass concentration (Step 5.14).
Prepare a dilution of each pool to approximately 1 ng/μL.
In fresh PCR strip tubes prepare the ladder assay tube as follows:
A | B |
---|---|
Component | Volume (μl) |
D5000 sample buffer | 2 |
D5000 ladder | 2 |
Total | 4 |
Prepare the sample assay tubes as follows:
A | B |
---|---|
Component | Volume (μl) |
D5000 sample buffer | 2 |
Diluted Pool | 2 |
Total | 4 |
Spin down, using IKA vortexer mix at 2000rpm
then spin down again.
Load the assay tubes and ScreenTape into the TapeStation instrument.
Select the required sample/ladder positions in the TapeStation software and click "start".
Analyse the results, the pool should generate a similar trace to the individual libraries.
If needed follow the same troubleshooting guidelines as in the protocol Library clean up and quality control for Illumina sequencing.
Determine the pool fragment peak size in bp.