Multiplexed CRISPR-based target-enriched next-generation sequencing for detecting antibiotic resistance genes in environmental samples

Yuqing Mao, Thanh H Nguyen

Published: 2023-11-16 DOI: 10.17504/protocols.io.8epv5xdnjg1b/v1

Disclaimer

DISCLAIMER – FOR INFORMATIONAL PURPOSES ONLY; USE AT YOUR OWN RISK

The protocol content here is for informational purposes only and does not constitute legal, medical, clinical, or safety advice, or otherwise; content added to protocols.io is not peer reviewed and may not have undergone a formal approval of any kind. Information presented in this protocol should not substitute for independent professional judgment, advice, diagnosis, or treatment. Any action you take or refrain from taking using or relying upon the information presented here is strictly at your own risk. You agree that neither the Company nor any of the authors, contributors, administrators, or anyone else associated with protocols.io, can be held responsible for your use of the information contained in or linked to this protocol or any of our Sites/Apps and Services.

Abstract

High-throughput detection of antibiotic resistance genes (ARGs) in complex environmental samples is challenging for two reasons: 1) ARGs account for less than 0.1% of total DNA in an environmental sample, and 2) it is difficult to detect thousands of ARGs in one reaction. Conventional methods, including metagenomic sequencing and quantitative polymerase chain reaction (qPCR), have their limitations with sensitivity and target range, respectively. Here, we propose a multiplexed CRISPR-Cas9-based target-enriched next-generation sequencing (NGS) method to detect thousands of ARGs in complex environmental samples, using sewage as a testbed. This protocol includes guide RNA design, guide RNA synthesis, DNA sample preparation, CRISPR-NGS library preparation, and data processing steps. With this protocol, ARGs in low abundances can be detected with increased read depth and higher sensitivity than regular metagenomic NGS methods. This protocol is also applicable for detecting other low-abundance genetic markers, for example, bacterial virulence factors, in environmental samples.

Before start

It is highly recommended to use DNA Away and RNase Away to clean all surfaces and equipment before wet lab experiments.

Steps

Multiplex crRNA design (using FLASHit as an example)

1.

Create a Linux environment. It can be set up in MobaXterm (https://mobaxterm.mobatek.net/) or other preferred terminal software. Using MobaXterm as an example, the Linux environment can be created by “Sessions” -> “New session” -> WSL. Then, select “Ubuntu” for “Distribution”. Click on “OK”, and the session will be created and saved.

2.

Install FLASHit (https://github.com/czbiohub-sf/flash) in the Linux environment according to the instructions in “Prerequisite” on its GitHub webpage.

Citation
Quan J, Langelier C, Kuchta A, Batson J, Teyssier N, Lyden A, Caldera S, McGeever A, Dimitrov B, King R, Wilheim J, Murphy M, Ares LP, Travisano KA, Sit R, Amato R, Mumbengegwi DR, Smith JL, Bennett A, Gosling R, Mourani PM, Calfee CS, Neff NF, Chow ED, Kim PS, Greenhouse B, DeRisi JL, Crawford ED 2019 FLASH: a next-generation CRISPR diagnostic for multiplexed detection of antimicrobial resistance sequences. https://doi.org/10.1093/nar/gkz418

3.

Collect all target genes from databases. Here, as an example, all available sequences for antibiotic resistance genes (ARGs) from The Comprehensive Antibiotic Resistance Database (CARD) (https://card.mcmaster.ca/) were downloaded using the link: https://card.mcmaster.ca/latest/data.

4.

For an ARG detection project, among all downloaded “.fasta” files, “nucleotide_fasta_protein_homolog_model.fasta” was used as the input, because the ARGs that have raised high concerns such as the CTX-M gene families and the mcr gene families are included in the protein homolog model.

5.

Trim the “nucleotide_fasta_protein_homolog_model.fasta” file using the Python code below to keep only the antibiotic resistance ontology (ARO) of the ARGs in the titles, because special characters cannot be processed by FLASHit.

import re

file=open(r"INPUT_FASTA_FILE_PATH_HERE")
raw_content=file.readlines()
file.close()

output_content=[]

for i in range(0,len(raw_content)):
    if i%2==0:
        aro_number=re.findall(r'ARO:[0-9]+',raw_content[i])[0]
        output_content.append('>'+aro_number.split(':')[1])
    if i%2!=0:
        output_content.append(raw_content[i].strip('\n'))

output_file=open(r" OUTPUT_TRIMMED_FASTA_FILE_PATH_HERE ",'w')

for i in range(0,len(output_content)):
    output_file.write(output_content[i]+'\n')

output_file.close()

Note
For DNA sequences downloaded from other databases, the above code may need to be modified. Each ARG has its own corresponding ARO in CARD.

6.

In Linux terminal, activate the conda environment, then activate the environment for running FLASHit.

7.

[optional] By default, FLASHit excludes the off-target sites from human genomes and the E. coli BL21 genome. If an environmental sample is expected to include undesired genomes other than these two, for example, swine genomes, users can modify the files in /flash/generated_files/ accordingly.

7.1.

Search the reference genome of the undesired off-targets from NCBI Genome database (https://www.ncbi.nlm.nih.gov/datasets/genome/) by typing the species name in the search box. https://www.ncbi.nlm.nih.gov/datasets/genome/) by typing the species name in the search box.

7.10.

Run the following python code to remove replicated off-targets and organize all off-targets to the same file.

import argparse
import os

parser = argparse.ArgumentParser()
parser.add_argument('-i',dest='input',type=str,required=True,help='Define input txt folder')
parser.add_argument('-o',dest='output',type=str,required=True,help="Define output path for a combined txt")
args=parser.parse_args()

full_gRNA_list=[]
dir_path=args.input.strip("'")
dir_list=os.listdir(dir_path)
for file_name in dir_list:
    print('Processing '+file_name+' ......')
    file=open(dir_path+'/'+file_name)
    raw_content=file.readlines()
    file.close()
    for i in range(0,len(raw_content)):
        raw_content[i]=raw_content[i].strip('\t\n\r')
    for i in range(0,len(raw_content),3):
        full_gRNA_list.append(raw_content[i])

full_gRNA_list=list(set(full_gRNA_list))

output_file=open(args.output.strip("'"),"w")
for i in range(0,len(full_gRNA_list)):
    output_file.write(full_gRNA_list[i]+'\n')
output_file.close()

Note
This code is written to be run by a command line. Users should save it as a “.py” file, and run it by typing “python3 NAME_OF_THE_PYTHON_CODE.py -i THE_DIRECTORY_PATH_CONTAINING_ALL_SPLIT_TXT_FILES -o THE_TXT_FILE_PATH_FOR_THE_OUTPUT_ORGANIZED_TARGET_LIST.txt”, quotation marks not included. In addition, this code may require a large memory, especially when the input genome is large. To avoid a potential crash, it is recommended to run this code on a server, instead of a personal computer.

7.11.

If there are multiple off-target genomes, place all output “.txt” files generated by the above code into the same directory. The list for human genome off-targets is already provided by FLASHit with the file path "/flash/generated_files/human_guides_38.txt". Combine all off-targets into the same “all_offtargets.txt” file using the following Python code.

import argparse

parser = argparse.ArgumentParser()
parser.add_argument('-i',dest='input',type=str,required=True,help='Define input txt folder')
parser.add_argument('-o',dest='output',type=str,required=True,help="Define output path for a combined txt")
args=parser.parse_args()

file=open(args.input.strip("'"))
gRNA_list=file.readlines()
file.close()

output_list=sorted(list(set(gRNA_list)))
output_file=open(args.output.strip("'"),'w')
for i in range(0,len(output_list)):
    output_file.write(output_list[i])
output_file.close()

Note
This code is also written to be run by a command line. Users should save it as a “.py” file and run it.

7.12.

Replace the “all_offtargets.txt” file in “/flash/generated_files/” with the new “.txt” file generated by the code above. Make sure to rename the newly generated “.txt” file to “all_offtargets.txt”.

7.13.

Rename “human_guides_38.txt” and “ecoli_bl21_de3_offtargets.txt” in “/flash/generated_files/” to “human_guides_38.txt1” and “ecoli_bl21_de3_offtargets.txt1” to avoid those two files to be identified by FLASHit by default.

7.2.

From the search results, click on the genome with the NCBI RefSeq label.

7.3.

Download the genome sequence by choosing “RefSeq only” and “Genome sequences (FASTA)”. .

7.4.

Use each “.fasta” file as an input to FLASHit following the guidance in “Creating your own library” in th “Workflow” section on the GitHub page of FLASHit. .

7.5.

After “Will discard xxx targets in amibiguous_targets.txt affecting xxx not necessarily unique genes.” is shown on the screen, break the current FLASHit run by hitting Ctrl+C.

7.6.

Go to the directory /flash/generated_files/target_index/, copy “all_targets.txt” to a customized directory , and rename it by the input “.fasta” file name.

7.7.

After collecting and renaming all “all_targets.txt” files to the customized directory, split files larger than 10 Mb to separated 10 Mb files into a new directory using the command below.

split -b 10m INPUT_FILE_NAME OUTPUT_FILE_PATH_AND_PREFIX

Note
Make sure the directory only contains split files, otherwise the other files will be also renamed.

7.8.

In the new directory containing all split files, add “.txt” suffix to all files using the command below.

ls | while read i; do mv ${i} ${i}.txt

Note
Make sure the directory only contains split files. Otherwise ,the other files will be also renamed.

7.9.

Copy and paste the other files smaller than 10 Mb to the directory containing all split files.

8.

Follow the guidance in “Creating your own library” in “Workflow” section on the GitHub page of FLASHit to generate a list for the multiplexed 20-nt target regions for the template of crRNA.

9.

Follow the guidance in “Creating a bed file of the guides” on the GitHub page of FLASHit to generate a file showing the cleavage sites of the crRNA on the target genes.

10.

Assemble the full crRNA templates by replacing the XXXXXXXXXXXXXXXXXXXX in 5’-TAATACGACTCACTATAGXXXXXXXXXXXXXXXXXXXXGTTTTAGAGCTATGCTGTTTTG-3’ by the 20-nt sequences generated by FLASHit.

11.

The assembled nucleotide sequences can be used for purchasing DNA oligo pools.

Guide RNA preparation

12.

Mix the DNA template for either crRNA or tracrRNA, forward and corresponding

reverse primers, Phusion High-Fidelity PCR Master Mix, and molecular biology

grade water in a nuclease-free PCR tube following the volumes listed in the

table below. Pipette up and down 10 times or until well mixed.

AB
ReagentVolume (μL)
DNA template4
Forward primer (10 μM)2.5
Reverse primer (10 μM)2.5
Phusion High-Fidelity PCR Master Mix25
Molecular biology grade water16
Total50
13.

Amplify the DNA templates for crRNA and tracrRNA in a thermal cycler for PCR. The steps in the thermal cycle are listed below.

ABCD
StepTemperature (℃)Time (s)Cycles
Initial denaturation98 ℃101x
Denaturation98 ℃512x
Annealing55 ℃15
Final extension72 ℃601x
Hold4 ℃1x

Citation
Liang X, Potter J, Kumar S, Zou Y, Quintanilla R, Sridharan M, Carte J, Chen W, Roark N, Ranganathan S, Ravinder N, Chesnut JD 2015 Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection. https://doi.org/10.1016/j.jbiotec.2015.04.024

14.

Mix ATP, UTP, GTP, and CTP provided in TranscriptAid T7 High Yield Transcription Kit in 1:1:1:1 ratio in a nuclease-free microcentrifuge tube. Pipette up and down for 10 times or until well mixed.

15.

For the transcription reaction of crRNA and tracrRNA, mix the reagents

from TranscriptAid T7 High Yield Transcription Kit and the PCR-amplified DNA

templates in nuclease-free PCR tubes following the volumes provided in the

table below. Pipette up and down 10 times or until well mixed. A 50-μL

PCR-amplified DNA template can be divided into 4 transcription reactions in

this step.

AB
ReagentVolume (μL)
Mixed NTP16
PCR-amplified DNA template12
5X TranscriptAid Reaction Buffer8
TranscriptAid Enzyme Mix4
Total40

Note
The reagents must be added following the order from the top to the bottom in the above table.

Citation
Liang X, Potter J, Kumar S, Zou Y, Quintanilla R, Sridharan M, Carte J, Chen W, Roark N, Ranganathan S, Ravinder N, Chesnut JD 2015 Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection. https://doi.org/10.1016/j.jbiotec.2015.04.024

16.

Incubate the RNA transcription samples at 37 ℃ for 4 hours.

Note
A white mist should be visible for a successful reaction after incubation.

17.

Add 5 μL of DNase I and 5 μL of DNA Digestion Buffer provided in RNA Clean & Concentrator-5 (DNase Included) to each 40-μL RNA transcription reaction. Pipette up and down 10 times or until well mixed.

18.

Incubate at room temperature for 15 min.

19.

Follow the “Total RNA Clean-up” instructions in the user manual of RNA Clean & Concentrator-5. Use 15 μL of DNase/RNase-Free Water to elute the purified crRNA or tracrRNA product. Incubate for 5 min before the final centrifugation to ensure maximum yield.

Note
Use 1.5 volumes of ethanol to reach maximum RNA yield as suggested in the manual for 17-200 nt RNA, in the case of 40 μL transcription with 5 μL of DNase I and 5 μL of DNA Digestion Buffer added, use 225 μL of 100% ethanol. Highly recommend using low-retention pipette tips to ensure maximum RNA yield.

20.

Pipette each purified crRNA or tracrRNA product up and down for 10 times or until well mixed. Take 1 μL from each purified RNA sample, dilute 100-fold in 99 μL of molecular biology grade water. Pipette up and down for 10 times or until well mixed. Quantify each 100-fold-diluted purified crRNA or tracrRNA using QubitTMRNA Broad Range (BR) Assay Kit by adding 10 μL diluted sample to 190 μL of master mix.

the

Note
This is not just a QC step. It is necessary to determine the concentration for each crRNA and tracrRNA for calculating themixing ratio before making duplexed guide RNA. Usually, the final concentrations for crRNA are >2,000 ng/μL, and the final concentrations for tracrRNA are > 4,000 ng/μL.

21.

Aliquot the crRNA and tracrRNA samples and store at -80 ℃ before use.

22.

Right before making CRISPR-NGS library, mix crRNA and tracrRNA in

anequi-molar ratio (see Equation 1), then add Nuclease-Free Duplex Buffer to

reach a final guide RNA concentration of 1500 ng/μL (see Equation 2). Pipette

up and down 10 times or until well mixed.

Equation 1: crRNA and tracrRNA equi-molar ratio mixing
Equation 1: crRNA and tracrRNA equi-molar ratio mixing
Equation 2: Volume calculation for crRNA and tracrRNA mixing
Equation 2: Volume calculation for crRNA and tracrRNA mixing
23.

Incubate the mixture in a thermal cycler at 94 ℃ for 2 min, then slowly cool down to room temperature. The guide RNA is ready to use.

DNA sample preparation (Sewage sample as an example)

24.

Shake the sewage sample until well mixed, and filter 50 mL of the sewage sample through

0.45 μm pore size membrane filter.

Note
The volume is determined by the turbidity of the sewage sample. For extremely turbid sewage samples, the volume can go down to 10-20 mL, and for relatively clear samples, the volume can go up to 100 or 200 mL until the filter is clogged.

25.

Store the membrane filter at -80 ℃ until DNA extraction.

Note
The filters can be cut into two before storage. Usually, half of a filter can obtain a high enough DNA yield.

26.

Extract DNA from the membrane filter using FastDNATM SPIN Kit for Soil following the user’s manual. Elute the DNA samples using 100 μL of DES provided in the kit.

27.

Purify the DNA samples using OneStep PCR Inhibitor Removal Kit following the user’s manual.

28.

Determine the concentrations of the DNA samples using QubitTM 1X dsDNA High Sensitivity (HS) Assay Kit by adding 2 μL diluted sample to 198 μL of master mix.

Note
Usually, the DNA concentrations are higher than 10 ng/μL.

29.

Aliquot and store the DNA samples at -20 ℃ or -80 ℃ until library preparation.

CRISPR-NGS library preparation

30.

Right before library preparation, dilute the “NH8B” external standard 100-fold using

molecular biology grade water.

31.

Determine the concentration of the diluted “NH8B” external standard using QubitTM 1X dsDNA High Sensitivity (HS) Assay Kit by adding 2 μL diluted external standard to 198 μL of master mix.

32.

Mix the diluted Cas9 protein, the duplexed guide RNA, and NEBufferTM r3.1

in nuclease-free PCR tubes with the volumes listed in the table below. Pipette

up and down 10 times or until well mixed.

AB
ReagentVolume (μL)
Cas92
Guide RNA10
NEBuffer r3.13
Molecular biology grade water4
Total19

Note
It is highly recommended to use TrueCutTM HiFi Cas9 Protein (5 µg/µL) because other commercial Cas9 proteins are not able to reach the accuracy and cleavage efficiency as high as this Cas9 protein with this workflow.

Citation
Liu Y, Tao W, Wen S, Li Z, Yang A, Deng Z, Sun Y 2015 In Vitro CRISPR/Cas9 System for Efficient Targeted DNA Editing. https://doi.org/10.1128/mBio.01714-15

33.

Incubate the above mixture at room temperature for at least 15 min to bind guide RNA to Cas9.

34.

Block the DNA samples by removing 5’ phosphate group using rAPid Alkaline Phosphatase with the volumes listed in the table below.

AB
ReagentAmount
rAPid Alkaline Phosphatase Buffer 10x concentrated2 μL
rAPid Alkaline Phosphatase 1 U/μl1 μL
DNA sample~200 ng
Molecular biology grade waterFill up the volume to 20 μL

Note
You can reduce the volume to 10 μL if the DNA sample is limited. In such cases, the DNA sample input will be ~ 100 ng, and the volumes of the buffer and the phosphatase will be reduced to half of the volumes listed above.

Citation
Gilpatrick T, Lee I, Graham JE, Raimondeau E, Bowen R, Heron A, Downs B, Sukumar S, Sedlazeck FJ, Timp W 2020 Targeted nanopore sequencing with Cas9-guided adapter ligation. https://doi.org/10.1038/s41587-020-0407-5

Citation
Quan J, Langelier C, Kuchta A, Batson J, Teyssier N, Lyden A, Caldera S, McGeever A, Dimitrov B, King R, Wilheim J, Murphy M, Ares LP, Travisano KA, Sit R, Amato R, Mumbengegwi DR, Smith JL, Bennett A, Gosling R, Mourani PM, Calfee CS, Neff NF, Chow ED, Kim PS, Greenhouse B, DeRisi JL, Crawford ED 2019 FLASH: a next-generation CRISPR diagnostic for multiplexed detection of antimicrobial resistance sequences. https://doi.org/10.1093/nar/gkz418

35.

Incubate the DNA-blocking reaction mixture in a thermal cycler with the following thermal conditions.

ABC
StepTemperature (℃)Time (min)
Incubation3710
Phosphatase inactivation752
Hold4
36.

Mix the blocked DNA samples, the mixture of Cas9 and guide RNA, and the “NH8B” external standard with the volumes listed in the table below. Pipette up and down 10 times or until well mixed.

AB
ReagentVolume (μL)
Mixture of Cas9 and guide RNA19
Blocked DNA10
100-fold diluted “NH8B” external standard1
Total30
37.

Incubate the above mixture at 37 ℃ for 16 hours.

38.

Add 5 μL of RNase T1 to the mixture, pipette up and down for 10 times, and incubate at 37 ℃ for 15 min to remove guide RNA.

39.

Prepare the master mix for dA-tailing, using the reagents and volumes listed in the table below. Pipette up and down 10 times or until well mixed.

AB
ReagentVolume (μL)
dATP (100 mM)2
Taq DNA Polymerase5
ThermoPol Reaction Buffer80
Molecular biology grade water13
Total100

Citation
Gilpatrick T, Lee I, Graham JE, Raimondeau E, Bowen R, Heron A, Downs B, Sukumar S, Sedlazeck FJ, Timp W 2020 Targeted nanopore sequencing with Cas9-guided adapter ligation. https://doi.org/10.1038/s41587-020-0407-5

40.

Add 5 μL of the dA-tailing master mix to each 35-μL mixture after RNase T1 treatment to reach a 40 μL total volume. Pipette up and down 10 times or until well mixed.

41.

Incubate the mixture at 72 ℃ for 20 min for dA-tailing and Cas9 inactivation.

42.

Dilute adapters using molecular biology grade water or xGenTM Adapter Buffer based on the total DNA input, according to the table below.

AB
Total DNA input (ng)Adapter dilution ratio
>100100x
<100200x
43.

Ligate adapters to targeted DNA fragments using the reagents and volumes listed in the table below.

AB
ReagentVolume (μL)
dA-tailed DNA sample35
Diluted adapter2.5
NEBNext Ligation Enhancer1
NEBNext Ultra II Ligation Master Mix30
Total68.5

Note
The reagents must be added following the order from the top to the bottom in the above table. After adding each reagent, pipette up and down 10 times or until well mixed. The ligation master mix should be well mixed before adding to the reaction.

44.

Incubate the ligation mixture at room temperature for 15 min.

45.

Dilute 1x TE buffer 10-fold to make 0.1x TE buffer.

46.

Purify the adapter-ligated DNA samples using AMPure XP SPRI beads with beads:DNA ratio of 0.8:1 (57 μL of SPRI beads for 68.5 μL of adapter-ligated DNA sample). The detailed SPRI beads cleaning steps are listed in the table below.

ABC
StepOn/Off the magnetic rackTime (min)
Bind DNA sample to the beadsoff5
Separate the beads from the liquid phaseon5
Discard the supernatanton/
1st wash with 80% ethanolon2
Discard the supernatanton/
2nd wash with 80% ethanolon2
Discard the supernatanton/
Air dry the beads with the lid openon3-5
Resuspend the beads with 17 μL of 0.1x TE bufferoff/
Release DNA from the beads to the liquid phaseoff10
Separate the beads from the liquid phaseon5
Transfer 15 μL of the supernatant to a clean PCR tubeon/
47.

Dilute the xGenTM Library Amplification Primer Mix 2-fold by adding an equal volume of molecular biology grade water.

48.

Mix the beads-purified DNA sample, diluted primer mix, and NEBNext®UltraTM II Q5® Master Mix in a nuclease-free PCR tube according to the table below. Pipette up and down 10 times or until well mixed.

AB
ReagentVolume (μL)
Beads-purified DNA sample15
Diluted primer mix10
NEBNext Ultra II Q5 Master Mix25
Total50
49.

Incubate the mixture above in a thermal cycler using the thermal cycle listed in the table below.

ABCD
StepTemperature (℃)TimeCycles
Initial denaturation98 ℃30 s1x
Denaturation98 ℃10 s22x for >100 ng DNA input; 30 x for <100 ng DNA input
Annealing65 ℃75 s
Final extension65 ℃5 min1x
Hold4 ℃1x
50.

Purify the PCR product using AMPure XP SPRI beads with beads:DNA ratio

of 0.9:1 (45 μL of SPRI beads for 50 μL of PCR product). The detailed SPRI

beads cleaning steps are listed in the table below.

ABC
StepOn/Off the magnetic rackTime (min)
Bind the PCR product to the beadsoff5
Separate the beads from the liquid phaseon5
Discard the supernatanton/
1st wash with 80% ethanolon2
Discard the supernatanton/
2nd wash with 80% ethanolon2
Discard the supernatanton/
Air dry the beads with the lid openon3-5
Resuspend the beads with 33 μL of 0.1x TE bufferoff/
Release DNA from the beads to the liquid phaseoff10
Separate the beads from the liquid phaseon5
Transfer 30 μL of the supernatant to a clean tubeon/
51.

Determine the DNA concentration of the library using QubitTM 1X dsDNA High Sensitivity (HS) Assay Kit by adding 2 μL diluted external standard to 198 μL of master mix.

Note
The DNA concentration should be above 1 ng/μL for a successful library.

52.

According to the DNA concentration, take 1-2 μL of the library and dilute to ~1 ng/μL for the fragment analyzer.

Note
A visible peak at ~170 bp means excess adapter dimers. Such libraries should be cleaned up after pooling and before sequencing using SPRI beads or eGel.

53.

Store the libraries at -20 ℃ or -80 ℃ until the sequencing run.

NGS read mapping

54.

After sequencing, download all raw sequencing data files.

55.

Unzip the files to get ".fastq" files for each library.

56.

Download PRICE from https://derisilab.ucsf.edu/software/price/index.html to the local Linux environment. Install by navigating to the PRICE directory and typing "make" in the command line tool.

Note
PriceSeqFilter in the PRICE package is used for screening out low-quality reads.

Citation
Ruby JG, Bellare P, Derisi JL 2013 PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. https://doi.org/10.1534/g3.113.005967

57.

Clone the KMA repository from https://bitbucket.org/genomicepidemiology/kma/src/master/ using the command below to the local Linux environment. Install by navigating to the KMA directory and typing "make" in the command line tool.

git clone https://bitbucket.org/genomicepidemiology/kma.git

Note
KMA is used for mapping the reads to the list of reference genes to analyze the presence or absence of the target genes.

Citation
Clausen PTLC, Aarestrup FM, Lund O 2018 Rapid and precise alignment of raw reads against redundant databases with KMA. https://doi.org/10.1186/s12859-018-2336-6

58.

Screen the low-quality sequencing reads using PriceSeqFilter with 85% of nucleotides in a read must be in high quality, the minimum allowed probability of a nucleotide being correct is 98%, and 90% of nucleotides in a read that must be called. An example of the command for paired sequences is shown below.

PATH_TO_PRICE_FOLDER/Price/PriceSeqFilter -fp R1.fastq R2.fastq -rqf 85 0.98 -rnf 90 -op R1_filtered.fastq R2_filtered.fastq

Citation
Ruby JG, Bellare P, Derisi JL 2013 PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. https://doi.org/10.1534/g3.113.005967

Citation
Quan J, Langelier C, Kuchta A, Batson J, Teyssier N, Lyden A, Caldera S, McGeever A, Dimitrov B, King R, Wilheim J, Murphy M, Ares LP, Travisano KA, Sit R, Amato R, Mumbengegwi DR, Smith JL, Bennett A, Gosling R, Mourani PM, Calfee CS, Neff NF, Chow ED, Kim PS, Greenhouse B, DeRisi JL, Crawford ED 2019 FLASH: a next-generation CRISPR diagnostic for multiplexed detection of antimicrobial resistance sequences. https://doi.org/10.1093/nar/gkz418

59.

Make a copy of the ".fasta" file used for generating guide RNA in Step 8. Add the sequence of the "NH8B" external standard to the end of the copied file. This ".fasta" file will be used as a reference gene list.

Note
Copy and paste the entire thing below:>NH8BACCCATACAAGGAACCCGGCCAGCACTACGCTCACTACGGCCGGTGGTACGGTGGGCACTCCGGTGAAATGCACGTGCTTGGCATGCCGTCAGGCCGTGAAGTCAAGCGCACCCCGGTGTTCAACATGGACAGCAACAAGATGACCATCCACATCGCCTCGCCGGCGCCGGCATACAGTCTGGGGGGAATTCAAGATGGAGAAGGGCGACGAGGTAATGGCGATCCTGACCTCGACAAGTGGAAGACCTG

Citation
Zhang Q, Ishii S 2018 Improved simultaneous quantification of multiple waterborne pathogens and fecal indicator bacteria with the use of a sample process control. https://doi.org/10.1016/j.watres.2018.03.023

60.

Index the ".fasta" reference gene list file with KMA using the command below.

kma index -i REFERENCE_GENE_LIST.fasta -o INDEX_FILE_PREFIX

Citation
Clausen PTLC, Aarestrup FM, Lund O 2018 Rapid and precise alignment of raw reads against redundant databases with KMA. https://doi.org/10.1186/s12859-018-2336-6

61.

Map the filtered ".fastq" files to the indexed reference gene list with KMA using the command below.

kma -ipe R1_filtered.fastq R2_filtered.fastq -a -t_db INDEX_FILE_PREFIX -o OUTPUT_FILE_PREFIX

Citation
Clausen PTLC, Aarestrup FM, Lund O 2018 Rapid and precise alignment of raw reads against redundant databases with KMA. https://doi.org/10.1186/s12859-018-2336-6

62.

The read mapping results for the target genes are available in the ".res" file generated by KMA.

推荐阅读

Nature Protocols
Protocols IO
Current Protocols
扫码咨询