2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications

Ning Zhang; Shundi Shi; Barney Yoo; Xiaohong Yuan; Wenjia Li; Shenglong Zhang

doi:10.3791/61281

A subscription to JoVE is required to view this content. Sign in or start your free trial.

Summary

Here, we describe a detailed protocol for an LC-MS-based sequencing method that can be used as a direct method to sequence short RNA (<35 nt per run) without a cDNA intermediate, and as a general method to sequence different nucleotide modifications in a single study at single-base precision.

Abstract

Mass spectrometry (MS)-based sequencing approaches have been shown to be useful in direct sequencing RNA without the need for a complementary DNA (cDNA) intermediate. However, such approaches are rarely applied as a de novo RNA sequencing method, but used mainly as a tool that can assist in quality assurance for confirming known sequences of purified single-stranded RNA samples. Recently, we developed a direct RNA sequencing method by integrating a 2-dimensional mass-retention time hydrophobic end-labeling strategy into MS-based sequencing (2D-HELS MS Seq). This method is capable of accurately sequencing single RNA sequences as well as mixtures containing up to 12 distinct RNA sequences. In addition to the four canonical ribonucleotides (A, C, G, and U), the method has the capacity to sequence RNA oligonucleotides containing modified nucleotides. This is possible because the modified nucleobase either has an intrinsically unique mass that can help in its identification and its location in the RNA sequence, or can be converted into a product with a unique mass. In this study, we have used RNA, incorporating two representative modified nucleotides (pseudouridine (Ψ) and 5-methylcytosine (m⁵C)), to illustrate the application of the method for the de novo sequencing of a single RNA oligonucleotide as well as a mixture of RNA oligonucleotides, each with a different sequence and/or modified nucleotides. The procedures and protocols described here to sequence these model RNAs will be applicable to other short RNA samples (<35 nt) when using a standard high-resolution LC-MS system, and can also be used for sequence verification of modified therapeutic RNA oligonucleotides. In the future, with the development of more robust algorithms and with better instruments, this method could allow sequencing of more complex biological samples.

Introduction

Mass spectrometry (MS)-based sequencing methods, including top-down MS and tandem MS¹^,²^,³^,⁴, have been developed for direct sequencing of RNA. However, in situ fragmentation techniques for effectively generating high-quality RNA ladders in mass spectrometers currently can not be applied to de novo sequencing⁵^,⁶. Furthermore, it is not very trivial to analyze the traditional one-dimensional (1D) MS data for de novo sequencing of even one purified RNA sequence, and it would be even more challenging for MS sequencing of mixed RNA samples⁷^,⁸. Therefore, a two-dimensional (2D) liquid chromatography (LC)-MS-based RNA sequencing method has been developed, incorporating production of 2D mass-retention time (t_R) ladders to replace 1D mass ladders, making it much easier to identify ladder components needed for de novo sequencing of RNAs⁸. However, the 2D LC-MS-based RNA sequencing method is mainly limited to purified synthetic short RNA, as it cannot read a complete sequence solely based on one single ladder, but must rely on two co-existing adjacent ladders (5´- and 3´-ladders)⁸. More specifically, this approach requires bidirectional paired-end reads for reading terminal nucleobases in the low-mass region⁸. The added complexity of the paired-end reading results in this method being untenable for sequencing of RNA mixtures because confusion is raised on which ladder fragment belongs to which ladder for the unknown samples.

To overcome the abovementioned barriers in MS-based RNA sequencing approaches and to broaden such applications in direct RNA sequencing, two issues must be addressed: 1) how to generate a high-quality mass ladder that can be used to read a complete sequence, from the first nucleotide to the last in an RNA strand, and 2) how to effectively identify each RNA/mass ladder in a complex MS dataset. Together with well-controlled acid degradation, we have developed a new sequencing method by introducing a hydrophobic end labeling strategy (HELS) into the MS-based sequencing technique, and successfully addressed these two issues by adding a hydrophobic tag at either 5´- and/or 3´-end of the RNAs to be sequenced⁹. This method creates an “ideal” sequence ladder from RNA—each ladder fragment derives from site-specific RNA cleavage exclusively at each phosphodiester bond, and the mass difference between two adjacent ladder fragments is the exact mass of either the nucleotide or nucleotide modification at that position ⁸^,⁹^,¹⁰. This is possible because we include a highly controlled acidic hydrolysis step, which fragments the RNA, on average, once per molecule, before it is injected into the instrument. As a result, each degradation fragment product is detected on the mass spectrometer and all fragments together form a sequencing ladder⁸^,⁹^,¹⁰. This new strategy enables complete reading of an RNA sequence from one single ladder of an RNA strand without paired-end reading from the other ladder of the RNA, and additionally allows MS sequencing of RNA mixtures with multiple different strands that contain combinatorial nucleotide modifications⁹. By adding a tag at the 5´- and/or 3´-end of the RNA, the labeled ladder fragments display a significant delay of t_R, which can help to distinguish the two mass ladders from each other and also from the noisy low-mass region. The mass-t_R shift caused by adding the hydrophobic tag facilitates mass ladder identification and simplifies data analysis for sequence generation. Furthermore, the addition of the hydrophobic tag can help to identify the terminal base in the strand by preventing its corresponding ladder fragment from being in the noisy low-mass-t_R region due to the mass and hydrophobicity increase caused by the tag, thus allowing identification of the complete sequence of an RNA from a single ladder; no paired-end reads are required. As a result, we have previously demonstrated the successful sequencing of a complex mixture of up to 12 RNA distinct strands without the use of any advanced sequencing algorithm⁹, which opens the door for de novo MS sequencing of RNA containing both canonical and modified nucleotides and makes it more feasible for the sequencing of mixed and more complex RNA samples. In fact, using 2D-HELS MS Seq, we have even successfully sequenced a mixed population of tRNA samples¹⁰ and are actively expanding its application to other complex RNA samples.

To facilitate 2D-HELS MS Seq to directly sequence a broader range of RNA samples, here we will focus on the technical aspects of this sequencing approach and will cover all of the essential steps needed when applying the technique towards direct sequencing of RNA samples. Specific examples will be used to illustrate the sequencing technique, including synthetic single RNA sequences, mixtures of multiple distinct RNA sequences, and modified RNAs containing both canonical and modified nucleotides such as pseudouridine (ψ) and 5-methylcytosine (m⁵C). Since RNAs all contain phosphodiester bonds, any type of RNA can be acid-hydrolyzed to generate an ideal sequence ladder for 2D-HELS MS Seq under optimal conditions⁸^,⁹. However, detection of all ladder fragments of a given RNA is instrument dependent. On a standard high-resolution LC-MS (40K), the minimal loading amount for sequencing a purified short RNA sample (<35 nt) is 100 pmol per run. However, more material is required (up to 400 pmol per RNA sample) when additional experiments must be conducted (e.g., to distinguish isomeric base modifications that share identical masses). The protocol used in sequencing the model synthetic modified RNAs will also be applicable to sequencing broader RNA samples, including biological RNA samples with unknown base modifications. However, an even larger sample amount, such as 1000 pmol for sequencing tRNA (~76 nt) using a standard LC-MS instrument, is required to sequence the complete tRNA with all the modifications, and an advanced algorithm must be developed for its de novo sequencing¹⁰.

Protocol

1. Design RNA oligonucleotides

Design synthetic RNA oligonucleotides with different lengths (19 nt, 20 nt and 21 nt), including one (RNA #6) with both canonical and modified nucleotides. ψ is employed as a model for non-mass-altering modifications, which is challenging for MS sequencing because it has an identical mass to U. m⁵C is chosen as a model for mass-altering modifications to demonstrate the robustness of the approach.

RNA #1: 5´-HO-CGCAUCUGACUGACCAAAA-OH-3´
RNA #2: 5´-HO-AUAGCCCAGUCAGUCUACGC-OH-3´
RNA #3: 5´-HO-AAACCGUUACCAUUACUGAG-OH-3´
RNA #4: 5´-HO-GCGUACAUCUUCCCCUUUAU-OH-3´
RNA #5: 5´-HO-GCGGAUUUAGCUCAGUUGGGA-OH-3´
RNA #6: 5´-HO-AAACCGUψACCAUUAm⁵CUGAG-OH-3´
Dissolve each synthetic RNA in nuclease-free diethyl pyrocarbonate (DEPC)-treated water (expressed as DEPC-treated H₂O unless otherwise indicated) to obtain a 100 mM RNA stock solution. Stock solutions are stored long-term at -20 °C.
To avoid possible RNA sample degradation, use RNase-free experimental supplies including DEPC-treated water, microcentrifuge tubes, and pipette tips. Frequently wipe down surfaces of lab supplies using RNase elimination wipes.

2. Label the 3´-end of RNAs with biotin

Two-step reaction protocol (adenylation and ligation)
1. Add 1 µL of 10x adenylation reaction buffer containing 50 mM sodium acetate, pH 6.0, 10 mM MgCl₂, 5 mM dichlorodiphenyltrichloroethane (DTT), 0.1 mM ethylenediaminetetraacetic acid (EDTA), 1 µL of 1 mM ATP, 1 µL of 100 µM biotinylated cytidine bisphosphate (pCp-biotin), 1 µL of 50 µM Mth RNA ligase, and 6 µL of DEPC-treated H₂O (a total volume of 10 µL) into an RNase-free thin-walled 0.2 mL PCR tube.
  NOTE: Store the reagents at -20 °C before the two-step reaction. Thaw the reagents at room temperature and mix well by vortexing and centrifuging before adding to the reaction.
2. Incubate the reaction in a PCR machine at 65 °C for 1 h and inactivate the reaction at 85 °C for 5 min.
3. Conduct the ligation step in an RNase-free, thin walled 0.2 mL PCR tube containing 10 µL of reaction solution from the previous step by adding 3 µL of 10x T4 RNA ligase reaction buffer containing 50 mM tris(hydroxymethyl)aminomethane (Tris)-HCl, pH 7.8, 10 mM MgCl₂, 1 mM DTT, 1.5 µL of the 100 mM sample stock of the RNA to be sequenced, 3 µL of anhydrous dimethyl sulfoxide (DMSO) to reach 10% (v/v), 1 µL of T4 RNA ligase (10 units/µL), and 11.5 µL of DEPC-treated H₂O (for a total volume of 30 mL). Incubate the reaction overnight at 16 °C in a PCR machine.
  NOTE: Combine reaction components at room temperature due to the high freezing point of DMSO (18.45 °C).
4. Incubate the reaction overnight at 16 °C.
5. Quench and purify the reaction by column purification to remove enzymes and free pCp-biotin using Oligo Clean & Concentrator (Zymo Research, Irvine, CA, USA). Oligo Binding Buffer, DNA Wash Buffer, spin columns and collection tubes are provided in the kit. Add 20 mL of DEPC-treated H₂O to the reaction solution to reach a 50 mL sample volume prior to adding the Binding Buffer.
6. Add 100 mL of binding buffer to each reaction solution. Add 400 µL of ethanol, mix by pipetting, and transfer the mixture to the column. Centrifuge at 10,000 x g for 30 s. Discard the flow-through.
7. Add 750 µL of DNA Wash Buffer to the column. Centrifuge at 10,000 x g and maximum speed for 30 s and 1 minute, respectively.
8. Transfer the column to a 1.5 mL microcentrifuge tube. Add 15 µL of DEPC-treated H₂O to the column and centrifuge at 10,000 x g for 30 s to elute the RNA product.
  NOTE: Samples can be stored at -20 °C at this stage until the next step is performed.
One-step reaction protocol
1. Perform a one-step labeling reaction by combining 2 µL of 150 μM adenosine-5´-5´-diphosphate-{5´-(cytidine-2´-O-methyl-3´-phosphate-TEG}C-biotin (AppCp-biotin), 3 µL of 10x ligase reaction buffer, 1.5 µL of the 100 mM sample stock of the RNA to be sequenced, 3 µL of anhydrous DMSO to reach 10% (v/v), 1 µL of T4 RNA ligase (10 units/µL), and 19.5 µL of DEPC-treated H₂O (for a total volume of 30 mL) in a 1.5 mL RNase-free microcentrifuge tube.
2. Incubate the reaction overnight at 16 °C in a PCR machine.
3. Perform column purification as described above in steps 2.1.5-2.1.8.
  NOTE: Prepare a separate/exclusive reaction tube for each RNA sample (150 pmol scale of RNA). Labeling of the 5´-end of the RNA(s) with sulfo-Cyanine3 (Cy3) or Cy3 may be needed (e.g., for bidirectional sequencing verification). The method is different than that of 3´-biotinylation and is described in a previous publication⁹.

3. Capture biotinylated RNA sample on streptavidin beads

Activate 200 µL of streptavidin C1 magnet beads by adding 200 µL of 1x B&W buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl) in a 1.5 mL RNase-free microcentrifuge tube. Vortex this solution and place it on a magnet stand for 2 min. Then discard the supernatant by carefully pipetting out the solution.
Wash the beads twice with 200 µL of Solution A (DEPC-treated 0.1 M NaOH and DEPC-treated 0.05 M NaCl) and once in 200 µL of Solution B (DEPC-treated 0.1 M NaCl). For each wash step, vortex the solution and place it on a magnet stand for 2 min, followed by discarding of the supernatant. Then add 100 µL of 2x B&W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl).
Add 1x B&W buffer to the biotinylated RNA sample until the volume is 100 µL. Then add this solution to the washed beads stored in 100 µL of 2x B&W buffer. Incubate for 30 min at room temperature on a rocking platform shaker at 100 rpm. Place the tube on a magnet stand for 2 min and discard the supernatant.
Wash the coated beads 3 times in 1x B&W buffer and measure the final concentration of supernatant in each wash step by Nanodrop for recovery analysis, to confirm that the target RNA molecules remain on the beads.
Incubate the beads in 10 mM EDTA, pH 8.2 with 95% formamide at 65 °C for 5 min in a PCR machine. Keep the tube on the magnet stand for 2 min and collect the supernatant (containing the biotinylated RNAs released from the streptavidin beads) by pipet.
NOTE: This physical separation step prior to acid degradation is only used for sequencing of RNA#1 in Figure 1c, and is not mandatory for the 2D-HELS MS Seq since the hydrophobic biotin label can cause the 3´-labeled ladder fragments to have a significantly delayed t_R during LC-MS measurement, which can clearly distinguish the labeled 3´-ladder fragments from the unlabeled 5´-ladder fragments in the 2D mass-t_R plot.

4. Acid hydrolysis of RNA to generate MS ladders for sequencing

Divide each RNA sample into three equal aliquots. For instance, divide an RNA sample with a volume of 15 µL RNA sample into three aliquots of 5 µL.
Add an equal volume of formic acid to achieve 50% (v/v) formic acid in the reaction mixture⁸^,⁹.
Incubate the reaction at 40 °C in a PCR machine, with one reaction running for 2 min, one for 5 min, and one for 15 min, respectively.
Quench the acid degradation by immediately freezing the sample on dry ice after each reaction finishes.
Use a centrifugal vacuum concentrator to dry the sample. The sample is typically completely dried within 30 min, and formic acid is removed together with H₂O during the drying process because formic acid has a boiling point (100.8 °C) similar to that of H₂O (100 °C).
Suspend and combine a total of three dried samples in 20 µL of DEPC-treated H₂O for LC-MS measurement.
NOTE: Samples can be stored at -20 °C at this stage while waiting for LC-MS measurement.

5. Convert ψ to CMC-ψ adduct

Add 80 µL of DEPC-treated H₂O into a 1.5 mL RNase-free microcentrifuge tube containing 0.0141 g of N-cyclohexyl-Nʹ-(2-morpholinoethyl)-carbodiimide metho-p-toluenesulfonate (CMC) and 0.07 g of urea. Add 10 µL of the 100 µM sample stock of the RNA to be sequenced, 8 µL of 1 M bicine buffer (pH 8.3), and 1.28 µL of 0.5 M EDTA. Add DEPC-treated H₂O to reach a total volume of 160 µL. Final concentrations are 0.17 M CMC, 7 M urea, and 4 mM EDTA in 50 mM bicine (pH 8.3)¹¹.
NOTE: This protocol is applicable to either a single synthetic RNA sequence or RNA mixtures.
Divide the 160 µL reaction solution into four equal aliquots in RNase-free, thin walled 0.2 mL PCR tubes and incubate at 37 °C for 20 min in a PCR machine.
NOTE: 50 µL per tube is the maximum reaction volume that can be used in a PCR machine.
Quench each reaction with 10 µL of 1.5 M sodium acetate and 0.5 mM EDTA (pH 5.6).
Perform column purification with four parallel spin columns to remove excessive reactants according to the procedure as described in steps 2.1.5-2.1.8. Dissolve the purified product in 15 µL of DEPC-treated H₂O in each 1.5 mL RNase-free microcentrifuge tube.
Transfer the purified product to four RNase-free, thin walled 0.2 mL PCR tubes. Add 20 µL of 0.1 M Na₂CO₃ buffer (pH 10.4) into each 15 µL of purified product and add DEPC-treated H₂O to make a final volume of 40 µL for each reaction tube (in total four tubes). Incubate the reaction at 37 °C for 2 h in a PCR machine.
Quench and purify the reaction by column purification with four parallel spin columns as described in step 2.1.5. Elute the CMC-ψ converted product to a 1.5 mL RNase-free microcentrifuge tube each with 15 µL of DEPC-treated H₂O.
Combine the purified CMC-ψ converted sample from four collection tubes into one tube. Perform formic acid degradation 50% (v/v) according to the procedures as described in steps 4.1-4.6 to generate MS ladders for sequencing.

6. LC-MS measurement

Prepare mobile phases for LC-MS measurement. Mobile phase A is 25 mM hexafluoro-2-propanol with 10 mM diisopropylamine in LC-MS grade water; mobile phase B is methanol.
Transfer the sample to LC-MS sample vial for analysis. Each sample injection volume is 20 µL containing 100-400 pmol of RNA.
Use the following LC conditions: column temperature of 35 °C, flow rate of 0.3 mL/min; a linear gradient from 2–20% mobile phase B over 15 min followed by a 2 min wash step with 90% mobile phase B.
NOTE: For more hydrophobic end-labels such as Cy3 and sulfo-Cy3 as mentioned in Section 2, a higher percentage of organic solvent may be necessary for sample elution (i.e., a similar gradient can be used but with an increased percentage range of mobile phase B). For instance, 2–38% mobile phase B over 30 min with a 2 min wash step with 90% mobile phase B.
Separate and analyze samples on an Agilent Q-TOF (Quadrupole Time-of-Flight) mass spectrometer coupled to an LC system equipped with an autosampler and an MS HPLC (High Performance Liquid Chromatography) system. The LC column is a 50 mm x 2.1 mm C18 column with a particle size of 1.7 μm. Use the following MS settings: negative ion mode; range, 350 m/z to 3200 m/z; scan rate, 2 spectra/s; drying gas flow, 17 L/min; drying gas temperature, 250 °C; nebulizer pressure, 30 psig; capillary voltage, 3500 V; and fragmentor voltage, 365 V. Please note that these parameters are specific to the type or model of mass spectrometer being used.
Acquire data with Agilent MassHunter acquisition software. Use Agilent molecular feature extraction (MFE) workflow to extract compound information including mass, retention time, volume (the MFE abundance for the respective ion species), and quality score, etc. Use the following MFE settings: “centroid data format, small molecules (chromatographic), peak with height ≥ 100, up to a maximum of 1000, quality score ≥ 50”.
NOTE: Optimize MFE settings to extract as many potential compounds as possible, up to a maximum of 1000, with quality scores of ≥ 50.

7. Automate RNA sequence generation by a computational algorithm

NOTE: This procedure is shown only for RNA #1 in Figure 1c.

Sort MFE extracted compounds in order of decreasing volume (peak intensity) and t_R. Perform data pre-selection via 1) setting t_R from 4 to 10 min to select the RNA fragments labeled by the biotin, since the t_Rs of the biotin-labeled mass ladder components are shifted to this t_R window (4 min to 10 min), and 2) using an order-of-magnitude higher of input compounds than the number of ladder fragments for algorithm computation to reduce data amount based on volume. For instance, for a 20 nt RNA, 20 labeled mass-t_R ladder components will be required for sequencing of the 20 nt RNA, and thus, 200 compounds from MFE data file will be selected based on volume. Please note that the t_R window may be different when a different type or model of mass spectrometer is used.
Perform data processing and sequence generation of RNA #1 using a revised version of a published algorithm⁸. The source codes of the revised algorithm are described previously (https://academic.oup.com/nar/article/47/20/e125/5558343#supplementary-data)⁹.
In addition to automating sequence generation using the algorithm, manually calculate the mass differences between two adjacent ladder components for base calling. All bases in the RNA can be called manually and matched with the theoretical ones in the RNA nucleotide and modification database⁸; thus, the complete sequence of the RNA strand can be accurately read out manually, which is used to confirm the accuracy of the algorithm-reported sequence read. More structures of RNA modifications can be found in RNA modification databases¹², and their corresponding theoretical masses are obtained by ChemBioDraw. In Tables S1–S2, the ppm (parts-per-million) mass difference is shown when comparing the observed mass to its theoretical mass for a specific ladder component, and a value less than 10 ppm is considered a good match for each base calling.

8. Sequencing RNA mixtures

Label a mixture of five RNA strands (RNA #1 to #5) at their 3´-ends with A(5´)pp(5´)Cp-TEG-biotin using a one-step protocol described in step 2.2. In a total volume of 150 µL reaction solution, add 15 µL of 10x T4 RNA ligase reaction buffer, 1.5 µL of each RNA strand (100 µM stock of RNA #1 to #5, respectively, for a total volume of 7.5 µL), 10 µL of 150 µM A(5´)pp(5´)Cp-TEG-biotin, 15 µL of anhydrous DMSO, 5 µL of T4 RNA ligase (10 units/µL), and 97.5 µL of DEPC-treated H₂O. Equally distribute the reaction solution into five aliquots. Each RNase-free microcentrifuge tube contains 30 µL of reaction solution.
Incubate the reaction overnight at 16 °C in a PCR machine.
Perform column purification according to the procedure as described in steps 2.1.5-2.1.8 with five parallel spin columns. Elute a mixture sample of 3´-biotinylated 5 RNA strands (mixture of RNA #1 to #5) to a 1.5 mL RNase-free microcentrifuge tube each with 15 µL of DEPC-treated H₂O.
Combine the purified mixture samples from the five collection tubes into one tube. Perform formic acid degradation according to the procedure described in Section 4.
Measure samples by LC-MS as described in Section 6, and analyze the data using the data analysis software with optimized MFE settings to extract data containing mass, t_R, and volume as described in step 6.5. The typical processing and base-calling algorithm is not applied due to the significantly increased data complexity resulting from the mixture. All bases in the RNA of the mixed sample are called manually in a method similar to Section 7.3 and match well with the theoretical ones in the RNA nucleotide and modification database⁸, thus the complete sequences of all five RNA strands in the mixed sample are accurately read out. In Tables S7–S11, all information is listed including observed mass, t_R, volume, quality score and ppm mass difference.

Results

Introducing a biotin tag to the 3´-end of RNA to produce easily-identifiable mass-t_R ladders. The workflow of the 2D-HELS MS Seq approach is demonstrated in Figure 1a. The hydrophobic biotin label introduced to the 3´-end of the RNA (see Section 2) increases the masses and t_Rs of the 3´-labeled ladder components when compared to those of their unlabeled counterparts. Thus, the 3´-ladder curve is shifted to greater y-axis values (due ...

Discussion

Unlike tandem-based MS fragmentation, highly controlled acidic hydrolysis is used in the 2D-HELS MS Seq approach to fragment the RNA before analysis with a mass spectrometer⁹^,¹⁰. As a result, each acid-degraded fragment can be detected by the instrument, forming the equivalent of a sequencing ladder. Under optimal conditions, this method creates an “ideal” sequence ladder from RNA via, on average, one-per-molecule site-specific RNA cleavage e...

Disclosures

The authors have filed a provisional patent related to the technology discussed in this manuscript.

Acknowledgements

The authors acknowledge the R21 grant from National Institutes of Health (1R21HG009576) to S. Z. and W. L. and New York Institute of Technology (NYIT) Institutional Support for Research and Creativity grants to S. Z., which supported this work. The authors would like to thank PhD student Xuanting Wang (Columbia University) for assisting in figure-making, and thank Prof. Michael Hadjiargyrou (NYIT), Prof. Jingyue Ju (Columbia University), Drs. James Russo, Shiv Kumar, Xiaoxu Li, Steffen Jockusch, and other members of the Ju lab (Columbia University), Dr. Yongdong Wang (Cerno Bioscience), Meina Aziz (NYIT), and Wenhao Ni (NYIT) for helpful discussions and suggestions for our manuscript.

Materials

Name	Company	Catalog Number	Comments
5' DNA Adenylation kit	New England Biolabs	E2610S	50uM concentration
6550 Q-TOF mass spectrometer	Agilent Technologies	5991-2116EN	Coupled to a 1290 Infinity LC system
A(5´)pp(5´)Cp-TEG-biotin-3´	ChemGenes	91718	HPLC purified
ATPγS	Sigma-Aldrich	11162306001	Lithium salt
Bicine	Sigma-Aldrich	B8660	BioXtra, ≥99% (titration)
Biotin maleimide	Vector Laboratories	SP-1501	Long arm
C18 column	Waters	186003532	50 mm × 2.1 mm Xbridge C18 column with a particle size of 1.7 μm
Centrifugal Vacuum Concentrator	Labconco	Refrig 115v/60hz 7310022	Labconco CentriVap
ChemBioDraw	PerkinElmer	ChemDraw Prime	Generate a chemical structure and property data of structures & fragments
CMC (N-cyclohexyl-Nʹ-(2-morpholinoethyl)-carbodiimide metho-p-toluenesulfonate)	Sigma-Aldrich	2491-17-0	95% Purifiy
Cyanine3 maleimide (Cy3)	Lumiprobe	11080	Water insoluble
DEPC-treated water	Thermo Fisher Scientific	AM9906	Autoclaved, certified nuclease-free
Diisopropylamine (DIPA)	Thermo Fisher Scientific	108-18-9	99% Alfa Aesar
DMSO	Sigma-Aldrich	276855	Anhydrous dimethyl sulfoxide, 99.9%
EDTA	Sigma-Aldrich	E6758	Anhydrous, crystalline, BioReagent, suitable for cell culture
Formic acid	Merck	64-18-6	98-100%, ACS reag, Ph Eur
Hexafluoro-2-propanol (HFIP)	Thermo Fisher Scientific	920-66-1	99% Acros Organics
LC-MS sample vials	Thermo Fisher Scientific	C4000-11	Plastic screw thread vials
LC-MS vial caps	Thermo Fisher Scientific	C5000-54A	Autosampler vial screw thread caps
Na₂CO₃ buffer	Sigma-Aldrich	88975	BioUltra, >0.1 M Na₂CO₃, >0.2 M NaHCO₃
Oligo Clean & Concentrator	Zymo Research	D4060	Spin column
OriginLab	OriginLab	OriginPro	Data analysis and graphing software
pCp-biotin	TriLink BioTechnologies	NU-1706-BIO	20 ul (1 mM)
RNA #1--#6	Integrated DNA Technologies	Custom RNA oligos	19nt-21nt single-stranded RNAs, used without further purification
Rocking platform shaker	VWR	Orbital Shaker Standard 1000	Speed Range 40 to 300 rpm
Streptavidin magnetic beads	Thermo Fisher Scientific	88816	Binding approx. 55ug biotinylated rabbit lgG per mg of beads
Sulfonated Cyanine3 maleimide	Lumiprobe	11380	Water soluble
T4 DNA ligase 1	New England Biolabs	M0202S	400 units/uL
T4 polynucleotide kinase	Sigma-Aldrich	T4PNK-RO	From phage T4 am N81 pse T1 infected Escherichia coli BB
Tris-HCl buffer	Sigma-Aldrich	T6455	Tris-HCl Buffer, pH 10, 10×, Antigen Retriever
Urea	Sigma-Aldrich	81871	Urea for synthesis. CAS No. 57-13-6, EC Number 200-315-5.

References

Addepalli, B., Venus, S., Thakur, P., Limbach, P. A. Novel ribonuclease activity of cusativin from Cucumis sativus for mapping nucleoside modifications in RNA. Analytical and Bioanalytical Chemistry. 409 (24), 5645-5654 (2017).
Gao, H., Liu, Y., Rumley, M., Yuan, H., Mao, B. Sequence confirmation of chemically modified RNAs using exonuclease digestion and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Rapid Communications in Mass Spectrometry. 23 (21), 3423-3430 (2009).
McLuckey, S. A., Van Berkel, G. J., Glish, G. L. Tandem mass spectrometry of small, multiply charged oligonucleotides. Journal of The American Society for Mass Spectrometry. 3 (1), 60-70 (1992).
Fountain, K. J., Gilar, M., Gebler, J. C. Analysis of native and chemically modified oligonucleotides by tandem ion-pair reversed-phase high-performance liquid chromatography/electrospray ionization mass spectrometry. Rapid Communications in Mass Spectrometry. 17 (7), 646-653 (2003).
Taucher, M., Breuker, K. Characterization of modified RNA by top-down mass spectrometry. Angewandte Chemie International Edition in English. 51 (45), 11289-11292 (2012).
Kellner, S., Burhenne, J., Helm, M. Detection of RNA modifications. RNA Biology. 7 (2), 237-247 (2010).
Thomas, B., Akoulitchev, A. V. Mass spectrometry of RNA. Trends in Biochemical Sciences. 31 (3), 173-181 (2006).
Bjorkbom, A., et al. Bidirectional direct sequencing of noncanonical RNA by two-dimensional analysis of mass chromatograms. Journal of the American Chemical Society. 137 (45), 14430-14438 (2015).
Zhang, N., et al. A general LC-MS-based RNA sequencing method for direct analysis of multiple-base modifications in RNA mixtures. Nucleic Acids Research. 47 (20), 125 (2019).
Zhang, N., et al. Direct sequencing of tRNA by 2D-HELS-AA MS Seq reveals its different isoforms and dynamic base modifications. ACS Chemical Biology. 15 (6), 1464-1472 (2020).
Bakin, A., Ofengand, J. Four newly located pseudouridylate residues in Escherichia coli 23S ribosomal RNA are all at the peptidyltransferase center: analysis by the application of a new sequencing technique. Biochemistry. 32 (37), 9754-9762 (1993).
Cantara, W. A., et al. The RNA Modification Database, RNAMDB: 2011 update. Nucleic Acids Research. 39 (Database issue), D195-D201 (2011).

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Explore More Articles

2D HELS MS Seq RNA Sequencing Mass Spectrometry De Novo Sequencing Modified Nucleotides Pseudouridine 5 methylcytosine LC MS Therapeutic RNA Oligonucleotides

This article has been published

Video Coming Soon

Keep me updated:

Method Article