Skip to main content
eScholarship
Open Access Publications from the University of California

UC Santa Cruz

UC Santa Cruz Electronic Theses and Dissertations bannerUC Santa Cruz

Modification Detection using Nanopore Sequencing

Creative Commons 'BY-SA' version 4.0 license
Abstract

Both DNA and RNA modifications play critical roles in cell regulation. Traditionally, a chemical selection process alters base pairing or sequencing coverage is used to sequence modified nucleotides. Therefore, a new chemical labeling process needs to be created for each modification. Currently, we do not have methods for sequencing the majority of the over 150 RNA and over 40 DNA modifications. However, with nanopore sequencing, we can directly detect modifications on native DNA or RNA reads without any selection or chemical labeling techniques. Nanopore sequencing measures the change in current across a nanopore as a polynucleotide threads through the nanopore and we can use this signal to identify modifications.

In chapter 1, we present a framework for the unsupervised determination of the number of nucleotide modifications from nanopore sequencing readouts. We demonstrate the approach can effectively recapitulate the number of modifications, the corresponding ionic current signal levels, as well as mixing proportions under both DNA and RNA contexts. We further show, by integrating information from multiple detected modification regions, that the modification status of DNA and RNA molecules can be inferred. This method forms a key step of de novo characterization of nucleotide modifications.

In chapter 2, we present a graph convolutional network-based deep learning framework for predicting the mean of kmer distributions from corresponding chemical structures. We show such a framework can generalize the chemical information of the 5-methyl group from thymine to cytosine by correctly predicting 5-methylcytosine-containing DNA 6mers.

In chapter 3, using a combination of yeast genetics and nanopore direct RNA sequencing, we have developed a reliable method to track the modification status of single rRNA molecules at 37 sites in 18S rRNA and 73 sites in 25S rRNA. We use our method to characterize patterns of modification heterogeneity and identify concerted modification of nucleotides found near functional centers of the ribosome. Distinct undermodified subpopulations of rRNAs accumulate when ribosome biogenesis is compromised by loss of Dbp3 or Prp43-related RNA helicase function. Modification profiles are surprisingly resistant to change in response to many genetic and environmental conditions that affect translation, ribosome biogenesis, and pre-mRNA splicing. The ability to capture complete modification profiles for RNAs at single-molecule resolution will provide new insights into the roles of nucleotide modifications in RNA function.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View