Skip to main content
eScholarship
Open Access Publications from the University of California

UCSF

UC San Francisco Previously Published Works bannerUCSF

Urn models for stochastic gene expression yield intuitive insights into the probability distributions of single-cell mRNA and protein counts

Published Web Location

https://doi.org/10.1088/1478-3975/aba50f
The data associated with this publication are in the supplemental files.
Abstract

Fitting the probability mass functions from analytical solutions of stochastic models of gene expression to the single-cell count distributions of mRNA and protein molecules can yield valuable insights into mechanisms underlying gene expression. Solutions of chemical master equations are available for various kinetic schemes but, even for the basic ON-OFF genetic switch, they take complex forms with generating functions given as hypergeometric functions. Interpretation of gene expression dynamics in terms of bursts is not consistent with the complete range of parameters for these functions. Physical insights into the probability mass functions are essential to ensure proper interpretations but are lacking for models considering genetic switches. To fill this gap, we develop urn models for stochastic gene expression. We sample RNA polymerases or ribosomes from a master urn, which represents the cytosol, and assign them to recipient urns of two or more colors, which represent time intervals in which no switching occurs. Colors of the recipient urns represent sub-systems of the promoter states, and the assignments to urns of a specific color represent gene expression. We use elementary principles of discrete probability theory to solve a range of kinetic models without feedback, including the Peccoud-Ycart model, the Shahrezaei-Swain model, and models with an arbitrary number of promoter states. In the last case, we obtain a novel result for the protein distribution. For activated genes, we show that transcriptional lapses, which are events of gene inactivation for short time intervals separated by long active intervals, quantify the transcriptional dynamics better than bursts. We show that the intuition gained from our urn models may also be useful in understanding existing solutions for models with feedback. We contrast our models with urn models for related distributions, discuss a generalization of the Delaporte distribution for single-cell data analysis, and highlight the limitations of our models.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View