Skip to main content
eScholarship
Open Access Publications from the University of California

Department of Statistics, UCLA

Open Access Policy Deposits bannerUCLA

This series is automatically populated with publications deposited by UCLA Department of Statistics researchers in accordance with the University of California’s open access policies. For more information see Open Access Policy Deposits and the UC Publication Management System.

scCDC: a computational method for gene-specific contamination detection and correction in single-cell and single-nucleus RNA-seq data

(2024)

In droplet-based single-cell and single-nucleus RNA-seq assays, systematic contamination of ambient RNA molecules biases the quantification of gene expression levels. Existing methods correct the contamination for all genes globally. However, there lacks specific evaluation of correction efficacy for varying contamination levels. Here, we show that DecontX and CellBender under-correct highly contaminating genes, while SoupX and scAR over-correct lowly/non-contaminating genes. Here, we develop scCDC as the first method to detect the contamination-causing genes and only correct expression levels of these genes, some of which are cell-type markers. Compared with existing decontamination methods, scCDC excels in decontaminating highly contaminating genes while avoiding over-correction of other genes.

Cover page of Estimating Asymptomatic and Symptomatic Transmission of the COVID-19 First Few Cases in Selenge Province, Mongolia.

Estimating Asymptomatic and Symptomatic Transmission of the COVID-19 First Few Cases in Selenge Province, Mongolia.

(2024)

BACKGROUND: Following the first locally transmitted case in Sukhbaatar soum, Selenge Province, we aimed to investigate the ultimate scale of the epidemic in the scenario of uninterrupted transmission. METHODS: This was a prospective case study following the locally modified WHO FFX cases generic protocol. A rapid response team collected data from November 14 to 29, 2020. We created a stochastic process to draw many transmission chains from this greater distribution to better understand and make inferences regarding the outbreak under investigation. RESULTS: The majority of the cases involved household transmissions (35, 52.2%), work transmissions (20, 29.9%), index (5, 7.5%), same apartment transmissions (2, 3.0%), school transmissions (2, 3.0%), and random contacts between individuals transmissions (1, 1.5%). The posterior means of the basic reproduction number of both the asymptomatic cases R 0 Asy $$ {R}_0^{Asy} $$ and the presymptomatic cases R 0 Pre $$ {R}_0^{Pre} $$ (1.35 [95% CrI 0.88-1.86] and 1.29 [95% CrI 0.67-2.10], respectively) were lower than that of the symptomatic cases (2.00 [95% Crl 1.38-2.76]). CONCLUSION: Our study highlights the heterogeneity of COVID-19 transmission across different symptom statuses and underscores the importance of early identification and isolation of symptomatic cases in disease control. Our approach, which combines detailed contact tracing data with advanced statistical methods, can be applied to other infectious diseases, facilitating a more nuanced understanding of disease transmission dynamics.

A genome-wide spectrum of tandem repeat expansions in 338,963 humans

(2024)

The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.

Cover page of Engineering a Programmed Death-Ligand 1-Targeting Monobody Via Directed Evolution for SynNotch-Gated Cell Therapy.

Engineering a Programmed Death-Ligand 1-Targeting Monobody Via Directed Evolution for SynNotch-Gated Cell Therapy.

(2024)

Programmed death-ligand 1 (PD-L1) is a promising target for cancer immunotherapy due to its ability to inhibit T cell activation; however, its expression on various noncancer cells may cause on-target off-tumor toxicity when designing PD-L1-targeting Chimeric Antigen Receptor (CAR) T cell therapies. Combining rational design and directed evolution of the human fibronectin-derived monobody scaffold, PDbody was engineered to bind to PD-L1 with a preference for a slightly lower pH, which is typical in the tumor microenvironment. PDbody was further utilized as a CAR to target the PD-L1-expressing triple negative MDA-MB-231 breast cancer cell line. To mitigate on-target off-tumor toxicity associated with targeting PD-L1, a Cluster of Differentiation 19 (CD19)-recognizing SynNotch IF THEN gate was integrated into the system. This CD19-SynNotch PDbody-CAR system was then expressed in primary human T cells to target CD19-expressing MDA-MB-231 cancer cells. These CD19-SynNotch PDbody-CAR T cells demonstrated both specificity and efficacy in vitro, accurately eradicating cancer targets in cytotoxicity assays. Moreover, in an in vivo bilateral murine tumor model, they exhibited the capability to effectively restrain tumor growth. Overall, CD19-SynNotch PDbody-CAR T cells represent a distinct development over previously published designs due to their increased efficacy, proliferative capability, and mitigation of off-tumor toxicity for solid tumor treatment.

Cover page of An Unexpected Decline in Spring Atmospheric Humidity in the Interior Southwestern United States and Implications for Forest Fires

An Unexpected Decline in Spring Atmospheric Humidity in the Interior Southwestern United States and Implications for Forest Fires

(2024)

Abstract: On seasonal time scales, vapor pressure deficit (VPD) is a known predictor of burned area in the southwestern United States (“the Southwest”). VPD increases with atmospheric warming due to the exponential relationship between temperature and saturation vapor pressure. Another control on VPD is specific humidity, such that increases in specific humidity can counteract temperature-driven increases in VPD. Unexpectedly, despite the increased capacity of a warmer atmosphere to hold water vapor, near-surface specific humidity decreased from 1970 to 2019 in much of the Southwest, particularly in spring, summer, and fall. Here, we identify declining near-surface humidity from 1970 to 2019 in the southwestern United States with both reanalysis and in situ station data. Focusing on the interior Southwest in the months preceding the summer forest fire season, we explain the decline in terms of changes in atmospheric circulation and moisture fluxes between the surface and the atmosphere. We find that an early spring decline in precipitation in the interior region induced a decline in soil moisture and evapotranspiration, drying the lower troposphere in summer. This prior season precipitation decline is in turn related to a trend toward a Northern Hemisphere stationary wave pattern. Finally, using fixed humidity scenarios and the observed exponential relationship between VPD and burned forest area, we estimate that with no increase in temperature at all, the humidity decline alone would still lead to nearly one-quarter of the observed VPD-induced increase in burned area over 1984–2019. Significance Statement: Burned forest area has increased significantly in the southwestern United States in recent decades, driven in part by an increase in atmospheric aridity [vapor pressure deficit (VPD)]. Increases in VPD can be caused by a combination of increasing temperature and decreasing specific humidity. As the atmosphere warms with climate change, its capacity to hold moisture increases. Despite this, there is a decrease in near-surface air humidity in the interior southwestern United States over 1970–2019, which during the summer is likely caused by a decline in early spring precipitation leading to limited soil moisture and evaporation in spring and summer. We estimate that this declining humidity alone, without an increase in temperature, would cause about one-quarter of the VPD-induced increase in burned forest area in this region over 1984–2019.

Cover page of Causal inference over stochastic networks

Causal inference over stochastic networks

(2024)

Abstract: Claiming causal inferences in network settings necessitates careful consideration of the often complex dependency between outcomes for actors. Of particular importance are treatment spillover or outcome interference effects. We consider causal inference when the actors are connected via an underlying network structure. Our key contribution is a model for causality when the underlying network is endogenous; where the ties between actors and the actor covariates are statistically dependent. We develop a joint model for the relational and covariate generating process that avoids restrictive separability and fixed network assumptions, as these rarely hold in realistic social settings. While our framework can be used with general models, we develop the highly expressive class of Exponential-family Random Network models (ERNM) of which Markov random fields and Exponential-family Random Graph models are special cases. We present potential outcome-based inference within a Bayesian framework and propose a modification to the exchange algorithm to allow for sampling from ERNM posteriors. We present results of a simulation study demonstrating the validity of the approach. Finally, we demonstrate the value of the framework in a case study of smoking in the context of adolescent friendship networks.