Causal models and prediction in cell line perturbation experiments

Abstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computati...

Full description

Saved in:
Bibliographic Details
Main Authors: James P. Long, Yumeng Yang, Shohei Shimizu, Thong Pham, Kim-Anh Do
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-024-06027-7
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544236773670912
author James P. Long
Yumeng Yang
Shohei Shimizu
Thong Pham
Kim-Anh Do
author_facet James P. Long
Yumeng Yang
Shohei Shimizu
Thong Pham
Kim-Anh Do
author_sort James P. Long
collection DOAJ
description Abstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational models that can predict cellular responses to perturbations in silico. A central challenge for these models is to predict the effect of new, previously untested perturbations that were not used in the training data. Here we propose causal structural equations for modeling how perturbations effect cells. From this model, we derive two estimators for predicting responses: a Linear Regression (LR) estimator and a causal structure learning estimator that we term Causal Structure Regression (CSR). The CSR estimator requires more assumptions than LR, but can predict the effects of drugs that were not applied in the training data. Next we present Cellbox, a recently proposed system of ordinary differential equations (ODEs) based model that obtained the best prediction performance on a Melanoma cell line perturbation data set (Yuan et al. in Cell Syst 12:128–140, 2021). We derive analytic results that show a close connection between CSR and Cellbox, providing a new causal interpretation for the Cellbox model. We compare LR and CSR/Cellbox in simulations, highlighting the strengths and weaknesses of the two approaches. Finally we compare the performance of LR and CSR/Cellbox on the benchmark Melanoma data set. We find that the LR model has comparable or slightly better performance than Cellbox.
format Article
id doaj-art-758c85ba91f546da84091b731152fcce
institution Kabale University
issn 1471-2105
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj-art-758c85ba91f546da84091b731152fcce2025-01-12T12:41:55ZengBMCBMC Bioinformatics1471-21052025-01-0126111710.1186/s12859-024-06027-7Causal models and prediction in cell line perturbation experimentsJames P. Long0Yumeng Yang1Shohei Shimizu2Thong Pham3Kim-Anh Do4Department of Biostatistics, The University of Texas MD Anderson Cancer CenterBiomedical Informatics, The University of Texas Health Science Center at HoustonFaculty of Data Science, Shiga UniversityFaculty of Data Science, Shiga UniversityDepartment of Biostatistics, The University of Texas MD Anderson Cancer CenterAbstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational models that can predict cellular responses to perturbations in silico. A central challenge for these models is to predict the effect of new, previously untested perturbations that were not used in the training data. Here we propose causal structural equations for modeling how perturbations effect cells. From this model, we derive two estimators for predicting responses: a Linear Regression (LR) estimator and a causal structure learning estimator that we term Causal Structure Regression (CSR). The CSR estimator requires more assumptions than LR, but can predict the effects of drugs that were not applied in the training data. Next we present Cellbox, a recently proposed system of ordinary differential equations (ODEs) based model that obtained the best prediction performance on a Melanoma cell line perturbation data set (Yuan et al. in Cell Syst 12:128–140, 2021). We derive analytic results that show a close connection between CSR and Cellbox, providing a new causal interpretation for the Cellbox model. We compare LR and CSR/Cellbox in simulations, highlighting the strengths and weaknesses of the two approaches. Finally we compare the performance of LR and CSR/Cellbox on the benchmark Melanoma data set. We find that the LR model has comparable or slightly better performance than Cellbox.https://doi.org/10.1186/s12859-024-06027-7Causal inferencePredictionPerturbation biologySystems biology
spellingShingle James P. Long
Yumeng Yang
Shohei Shimizu
Thong Pham
Kim-Anh Do
Causal models and prediction in cell line perturbation experiments
BMC Bioinformatics
Causal inference
Prediction
Perturbation biology
Systems biology
title Causal models and prediction in cell line perturbation experiments
title_full Causal models and prediction in cell line perturbation experiments
title_fullStr Causal models and prediction in cell line perturbation experiments
title_full_unstemmed Causal models and prediction in cell line perturbation experiments
title_short Causal models and prediction in cell line perturbation experiments
title_sort causal models and prediction in cell line perturbation experiments
topic Causal inference
Prediction
Perturbation biology
Systems biology
url https://doi.org/10.1186/s12859-024-06027-7
work_keys_str_mv AT jamesplong causalmodelsandpredictionincelllineperturbationexperiments
AT yumengyang causalmodelsandpredictionincelllineperturbationexperiments
AT shoheishimizu causalmodelsandpredictionincelllineperturbationexperiments
AT thongpham causalmodelsandpredictionincelllineperturbationexperiments
AT kimanhdo causalmodelsandpredictionincelllineperturbationexperiments