Causal models and prediction in cell line perturbation experiments
Abstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computati...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2025-01-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-024-06027-7 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841544236773670912 |
---|---|
author | James P. Long Yumeng Yang Shohei Shimizu Thong Pham Kim-Anh Do |
author_facet | James P. Long Yumeng Yang Shohei Shimizu Thong Pham Kim-Anh Do |
author_sort | James P. Long |
collection | DOAJ |
description | Abstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational models that can predict cellular responses to perturbations in silico. A central challenge for these models is to predict the effect of new, previously untested perturbations that were not used in the training data. Here we propose causal structural equations for modeling how perturbations effect cells. From this model, we derive two estimators for predicting responses: a Linear Regression (LR) estimator and a causal structure learning estimator that we term Causal Structure Regression (CSR). The CSR estimator requires more assumptions than LR, but can predict the effects of drugs that were not applied in the training data. Next we present Cellbox, a recently proposed system of ordinary differential equations (ODEs) based model that obtained the best prediction performance on a Melanoma cell line perturbation data set (Yuan et al. in Cell Syst 12:128–140, 2021). We derive analytic results that show a close connection between CSR and Cellbox, providing a new causal interpretation for the Cellbox model. We compare LR and CSR/Cellbox in simulations, highlighting the strengths and weaknesses of the two approaches. Finally we compare the performance of LR and CSR/Cellbox on the benchmark Melanoma data set. We find that the LR model has comparable or slightly better performance than Cellbox. |
format | Article |
id | doaj-art-758c85ba91f546da84091b731152fcce |
institution | Kabale University |
issn | 1471-2105 |
language | English |
publishDate | 2025-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj-art-758c85ba91f546da84091b731152fcce2025-01-12T12:41:55ZengBMCBMC Bioinformatics1471-21052025-01-0126111710.1186/s12859-024-06027-7Causal models and prediction in cell line perturbation experimentsJames P. Long0Yumeng Yang1Shohei Shimizu2Thong Pham3Kim-Anh Do4Department of Biostatistics, The University of Texas MD Anderson Cancer CenterBiomedical Informatics, The University of Texas Health Science Center at HoustonFaculty of Data Science, Shiga UniversityFaculty of Data Science, Shiga UniversityDepartment of Biostatistics, The University of Texas MD Anderson Cancer CenterAbstract In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational models that can predict cellular responses to perturbations in silico. A central challenge for these models is to predict the effect of new, previously untested perturbations that were not used in the training data. Here we propose causal structural equations for modeling how perturbations effect cells. From this model, we derive two estimators for predicting responses: a Linear Regression (LR) estimator and a causal structure learning estimator that we term Causal Structure Regression (CSR). The CSR estimator requires more assumptions than LR, but can predict the effects of drugs that were not applied in the training data. Next we present Cellbox, a recently proposed system of ordinary differential equations (ODEs) based model that obtained the best prediction performance on a Melanoma cell line perturbation data set (Yuan et al. in Cell Syst 12:128–140, 2021). We derive analytic results that show a close connection between CSR and Cellbox, providing a new causal interpretation for the Cellbox model. We compare LR and CSR/Cellbox in simulations, highlighting the strengths and weaknesses of the two approaches. Finally we compare the performance of LR and CSR/Cellbox on the benchmark Melanoma data set. We find that the LR model has comparable or slightly better performance than Cellbox.https://doi.org/10.1186/s12859-024-06027-7Causal inferencePredictionPerturbation biologySystems biology |
spellingShingle | James P. Long Yumeng Yang Shohei Shimizu Thong Pham Kim-Anh Do Causal models and prediction in cell line perturbation experiments BMC Bioinformatics Causal inference Prediction Perturbation biology Systems biology |
title | Causal models and prediction in cell line perturbation experiments |
title_full | Causal models and prediction in cell line perturbation experiments |
title_fullStr | Causal models and prediction in cell line perturbation experiments |
title_full_unstemmed | Causal models and prediction in cell line perturbation experiments |
title_short | Causal models and prediction in cell line perturbation experiments |
title_sort | causal models and prediction in cell line perturbation experiments |
topic | Causal inference Prediction Perturbation biology Systems biology |
url | https://doi.org/10.1186/s12859-024-06027-7 |
work_keys_str_mv | AT jamesplong causalmodelsandpredictionincelllineperturbationexperiments AT yumengyang causalmodelsandpredictionincelllineperturbationexperiments AT shoheishimizu causalmodelsandpredictionincelllineperturbationexperiments AT thongpham causalmodelsandpredictionincelllineperturbationexperiments AT kimanhdo causalmodelsandpredictionincelllineperturbationexperiments |