Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction.
The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computational approaches have emerged as valuable tools, le...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2024-12-01
|
Series: | PLoS Computational Biology |
Online Access: | https://doi.org/10.1371/journal.pcbi.1012715 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841533245066313728 |
---|---|
author | Akash Bahai Chee Keong Kwoh Yuguang Mu Yinghui Li |
author_facet | Akash Bahai Chee Keong Kwoh Yuguang Mu Yinghui Li |
author_sort | Akash Bahai |
collection | DOAJ |
description | The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computational approaches have emerged as valuable tools, leveraging physics-based-principles and machine learning to predict RNA structures rapidly. Despite advancements, the accuracy of computational methods remains modest, especially when compared to protein structure prediction. Deep learning methods, while successful in protein structure prediction, have shown some promise for RNA structure prediction as well, but face unique challenges. This study systematically benchmarks state-of-the-art deep learning methods for RNA structure prediction across diverse datasets. Our aim is to identify factors influencing performance variation, such as RNA family diversity, sequence length, RNA type, multiple sequence alignment (MSA) quality, and deep learning model architecture. We show that generally ML-based methods perform much better than non-ML methods on most RNA targets, although the performance difference isn't substantial when working with unseen novel or synthetic RNAs. The quality of the MSA and secondary structure prediction both play an important role and most methods aren't able to predict non-Watson-Crick pairs in the RNAs. Overall among the automated 3D RNA structure prediction methods, DeepFoldRNA has the best prediction results followed by DRFold as the second best method. Finally, we also suggest possible mitigations to improve the quality of the prediction for future method development. |
format | Article |
id | doaj-art-81174e33552a498e978043f093bf1c5a |
institution | Kabale University |
issn | 1553-734X 1553-7358 |
language | English |
publishDate | 2024-12-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS Computational Biology |
spelling | doaj-art-81174e33552a498e978043f093bf1c5a2025-01-17T05:30:55ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582024-12-012012e101271510.1371/journal.pcbi.1012715Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction.Akash BahaiChee Keong KwohYuguang MuYinghui LiThe 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computational approaches have emerged as valuable tools, leveraging physics-based-principles and machine learning to predict RNA structures rapidly. Despite advancements, the accuracy of computational methods remains modest, especially when compared to protein structure prediction. Deep learning methods, while successful in protein structure prediction, have shown some promise for RNA structure prediction as well, but face unique challenges. This study systematically benchmarks state-of-the-art deep learning methods for RNA structure prediction across diverse datasets. Our aim is to identify factors influencing performance variation, such as RNA family diversity, sequence length, RNA type, multiple sequence alignment (MSA) quality, and deep learning model architecture. We show that generally ML-based methods perform much better than non-ML methods on most RNA targets, although the performance difference isn't substantial when working with unseen novel or synthetic RNAs. The quality of the MSA and secondary structure prediction both play an important role and most methods aren't able to predict non-Watson-Crick pairs in the RNAs. Overall among the automated 3D RNA structure prediction methods, DeepFoldRNA has the best prediction results followed by DRFold as the second best method. Finally, we also suggest possible mitigations to improve the quality of the prediction for future method development.https://doi.org/10.1371/journal.pcbi.1012715 |
spellingShingle | Akash Bahai Chee Keong Kwoh Yuguang Mu Yinghui Li Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. PLoS Computational Biology |
title | Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. |
title_full | Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. |
title_fullStr | Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. |
title_full_unstemmed | Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. |
title_short | Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. |
title_sort | systematic benchmarking of deep learning methods for tertiary rna structure prediction |
url | https://doi.org/10.1371/journal.pcbi.1012715 |
work_keys_str_mv | AT akashbahai systematicbenchmarkingofdeeplearningmethodsfortertiaryrnastructureprediction AT cheekeongkwoh systematicbenchmarkingofdeeplearningmethodsfortertiaryrnastructureprediction AT yuguangmu systematicbenchmarkingofdeeplearningmethodsfortertiaryrnastructureprediction AT yinghuili systematicbenchmarkingofdeeplearningmethodsfortertiaryrnastructureprediction |