SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
Abstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2025-01-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-024-06019-7 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841544251424374784 |
---|---|
author | M. Eremeyeva Y. Din N. Shirokii N. Serov |
author_facet | M. Eremeyeva Y. Din N. Shirokii N. Serov |
author_sort | M. Eremeyeva |
collection | DOAJ |
description | Abstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities. Results In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (kobs) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized kobs predicting algorithm achieving Q2 > 0.9 on experimental data published to date. Conclusions This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes. |
format | Article |
id | doaj-art-f2b3c8bacb9f49c1a4927e9773a60a91 |
institution | Kabale University |
issn | 1471-2105 |
language | English |
publishDate | 2025-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj-art-f2b3c8bacb9f49c1a4927e9773a60a912025-01-12T12:41:57ZengBMCBMC Bioinformatics1471-21052025-01-0126111810.1186/s12859-024-06019-7SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymesM. Eremeyeva0Y. Din1N. Shirokii2N. Serov3International Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityAbstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities. Results In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (kobs) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized kobs predicting algorithm achieving Q2 > 0.9 on experimental data published to date. Conclusions This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes.https://doi.org/10.1186/s12859-024-06019-7DeoxyribozymesMachine learningCatalytic activityArtificial intelligenceWeb resource |
spellingShingle | M. Eremeyeva Y. Din N. Shirokii N. Serov SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes BMC Bioinformatics Deoxyribozymes Machine learning Catalytic activity Artificial intelligence Web resource |
title | SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes |
title_full | SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes |
title_fullStr | SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes |
title_full_unstemmed | SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes |
title_short | SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes |
title_sort | sequencecraft machine learning based resource for exploratory analysis of rna cleaving deoxyribozymes |
topic | Deoxyribozymes Machine learning Catalytic activity Artificial intelligence Web resource |
url | https://doi.org/10.1186/s12859-024-06019-7 |
work_keys_str_mv | AT meremeyeva sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes AT ydin sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes AT nshirokii sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes AT nserov sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes |