SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes

Abstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided...

Full description

Saved in:
Bibliographic Details
Main Authors: M. Eremeyeva, Y. Din, N. Shirokii, N. Serov
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-024-06019-7
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544251424374784
author M. Eremeyeva
Y. Din
N. Shirokii
N. Serov
author_facet M. Eremeyeva
Y. Din
N. Shirokii
N. Serov
author_sort M. Eremeyeva
collection DOAJ
description Abstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities. Results In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (kobs) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized kobs predicting algorithm achieving Q2 > 0.9 on experimental data published to date. Conclusions This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes.
format Article
id doaj-art-f2b3c8bacb9f49c1a4927e9773a60a91
institution Kabale University
issn 1471-2105
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj-art-f2b3c8bacb9f49c1a4927e9773a60a912025-01-12T12:41:57ZengBMCBMC Bioinformatics1471-21052025-01-0126111810.1186/s12859-024-06019-7SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymesM. Eremeyeva0Y. Din1N. Shirokii2N. Serov3International Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityInternational Institute “Solution Chemistry of Advanced Materials and Technologies”, ITMO UniversityAbstract Background Deoxyribozymes or DNAzymes represent artificial short DNA sequences bearing many catalytic properties. In particular, DNAzymes able to cleave RNA sequences have a huge potential in gene therapy and sequence-specific analytic detection of disease markers. This activity is provided by catalytic cores able to perform site-specific hydrolysis of the phosphodiester bond of an RNA substrate. However, the vast majority of existing DNAzyme catalytic cores have low efficacy in in vivo experiments, whereas SELEX based on in vitro screening offers long and expensive selection cycle with the average success rate of ~ 30%, moreover not allowing the direct selection of chemically modified DNAzymes, which were previously shown to demonstrate higher activity in vivo. Therefore, there is a huge need in in silico approach for exploratory analysis of RNA-cleaving DNAzyme cores to drastically ease the discovery of novel catalytic cores with superior activities. Results In this work, we develop a machine learning based open-source platform SequenceCraft allowing experimental scientists to perform DNAzyme exploratory analysis via quantitative observed rate constant (kobs) estimation as well as statistical and clustering data analysis. This became possible with the development of a unique curated database of > 350 RNA-cleaving catalytic cores, property-based sequence representations allowing to work with both conventional and chemically modified nucleotides, and optimized kobs predicting algorithm achieving Q2 > 0.9 on experimental data published to date. Conclusions This work represents a significant advancement in DNAzyme research, providing a tool for more efficient discovery of RNA-cleaving DNAzymes. The SequenceCraft platform offers an in silico alternative to traditional experimental approaches, potentially accelerating the development of DNAzymes.https://doi.org/10.1186/s12859-024-06019-7DeoxyribozymesMachine learningCatalytic activityArtificial intelligenceWeb resource
spellingShingle M. Eremeyeva
Y. Din
N. Shirokii
N. Serov
SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
BMC Bioinformatics
Deoxyribozymes
Machine learning
Catalytic activity
Artificial intelligence
Web resource
title SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
title_full SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
title_fullStr SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
title_full_unstemmed SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
title_short SequenceCraft: machine learning-based resource for exploratory analysis of RNA-cleaving deoxyribozymes
title_sort sequencecraft machine learning based resource for exploratory analysis of rna cleaving deoxyribozymes
topic Deoxyribozymes
Machine learning
Catalytic activity
Artificial intelligence
Web resource
url https://doi.org/10.1186/s12859-024-06019-7
work_keys_str_mv AT meremeyeva sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes
AT ydin sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes
AT nshirokii sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes
AT nserov sequencecraftmachinelearningbasedresourceforexploratoryanalysisofrnacleavingdeoxyribozymes