Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform

Abstract Clematis is an excellent vertical greening plant for garden viewing vines, with great ornamental and high medicinal value. To obtain transcriptome information and functional gene data for the Clematis calyx, this study utilized three biological replicates of the calyx from each of the three...

Full description

Saved in:
Bibliographic Details
Main Authors: Song Liu, Wei Song, Wei Pan, Gangshan Wu
Format: Article
Language:English
Published: Nature Portfolio 2024-11-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-80504-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846158576975872000
author Song Liu
Wei Song
Wei Pan
Gangshan Wu
author_facet Song Liu
Wei Song
Wei Pan
Gangshan Wu
author_sort Song Liu
collection DOAJ
description Abstract Clematis is an excellent vertical greening plant for garden viewing vines, with great ornamental and high medicinal value. To obtain transcriptome information and functional gene data for the Clematis calyx, this study utilized three biological replicates of the calyx from each of the three Clematis varieties: ‘Henryi’, ‘Polish spirit’, and ‘Mme Julia Correvon’ Single-molecule real-time sequencing technology was employed for full-length transcriptome sequencing. The study revealed that 21,673,173 non-chimeric sequences were identified through full-length transcriptome sequencing. After clustering, correction, and redundancy removal, 40,465 high-quality, full-length transcripts were obtained. Among these transcripts, there were 15,488 long non-coding RNA (lncRNA), 9,212 simple sequence repeats (SSR) sites, 1,247 transcription factors (TFs), 1,228 alternative splicing events, and 7,189 selectively polyadenylation sites predicted. In addition, 7,442 primer pairs were designed based on the SSR sites, covering 80.76% of the total SSR. 15 primer pairs were randomly selected for amplification, and 73.3% of them were successfully amplified. Transcript annotation results showed that 38,439, 38,094, 26,815, and 33,407 transcripts were annotated to the Nr, KEGG, KOG, and SwissProt databases, respectively. Among these, the KEGG database identified 137 metabolic pathways, including biosynthesis of secondary metabolites, carbon metabolism, and amino acid biosynthesis. 104 and 7 transcripts are involved in the flavonoid and anthocyanin metabolism pathways, respectively. The above results provide initial insights into the transcriptome information and functional characteristics of Clematis calyx. This critical data supports future research on marker primers for the color mechanism of Clematis calyx, the pathways and regulatory mechanisms of flavonoids and other products, as well as specific trait genes.
format Article
id doaj-art-1b4aa6fc32c34f9cb00e8659a2dfe6c5
institution Kabale University
issn 2045-2322
language English
publishDate 2024-11-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-1b4aa6fc32c34f9cb00e8659a2dfe6c52024-11-24T12:23:29ZengNature PortfolioScientific Reports2045-23222024-11-0114111510.1038/s41598-024-80504-0Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platformSong Liu0Wei Song1Wei Pan2Gangshan Wu3Jiangsu Vocational College of Agriculture and ForestryJiangsu Vocational College of Agriculture and ForestryJiangsu Vocational College of Agriculture and ForestryJiangsu Vocational College of Agriculture and ForestryAbstract Clematis is an excellent vertical greening plant for garden viewing vines, with great ornamental and high medicinal value. To obtain transcriptome information and functional gene data for the Clematis calyx, this study utilized three biological replicates of the calyx from each of the three Clematis varieties: ‘Henryi’, ‘Polish spirit’, and ‘Mme Julia Correvon’ Single-molecule real-time sequencing technology was employed for full-length transcriptome sequencing. The study revealed that 21,673,173 non-chimeric sequences were identified through full-length transcriptome sequencing. After clustering, correction, and redundancy removal, 40,465 high-quality, full-length transcripts were obtained. Among these transcripts, there were 15,488 long non-coding RNA (lncRNA), 9,212 simple sequence repeats (SSR) sites, 1,247 transcription factors (TFs), 1,228 alternative splicing events, and 7,189 selectively polyadenylation sites predicted. In addition, 7,442 primer pairs were designed based on the SSR sites, covering 80.76% of the total SSR. 15 primer pairs were randomly selected for amplification, and 73.3% of them were successfully amplified. Transcript annotation results showed that 38,439, 38,094, 26,815, and 33,407 transcripts were annotated to the Nr, KEGG, KOG, and SwissProt databases, respectively. Among these, the KEGG database identified 137 metabolic pathways, including biosynthesis of secondary metabolites, carbon metabolism, and amino acid biosynthesis. 104 and 7 transcripts are involved in the flavonoid and anthocyanin metabolism pathways, respectively. The above results provide initial insights into the transcriptome information and functional characteristics of Clematis calyx. This critical data supports future research on marker primers for the color mechanism of Clematis calyx, the pathways and regulatory mechanisms of flavonoids and other products, as well as specific trait genes.https://doi.org/10.1038/s41598-024-80504-0Clematis CalyxSingle-molecule real-time sequencing technologyFull-length transcriptomeGene function annotation
spellingShingle Song Liu
Wei Song
Wei Pan
Gangshan Wu
Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
Scientific Reports
Clematis Calyx
Single-molecule real-time sequencing technology
Full-length transcriptome
Gene function annotation
title Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
title_full Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
title_fullStr Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
title_full_unstemmed Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
title_short Transcriptome sequencing and SSR prediction of Clematis calyx based on SMRT sequencing platform
title_sort transcriptome sequencing and ssr prediction of clematis calyx based on smrt sequencing platform
topic Clematis Calyx
Single-molecule real-time sequencing technology
Full-length transcriptome
Gene function annotation
url https://doi.org/10.1038/s41598-024-80504-0
work_keys_str_mv AT songliu transcriptomesequencingandssrpredictionofclematiscalyxbasedonsmrtsequencingplatform
AT weisong transcriptomesequencingandssrpredictionofclematiscalyxbasedonsmrtsequencingplatform
AT weipan transcriptomesequencingandssrpredictionofclematiscalyxbasedonsmrtsequencingplatform
AT gangshanwu transcriptomesequencingandssrpredictionofclematiscalyxbasedonsmrtsequencingplatform