Enhancing bowel sound recognition with self-attention and self-supervised pre-training.

Bowel sounds, a reflection of the gastrointestinal tract's peristalsis, are essential for diagnosing and monitoring gastrointestinal conditions. However, the absence of an effective, non-invasive method for assessing digestion through auscultation has resulted in a reliance on time-consuming an...

Full description

Saved in:
Bibliographic Details
Main Authors: Yansuo Yu, Mingwu Zhang, Zhennian Xie, Qiang Liu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0311503
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841555527996276736
author Yansuo Yu
Mingwu Zhang
Zhennian Xie
Qiang Liu
author_facet Yansuo Yu
Mingwu Zhang
Zhennian Xie
Qiang Liu
author_sort Yansuo Yu
collection DOAJ
description Bowel sounds, a reflection of the gastrointestinal tract's peristalsis, are essential for diagnosing and monitoring gastrointestinal conditions. However, the absence of an effective, non-invasive method for assessing digestion through auscultation has resulted in a reliance on time-consuming and laborious manual analysis by clinicians. This study introduces an innovative deep learning-based method designed to automate and enhance the recognition of bowel sounds. Our approach integrates the Branchformer architecture, which leverages the power of self-attention and convolutional gating for robust feature extraction, with a self-supervised pre-training strategy. Specifically, the Branchformer model employs parallel processing of self-attention and convolutional gated Multi-layer Perceptron branches to capture both global and local dependencies in audio signals, thereby enabling effective characterization of complex bowel sound patterns. Furthermore, a self-supervised pre-training strategy is employed, leveraging a large corpus of unlabeled audio data to learn general sound wave representations, followed by fine-tuning on a limited set of bowel sound data to optimize the model's recognition performance for specific tasks. Experimental results on public bowel sound datasets demonstrate the superior recognition performance of the proposed method compared to existing baseline models, particularly under data-limited conditions, thereby confirming the effectiveness of the self-supervised pre-training strategy. This work provides an efficient and automated solution for clinical bowel sound monitoring, facilitating early diagnosis and treatment of gastrointestinal disorders.
format Article
id doaj-art-9c74e52359694ac4aa21a70a34d3f8f8
institution Kabale University
issn 1932-6203
language English
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-9c74e52359694ac4aa21a70a34d3f8f82025-01-08T05:32:18ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-011912e031150310.1371/journal.pone.0311503Enhancing bowel sound recognition with self-attention and self-supervised pre-training.Yansuo YuMingwu ZhangZhennian XieQiang LiuBowel sounds, a reflection of the gastrointestinal tract's peristalsis, are essential for diagnosing and monitoring gastrointestinal conditions. However, the absence of an effective, non-invasive method for assessing digestion through auscultation has resulted in a reliance on time-consuming and laborious manual analysis by clinicians. This study introduces an innovative deep learning-based method designed to automate and enhance the recognition of bowel sounds. Our approach integrates the Branchformer architecture, which leverages the power of self-attention and convolutional gating for robust feature extraction, with a self-supervised pre-training strategy. Specifically, the Branchformer model employs parallel processing of self-attention and convolutional gated Multi-layer Perceptron branches to capture both global and local dependencies in audio signals, thereby enabling effective characterization of complex bowel sound patterns. Furthermore, a self-supervised pre-training strategy is employed, leveraging a large corpus of unlabeled audio data to learn general sound wave representations, followed by fine-tuning on a limited set of bowel sound data to optimize the model's recognition performance for specific tasks. Experimental results on public bowel sound datasets demonstrate the superior recognition performance of the proposed method compared to existing baseline models, particularly under data-limited conditions, thereby confirming the effectiveness of the self-supervised pre-training strategy. This work provides an efficient and automated solution for clinical bowel sound monitoring, facilitating early diagnosis and treatment of gastrointestinal disorders.https://doi.org/10.1371/journal.pone.0311503
spellingShingle Yansuo Yu
Mingwu Zhang
Zhennian Xie
Qiang Liu
Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
PLoS ONE
title Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
title_full Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
title_fullStr Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
title_full_unstemmed Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
title_short Enhancing bowel sound recognition with self-attention and self-supervised pre-training.
title_sort enhancing bowel sound recognition with self attention and self supervised pre training
url https://doi.org/10.1371/journal.pone.0311503
work_keys_str_mv AT yansuoyu enhancingbowelsoundrecognitionwithselfattentionandselfsupervisedpretraining
AT mingwuzhang enhancingbowelsoundrecognitionwithselfattentionandselfsupervisedpretraining
AT zhennianxie enhancingbowelsoundrecognitionwithselfattentionandselfsupervisedpretraining
AT qiangliu enhancingbowelsoundrecognitionwithselfattentionandselfsupervisedpretraining