A Review on Language-Independent Search on Speech and its Applications

A thorough analysis of language-independent search methods and models for speech detection, a crucial task in retrieving audio file from large archives based on spoken queries was presented in this study. Unlike traditional speech recognition, this “zero-resource task” doesn&am...

Full description

Saved in:
Bibliographic Details
Main Authors: Sushil Venkatesh Kulkarni, Sukomal Pal
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10807177/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849219990532653056
author Sushil Venkatesh Kulkarni
Sukomal Pal
author_facet Sushil Venkatesh Kulkarni
Sukomal Pal
author_sort Sushil Venkatesh Kulkarni
collection DOAJ
description A thorough analysis of language-independent search methods and models for speech detection, a crucial task in retrieving audio file from large archives based on spoken queries was presented in this study. Unlike traditional speech recognition, this “zero-resource task” doesn’t require specific training data or lexical information, relying on hypothesis testing and pattern matching instead. Spoken term detection is the process of searching for large audio databases. Typically, this consists of text-based “spoken term datasets” of specific languages, where sufficient data are available to train automatic speech recognition systems. Speech recognition enables human-machine communication through a variety of voice commands and clear instructions. Telephones and cellular systems are examples of these applications. The study examines modern spoken-term detection systems, highlighting significant advancements and performance improvements. It delves into various speech recognition techniques used in cross-media retrieval systems and machine learning methodologies, emphasizing the practical information retrieval capabilities of cross-modal learning approaches. The research aims to provide an in-depth analysis of methods combining text and image features, addressing topics previously overlooked in surveys. The motivation behind this study stems from the lack of comprehensive reviews on “image and text modalities,” ongoing challenges in the “cross-modal retrieval field,” and the untapped potential of image and text features in cross-modal retrieval development. By exploring state-of-the-art language-independent searches for speech recognition, this study sheds light on sophisticated applications and paves the way for further advancements in the field.
format Article
id doaj-art-2dbb1566e2f84a5fa2437bb59e4431ff
institution Kabale University
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-2dbb1566e2f84a5fa2437bb59e4431ff2024-12-25T00:01:28ZengIEEEIEEE Access2169-35362024-01-011219418219420210.1109/ACCESS.2024.352039410807177A Review on Language-Independent Search on Speech and its ApplicationsSushil Venkatesh Kulkarni0https://orcid.org/0000-0003-3466-2996Sukomal Pal1Department of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, IndiaDepartment of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, IndiaA thorough analysis of language-independent search methods and models for speech detection, a crucial task in retrieving audio file from large archives based on spoken queries was presented in this study. Unlike traditional speech recognition, this “zero-resource task” doesn’t require specific training data or lexical information, relying on hypothesis testing and pattern matching instead. Spoken term detection is the process of searching for large audio databases. Typically, this consists of text-based “spoken term datasets” of specific languages, where sufficient data are available to train automatic speech recognition systems. Speech recognition enables human-machine communication through a variety of voice commands and clear instructions. Telephones and cellular systems are examples of these applications. The study examines modern spoken-term detection systems, highlighting significant advancements and performance improvements. It delves into various speech recognition techniques used in cross-media retrieval systems and machine learning methodologies, emphasizing the practical information retrieval capabilities of cross-modal learning approaches. The research aims to provide an in-depth analysis of methods combining text and image features, addressing topics previously overlooked in surveys. The motivation behind this study stems from the lack of comprehensive reviews on “image and text modalities,” ongoing challenges in the “cross-modal retrieval field,” and the untapped potential of image and text features in cross-modal retrieval development. By exploring state-of-the-art language-independent searches for speech recognition, this study sheds light on sophisticated applications and paves the way for further advancements in the field.https://ieeexplore.ieee.org/document/10807177/Automatic speech recognitioncross modal representationinformation retrievalmachine learningspeech detection techniques
spellingShingle Sushil Venkatesh Kulkarni
Sukomal Pal
A Review on Language-Independent Search on Speech and its Applications
IEEE Access
Automatic speech recognition
cross modal representation
information retrieval
machine learning
speech detection techniques
title A Review on Language-Independent Search on Speech and its Applications
title_full A Review on Language-Independent Search on Speech and its Applications
title_fullStr A Review on Language-Independent Search on Speech and its Applications
title_full_unstemmed A Review on Language-Independent Search on Speech and its Applications
title_short A Review on Language-Independent Search on Speech and its Applications
title_sort review on language independent search on speech and its applications
topic Automatic speech recognition
cross modal representation
information retrieval
machine learning
speech detection techniques
url https://ieeexplore.ieee.org/document/10807177/
work_keys_str_mv AT sushilvenkateshkulkarni areviewonlanguageindependentsearchonspeechanditsapplications
AT sukomalpal areviewonlanguageindependentsearchonspeechanditsapplications
AT sushilvenkateshkulkarni reviewonlanguageindependentsearchonspeechanditsapplications
AT sukomalpal reviewonlanguageindependentsearchonspeechanditsapplications