EgoSep: Egocentric On-Screen Sound Source Separation for Real-Time Edge Computing

The ability to identify specific sounds in noisy environments can be improved by incorporating visual information through audio-visual integration, leveraging visual cues such as lip reading and sound-producing object recognition. Recent advancements in deep learning have enabled effective audio-vis...

Bibliographic Details
Main Authors:	Donghyeok Jo, Jun-Hwa Kim, Jihoon Jeon, Chee Sun Won
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Audio-visual deep learning; on-screen sound separation; edge computing
Online Access:	https://ieeexplore.ieee.org/document/10830501/

Similar Items

Everyday Urban Sounds: Ben Katchor’s Julius Knipl Radio Cartoons
by: Benjamin Fraser
Published: (2024-11-01)

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
by: Karn N. Watcharasupat, et al.
Published: (2024-01-01)

Sound recurrence analysis for acoustic scene classification
by: Jakob Abeßer, et al.
Published: (2025-01-01)

Comics Aloud: Introduction
by: Benoît Glaude, et al.
Published: (2024-11-01)

La bande dessinée à voix haute : introduction
by: Benoît Glaude, et al.
Published: (2024-11-01)

Gothic Sounds and the Foreshadowing of Victorian Soundscapes
by: Lucie Ratail
Published: (2021-11-01)

Antipredator behaviour in semi-feral horses: innate response and the influence of external factors
by: Antoine Bercy, et al.
Published: (2025-02-01)

Sound and recording : applications and theory /
by: Rumsey, Francis
Published: (2021)

Influence and prediction mechanisms for discomfort and memory disturbance due to structure borne sound from a metro masked with fountain sound
by: Wang Qiaochu, et al.
Published: (2025-01-01)

Abnormal Respiratory Sounds Classification Using Deep CNN Through Artificial Noise Addition
by: Rizwana Zulfiqar, et al.
Published: (2021-11-01)

Corrigendum: Abnormal respiratory sounds classification using deep CNN through artificial noise addition
by: Rizwana Zulfiqar, et al.
Published: (2025-01-01)

(Sounding) Silence: Dysfluency Mediated Otherwise
by: Andi Lois Gilker
Published: (2024-12-01)

Information from sound: exploring sounds and listening in information practices research
by: Owen Stewart-Robertson
Published: (2024-06-01)

Seasonal variation of underwater sound propagation in the Beibu Gulf
by: Feihong Zeng, et al.
Published: (2025-01-01)

FEATURE EXTRACTION METHOD OF SOUND SIGNAL TO ROLLING BEARING BASED ON BLIND SOURCE SEPARATION AND MORLET WAVELET
by: LI JingJiao, et al.
Published: (2018-01-01)

Evaluating causes and gestures: source-related and crossmodal features in the perception of environmental sounds
by: Sven-Amin Lembke
Published: (2025-02-01)

The impact of binaural auralizations on sound source localization and social presence in audiovisual virtual reality: converging evidence from placement and eye-tracking paradigms
by: Roßkopf Sarah, et al.
Published: (2024-01-01)

Optimization of parallel coiled cavities of different depths in microperforated panel sound absorbers
by: Hequn Min, et al.
Published: (2025-01-01)

Making Dementia Matter Through Sound
by: Marjolein Gysels, et al.
Published: (2024-03-01)

Formation model of sound engineers’ communicative competence
by: Yuriy A. Kachalov
Published: (2024-11-01)

The Potential for a Networked Ionospheric Sounding Constellation
by: A. T. Chartier
Published: (2022-05-01)

Comparison of Machine Learning Algorithms on Classification of Covid-19 Cough Sounds Using MFCC Extraction
by: Mohammad Reza Faisal, et al.
Published: (2023-12-01)

The Sound Masking Assessment of Recycled Partition
by: Phylicia Deosephine Soegiono, et al.
Published: (2024-12-01)

Sound in Daniel Deronda
by: Michael Hollington
Published: (2021-11-01)

Within, Without, Withinwards: The Circulation of Sound in Shakespeare’s Theater
by: Bruce R. Smith
Published: (2013-06-01)

Analysis and Optimization of SUV Drive Axle Abnormal Sound
by: Lü Chenming, et al.
Published: (2018-01-01)

The Development of Digital Audio Coding
by: Guo Ke
Published: (1995-01-01)

Elephant Sound Classification Using Deep Learning Optimization
by: Hiruni Dewmini, et al.
Published: (2025-01-01)

Audio-Language Datasets of Scenes and Events: A Survey
by: Gijs Wijngaard, et al.
Published: (2025-01-01)

Vertical Electric Sounding of Leachate Contaminant Plumes at a Dumpsite in Obigbo, Rivers State, Nigeria
by: W. Tamuno, et al.
Published: (2020-01-01)

Sound as Environment in Tom Konyves’ Urban Pastorals
by: Bénédicte CHORIER-FRYD
Published: (2017-06-01)

Improved sound absorption with 3D-printed micro-perforated sandwich structures
by: Peng-Fei Zhang, et al.
Published: (2025-01-01)

Stereo robust watermark algorithm based on parameter optimization
by: Yiming XUE, et al.
Published: (2023-07-01)

System and application of video surveillance based on edge computing
by: Sanming PAN, et al.
Published: (2020-06-01)

A Novel Audio Copy Move Forgery Detection Method With Classification of Graph-Based Representations
by: Beste Ustubioglu, et al.
Published: (2025-01-01)

In-vivo high-resolution χ-separation at 7T
by: Jiye Kim, et al.
Published: (2025-03-01)

A STUDY ON ACEHNESE EFL LEARNERS’ PHONETICS: PRONUNCIATION ERRORS IN PRODUCING ENGLISH CONSONANT SOUNDS
by: Fera Busfina Zalha, et al.
Published: (2025-01-01)

The Acoustic Dimension of Social Protest: Notes from an Ethnography of Sound
by: José Luis Martin, et al.
Published: (2017-09-01)