EgoSep: Egocentric On-Screen Sound Source Separation for Real-Time Edge Computing

The ability to identify specific sounds in noisy environments can be improved by incorporating visual information through audio-visual integration, leveraging visual cues such as lip reading and sound-producing object recognition. Recent advancements in deep learning have enabled effective audio-vis...

Bibliographic Details
Main Authors: Donghyeok Jo, Jun-Hwa Kim, Jihoon Jeon, Chee Sun Won
Format: Article
Language: English
Published: IEEE 2025-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10830501/

Similar Items