DSENet : Directional Signal Extraction Network for Hearing Improvement on Edge Devices

In this paper, we propose a directional signal extraction network (DSENet). DSENet is a low-latency, real-time neural network that, given a reverberant mixture of signals captured by a microphone array, aims at extracting the reverberant signal whose source is located within a directional region of interest. If there are multiple sources situated within the directional region of interest, DSENet will aim at extracting a combination of their reverberant signals. As such, the formulation of DSENet circumvents the well-known crosstalk problem in beamforming while providing an alternative and perhaps more practical approach to other spatially constrained signal extraction methods proposed in the literature. DSENet is based on a computationally efficient and low-distortion linear model formulated in the time domain. As a result, an important application of our work is hearing improvement on edge devices. Simulation results show that DSENet outperforms oracle beamformers, as well as state-of-the-art in low-latency causal speech separation, while incurring a system latency of only 4 ms. Additionally, DSENet has been successfully deployed as a real-time application on a smartphone.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:11

Enthalten in:

IEEE access : practical innovations, open solutions - 11(2023) vom: 11., Seite 4350-4358

Sprache:

Englisch

Beteiligte Personen:

Kovalyov, Anton [VerfasserIn]
Patel, Kashyap [VerfasserIn]
Panahi, Issa [VerfasserIn]

Links:

Volltext

Themen:

Beamforming
Directional signal extraction
Journal Article
Microphone array
Real-time
Signal separation

Anmerkungen:

Date Revised 29.08.2023

published: Print-Electronic

Citation Status PubMed-not-MEDLINE

doi:

10.1109/access.2023.3235948

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM361199945