Microwave Speech Recognizer Empowered by a Programmable Metasurface

© 2024 The Authors. Advanced Science published by Wiley-VCH GmbH.

Speech recognition is becoming increasingly important in modern society, especially for human-machine interactions, but its deployment is still severely thwarted by the struggle of machines to recognize voiced commands in challenging real-life settings: oftentimes, ambient noise drowns out the acoustic signals, and walls, face masks, or other obstacles hide the mouth motion from optical sensors. To address these formidable challenges, an experimental prototype of a microwave speech recognizer empowered by a programmable metasurface is presented here that can remotely recognize human voice commands and speaker identities even in noisy environments and when the speaker's mouth is hidden behind a wall or face mask. The programmable metasurface is the pivotal hardware ingredient of the system because its large aperture and huge number of degrees of freedom allow the system to perform a complex sequence of sensing tasks, orchestrated by artificial-intelligence tools. Relying solely on microwave data, the system avoids visual privacy infringements. It is experimentally demonstrated that the developed microwave speech recognizer can enable privacy-respecting voice-commanded human-machine interactions in many important but to-date inaccessible application scenarios. The presented strategy is expected to unlock new possibilities for future smart homes, ambient-assisted health monitoring, as well as intelligent surveillance and security.

Media type:

E-article

Year of publication:

2024

Published:

2024

Contained in:

To the parent record - year:2024

Contained in:

Advanced science (Weinheim, Baden-Wurttemberg, Germany) - (2024), 21 Feb., page e2309826

Language:

English

Contributors:

Zhang, Hongrui [Author]
Ruan, Hengxin [Author]
Zhao, Hanting [Author]
Wang, Zhuo [Author]
Hu, Shengguo [Author]
Cui, Tie Jun [Author]
Del Hougne, Philipp [Author]
Li, Lianlin [Author]

Subjects:

Artificial intelligence (AI)
Human-machine interactions
Journal Article
Microwave sensing
Programmable metasurface
Speech recognition

Notes:

Date Revised 21.02.2024

published: Print-Electronic

Citation Status Publisher

doi:

10.1002/advs.202309826

funding:

Funding institution / Project title:

PPN (catalog ID):

NLM368705757