Feasibility of decoding covert speech in ECoG with a Transformer trained on overt speech

ABSTRACT Several attempts at speech brain–computer interfacing (BCI) have been made to decode phonemes, sub-words, words, or sentences from invasive measurements, such as the electrocorticogram (ECoG), recorded during auditory speech perception, overt speech, or imagined (covert) speech. Decoding sentences from covert speech remains a challenging task. Sixteen epilepsy patients with intracranially implanted electrodes participated in this study, and ECoGs were recorded during overt and covert speech of eight Japanese sentences, each consisting of three tokens. In particular, a Transformer neural network model, trained using ECoGs obtained during overt speech, was applied to decode text sentences from covert speech. We first examined the proposed Transformer model using the same task for training and testing, and then evaluated the model's performance when trained on the overt-speech task and tested on covert speech. The Transformer model trained on covert speech achieved an average token error rate (TER) of 46.6% for decoding covert speech, whereas the model trained on overt speech achieved a TER of 46.3% (p > 0.05; d = 0.07). Therefore, the difficulty of collecting training data for covert speech can be addressed by using overt speech. Decoding performance for covert speech may be further improved by employing multiple overt speech recordings.
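The abstract reports decoding accuracy as a token error rate (TER). As an illustration only (the sketch below is an assumption about how such a metric is conventionally computed, not the authors' published code), TER can be taken as the Levenshtein edit distance between the decoded and reference token sequences, normalized by the reference length; the example tokens are hypothetical.

```python
# Minimal sketch: token error rate (TER) as edit distance over token lists,
# normalized by the number of reference tokens. Not the authors' code.

def token_error_rate(reference, hypothesis):
    """Substitutions + insertions + deletions needed to turn the hypothesis
    into the reference, divided by the reference length."""
    n, m = len(reference), len(hypothesis)
    # dp[i][j] = edit distance between reference[:i] and hypothesis[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dp[i][0] = i
    for j in range(m + 1):
        dp[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution or match
    return dp[n][m] / n


if __name__ == "__main__":
    # Hypothetical three-token sentence, mirroring the study's stimulus format.
    reference = ["token_a", "token_b", "token_c"]
    hypothesis = ["token_a", "token_x", "token_c"]
    print(f"TER = {token_error_rate(reference, hypothesis):.1%}")  # 33.3%
```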

Media type:

Preprint

Year of publication:

2024

Published:

2024

Contained in:

bioRxiv.org (2024), 16 Apr. 2024

Language:

English

Contributors:

Komeiji, Shuji [Author]
Mitsuhashi, Takumi [Author]
Iimura, Yasushi [Author]
Suzuki, Hiroharu [Author]
Sugano, Hidenori [Author]
Shinoda, Koichi [Author]
Tanaka, Toshihisa [Author]

Links:

Full text [free of charge]

Topics:

570 (Biology)

DOI:

10.1101/2024.02.05.578911

Funding:

Funding institution / project title:

PPN (catalog ID):

XBI042455537