Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs

Objective: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosyllabic languages such as Chinese Mandarin, where a wo...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xianhan Tan (Author), Qi Lian (Author), Junming Zhu (Author), Jianmin Zhang (Author), Yueming Wang (Author), Yu Qi (Author)
Format:	Book
Published:	IEEE, 2024-01-01T00:00:00Z.
Subjects:	article
Online Access:	Connect to this object online.
Tags:	Add Tag No Tags, Be the first to tag this record!

MARC


LEADER	00000 am a22000003u 4500
001	doaj_edc4bcb94e3a4f5daac6780b8950f7cc
042			\|a dc
100	1	0	\|a Xianhan Tan \|e author
700	1	0	\|a Qi Lian \|e author
700	1	0	\|a Junming Zhu \|e author
700	1	0	\|a Jianmin Zhang \|e author
700	1	0	\|a Yueming Wang \|e author
700	1	0	\|a Yu Qi \|e author
245	0	0	\|a Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs
260			\|b IEEE, \|c 2024-01-01T00:00:00Z.
500			\|a 1534-4320
500			\|a 1558-0210
500			\|a 10.1109/TNSRE.2024.3457313
520			\|a Objective: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosyllabic languages such as Chinese Mandarin, where a word usually contains less than three phonemes, accurate decoding of phonemes plays a vital role. We found that in the neural representation space, phonemes with similar pronunciations are often inseparable, leading to confusion in phoneme classification. Methods: We mapped the neural signals of phoneme pronunciation into a hyperbolic space for a more distinct phoneme representation. Critically, we proposed a hyperbolic hierarchical clustering approach to specifically learn a phoneme-level structure to guide the representation. Results: We found such representation facilitated greater distance between similar phonemes, effectively reducing confusion. In the phoneme decoding task, our approach demonstrated an average accuracy of 75.21% for 21 phonemes and outperformed existing methods across different experimental days. Conclusion: Our approach showed high accuracy in phoneme classification. By learning the phoneme-level neural structure, the representations of neural signals were more discriminative and interpretable. Significance: Our approach can potentially facilitate high-performance speech BCIs for Chinese and other monosyllabic languages.
546			\|a EN
690			\|a Brain-computer interface
690			\|a speech BCIs
690			\|a neural decoding
690			\|a hyperbolic network
690			\|a hyperbolic clustering
690			\|a Medical technology
690			\|a R855-855.5
690			\|a Therapeutics. Pharmacology
690			\|a RM1-950
655	7		\|a article \|2 local
786	0		\|n IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol 32, Pp 3432-3441 (2024)
787	0		\|n https://ieeexplore.ieee.org/document/10672534/
787	0		\|n https://doaj.org/toc/1534-4320
787	0		\|n https://doaj.org/toc/1558-0210
856	4	1	\|u https://doaj.org/article/edc4bcb94e3a4f5daac6780b8950f7cc \|z Connect to this object online.

Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs

MARC

Similar Items