Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs
Objective: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosyllabic languages such as Chinese Mandarin, where a wo...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Book |
| Published: |
IEEE,
2024-01-01T00:00:00Z.
|
| Subjects: | |
| Online Access: | Connect to this object online. |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
MARC
| LEADER | 00000 am a22000003u 4500 | ||
|---|---|---|---|
| 001 | doaj_edc4bcb94e3a4f5daac6780b8950f7cc | ||
| 042 | |a dc | ||
| 100 | 1 | 0 | |a Xianhan Tan |e author |
| 700 | 1 | 0 | |a Qi Lian |e author |
| 700 | 1 | 0 | |a Junming Zhu |e author |
| 700 | 1 | 0 | |a Jianmin Zhang |e author |
| 700 | 1 | 0 | |a Yueming Wang |e author |
| 700 | 1 | 0 | |a Yu Qi |e author |
| 245 | 0 | 0 | |a Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs |
| 260 | |b IEEE, |c 2024-01-01T00:00:00Z. | ||
| 500 | |a 1534-4320 | ||
| 500 | |a 1558-0210 | ||
| 500 | |a 10.1109/TNSRE.2024.3457313 | ||
| 520 | |a Objective: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosyllabic languages such as Chinese Mandarin, where a word usually contains less than three phonemes, accurate decoding of phonemes plays a vital role. We found that in the neural representation space, phonemes with similar pronunciations are often inseparable, leading to confusion in phoneme classification. Methods: We mapped the neural signals of phoneme pronunciation into a hyperbolic space for a more distinct phoneme representation. Critically, we proposed a hyperbolic hierarchical clustering approach to specifically learn a phoneme-level structure to guide the representation. Results: We found such representation facilitated greater distance between similar phonemes, effectively reducing confusion. In the phoneme decoding task, our approach demonstrated an average accuracy of 75.21% for 21 phonemes and outperformed existing methods across different experimental days. Conclusion: Our approach showed high accuracy in phoneme classification. By learning the phoneme-level neural structure, the representations of neural signals were more discriminative and interpretable. Significance: Our approach can potentially facilitate high-performance speech BCIs for Chinese and other monosyllabic languages. | ||
| 546 | |a EN | ||
| 690 | |a Brain-computer interface | ||
| 690 | |a speech BCIs | ||
| 690 | |a neural decoding | ||
| 690 | |a hyperbolic network | ||
| 690 | |a hyperbolic clustering | ||
| 690 | |a Medical technology | ||
| 690 | |a R855-855.5 | ||
| 690 | |a Therapeutics. Pharmacology | ||
| 690 | |a RM1-950 | ||
| 655 | 7 | |a article |2 local | |
| 786 | 0 | |n IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol 32, Pp 3432-3441 (2024) | |
| 787 | 0 | |n https://ieeexplore.ieee.org/document/10672534/ | |
| 787 | 0 | |n https://doaj.org/toc/1534-4320 | |
| 787 | 0 | |n https://doaj.org/toc/1558-0210 | |
| 856 | 4 | 1 | |u https://doaj.org/article/edc4bcb94e3a4f5daac6780b8950f7cc |z Connect to this object online. |