Future Speech Interfaces with Sensors and Machine Intelligence

Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions;...

Full description

Saved in:

Bibliographic Details
Other Authors:	Denby, Bruce (Editor), Gábor Csapó, Tamás (Editor), Wand, Michael (Editor)
Format:	Electronic Book Chapter
Language:	English
Published:	Basel MDPI - Multidisciplinary Digital Publishing Institute 2023
Subjects:	Technology: general issues History of engineering & technology neural machine translation (NMT) transformer Arabic dialects modern standard Arabic subword units multi-head attention shared vocabulary self-attention 3D densely connected CNN 3D multi-layer feature fusion CNN convolutional neural network deep learning lipreading speech recognition visual speech recognition silent speech continuous-wave radar European Portuguese machine learning multimodal speech lip reading ultrasound tongue imaging pose estimation speech kinematics keypoints landmarks audio-visual speech recognition lip-reading application programming interface multi-modal interaction deep neural networks multi-view VSR attention mechanism spatial attention module local self-attention connectionist temporal classification text-to-lip speech synthesis text-to-speech speech-to-lip zero-shot adaptation generative models artificial intelligence objective measures hybrid models end-to-end recognition reliability measures decision fusion net articulation-to-speech synthesis silent speech interface speaker adaption voice conversion audiovisual speech recognition multimodal interaction edutainment virtual aquarium speech processing ultrasound imaging silent speech interfaces speech sensors
Online Access:	DOAB: download the publication DOAB: description of the publication
Tags:	Add Tag No Tags, Be the first to tag this record!

MARC


LEADER	00000naaaa2200000uu 4500
001	doab_20_500_12854_98937
005	20230405
003	oapen
006	m o d
007	cr\|mn\|---annan
008	20230405s2023 xx \|\|\|\|\|o \|\|\| 0\|eng d
020			\|a books978-3-0365-6939-0
020			\|a 9783036569383
020			\|a 9783036569390
040			\|a oapen \|c oapen
024	7		\|a 10.3390/books978-3-0365-6939-0 \|c doi
041	0		\|a eng
042			\|a dc
072		7	\|a TB \|2 bicssc
072		7	\|a TBX \|2 bicssc
100	1		\|a Denby, Bruce \|4 edt
700	1		\|a Gábor Csapó, Tamás \|4 edt
700	1		\|a Wand, Michael \|4 edt
700	1		\|a Denby, Bruce \|4 oth
700	1		\|a Gábor Csapó, Tamás \|4 oth
700	1		\|a Wand, Michael \|4 oth
245	1	0	\|a Future Speech Interfaces with Sensors and Machine Intelligence
260			\|a Basel \|b MDPI - Multidisciplinary Digital Publishing Institute \|c 2023
300			\|a 1 electronic resource (252 p.)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
506	0		\|a Open Access \|2 star \|f Unrestricted online access
520			\|a Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions; and the difficulty of creating complex man-machine interfaces. The Special Issue "Future Speech Interfaces with Sensors and Machine Intelligence" assembles eleven contributions that cover multimodal and silent speech interfaces; lip reading applications; novel sensors for speech interfaces; and enhanced speech inclusivity tools for future speech interfaces. The articles make important improvements beyond the state of the art, advancing the state of the art to new frontiers in some cases. Short summaries of all articles, grouped by topic, are presented, followed by a global commentary and evaluation.
540			\|a Creative Commons \|f https://creativecommons.org/licenses/by/4.0/ \|2 cc \|4 https://creativecommons.org/licenses/by/4.0/
546			\|a English
650		7	\|a Technology: general issues \|2 bicssc
650		7	\|a History of engineering & technology \|2 bicssc
653			\|a neural machine translation (NMT)
653			\|a transformer
653			\|a Arabic dialects
653			\|a modern standard Arabic
653			\|a subword units
653			\|a multi-head attention
653			\|a shared vocabulary
653			\|a self-attention
653			\|a 3D densely connected CNN
653			\|a 3D multi-layer feature fusion CNN
653			\|a convolutional neural network
653			\|a deep learning
653			\|a lipreading
653			\|a speech recognition
653			\|a visual speech recognition
653			\|a silent speech
653			\|a continuous-wave radar
653			\|a European Portuguese
653			\|a machine learning
653			\|a multimodal speech
653			\|a lip reading
653			\|a ultrasound tongue imaging
653			\|a pose estimation
653			\|a speech kinematics
653			\|a keypoints
653			\|a landmarks
653			\|a audio-visual speech recognition
653			\|a lip-reading
653			\|a application programming interface
653			\|a multi-modal interaction
653			\|a deep neural networks
653			\|a multi-view VSR
653			\|a attention mechanism
653			\|a spatial attention module
653			\|a local self-attention
653			\|a connectionist temporal classification
653			\|a text-to-lip
653			\|a speech synthesis
653			\|a text-to-speech
653			\|a speech-to-lip
653			\|a zero-shot adaptation
653			\|a generative models
653			\|a artificial intelligence
653			\|a objective measures
653			\|a hybrid models
653			\|a end-to-end recognition
653			\|a reliability measures
653			\|a decision fusion net
653			\|a articulation-to-speech synthesis
653			\|a silent speech interface
653			\|a speaker adaption
653			\|a voice conversion
653			\|a audiovisual speech recognition
653			\|a multimodal interaction
653			\|a edutainment
653			\|a virtual aquarium
653			\|a speech processing
653			\|a ultrasound imaging
653			\|a silent speech interfaces
653			\|a speech sensors
856	4	0	\|a www.oapen.org \|u https://mdpi.com/books/pdfview/book/6990 \|7 0 \|z DOAB: download the publication
856	4	0	\|a www.oapen.org \|u https://directory.doabooks.org/handle/20.500.12854/98937 \|7 0 \|z DOAB: description of the publication

Future Speech Interfaces with Sensors and Machine Intelligence

MARC

Similar Items