Brain-to-text: Decoding spoken phrases from phone representations in the brain

Herff, C.; Heger, D.; Pesters, A. de; Telaar, D.; Brunner, P.; Schalk, G.; Schultz, T.

doi:10.3389/fnins.2015.00217

Brain-to-text: Decoding spoken phrases from phone representations in the brain

Herff, C. ¹; Heger, D. ¹; Pesters, A. de; Telaar, D. ¹; Brunner, P.; Schalk, G.; Schultz, T. ¹
¹ Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

It has long been speculated whether communication between humans and machines based on natural speech related cortical activity is possible. Over the past decade, studies have suggested that it is feasible to recognize isolated aspects of speech from neural signals, such as auditory features, phones or one of a few isolated words. However, until now it remained an unsolved challenge to decode continuously spoken speech from the neural substrate associated with speech and language processing. Here, we show for the first time that continuously spoken speech can be decoded into the expressed words from intracranial electrocorticographic (ECoG) recordings. Specifically, we implemented a system, which we call Brain-To-Text that models single phones, employs techniques from automatic speech recognition (ASR), and thereby transforms brain activity while speaking into the corresponding textual representation. Our results demonstrate that our system can achieve word error rates as low as 25% and phone error rates below 50%. Additionally, our approach contributes to the current understanding of the neural basis of continuous speech production by identifying those cortical regions that hold substantial information about individual phones. ... mehr

KITopen-Download

Verlagsausgabe

DOI: 10.5445/IR/1000049750

Externe Links

Originalveröffentlichung
DOI: 10.3389/fnins.2015.00217

Scopus

Web of Science
Zitationen: 159

Dimensions
Zitationen: 247

Export

Statistiken

Seitenaufrufe: 202
seit 18.05.2018

Downloads: 214
seit 13.10.2015

Zugehörige Institution(en) am KIT	Deutsch-Französisches Institut für Automation und Robotik (Dt..-Fr. IAR) Universität Karlsruhe (TH) – Interfakultative Einrichtungen (Interfakultative Einrichtungen)
Publikationstyp	Zeitschriftenaufsatz
Publikationsjahr	2015
Sprache	Englisch
Identifikator	ISSN: 1662-453X, 1662-4548 urn:nbn:de:swb:90-497506 KITopen-ID: 1000049750
Erschienen in	Frontiers in neuroscience
Verlag	Frontiers Media SA
Band	9
Heft	JUN
Seiten	217
Bemerkung zur Veröffentlichung	Gefördert durch den KIT-Publikationsfonds
Nachgewiesen in	Web of Science Scopus Dimensions

Repository KITopen

Brain-to-text: Decoding spoken phrases from phone representations in the brain

Abstract: