KIT | KIT-Bibliothek | Impressum | Datenschutz

Estimating speaker direction on a humanoid robot with binaural acoustic signals

Barot, Pranav; Mombaur, Katja ORCID iD icon 1; MacDonald, Ewen N.
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

To achieve human-like behaviour during speech interactions, it is necessary for a humanoid robot to estimate the location of a human talker. Here, we present a method to optimize the parameters used for the direction of arrival (DOA) estimation, while also considering real-time applications for human-robot interaction scenarios. This method is applied to binaural sound source localization framework on a humanoid robotic head. Real data is collected and annotated for this work. Optimizations are performed via a brute force method and a Bayesian model based method, results are validated and discussed, and effects on latency for real-time use are also explored.


Verlagsausgabe §
DOI: 10.5445/IR/1000167568
Veröffentlicht am 24.01.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2024
Sprache Englisch
Identifikator ISSN: 1932-6203
KITopen-ID: 1000167568
Erschienen in PLOS ONE
Verlag Public Library of Science (PLoS)
Band 19
Heft 1
Seiten Art.-Nr.: e0296452
Vorab online veröffentlicht am 02.01.2024
Nachgewiesen in Dimensions
Web of Science
Scopus
Relationen in KITopen
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page