AusKidTalk: an auditory-visual corpus of 3- to 12-year-old Australian children's speech

Beena Ahmed, Kirrie J. Ballard, Denis Burnham, Tharmakulasingam Sirojan, Hadi Mehmood, Dominique Estival, Elise Baker, Felicity Cox, Joanne Arciuli, Titia Benders, Katherine Demuth, Barbara Kelly, Chloé Diskin-Holdaway, Mostafa Shahin, Vidhyasaharan Sethu, Julien Epps, Chwee Beng Lee, Eliathamby Ambikairajah

Research output: Chapter in Book / Conference Paper › Conference Paper › peer-review

8 Citations (Scopus)

Abstract

Here we present AusKidTalk [1], an audio-visual (AV) corpus of Australian children's speech collected to facilitate the development of speech-based technological solutions for children. It builds upon the technology and expertise developed through the collection of an earlier corpus of Australian adult speech, AusTalk [2,3]. This multi-site initiative was established to remedy the dire shortage, in Australia and around the world, of children's speech corpora large enough to train accurate automated speech processing tools for children. We are collecting ∼600 hours of speech from children aged 3-12 years, including single-word and sentence productions as well as narrative and emotional speech. In this paper, we discuss the key requirements for AusKidTalk and how we designed the recording setup and protocol to meet them. We also discuss key findings from our feasibility study of the recording protocol, recording tools, and user interface.

Original language: English
Title of host publication: Proceedings of Interspeech 2021: 30 August - 3 September 2021, Brno, Czechia
Publisher: International Speech and Communication Association
Pages: 4351-4355
Number of pages: 5
Publication status: Published - 2021
Event: INTERSPEECH (Conference)
Duration: 30 Aug 2021 → …

Conference

Conference: INTERSPEECH (Conference)
Period: 30/08/21 → …

Bibliographical note

Publisher Copyright:
Copyright © 2021 ISCA.
