Development of a Database and Models for Children’s Speech in the Slovak Language for Speech-oriented Applications

Autori

Ján Staš
Stanislav Ondáš
Matúš Pleva
Matej Horváth
Richard Ševc
Patrik Michalanský

DOI:

https://doi.org/10.2478/jazcas-2025-0020

Kľúčové slová:

acoustic model, automatic speech recognition, data augmentation, children’s speech, speech database

Abstrakt

Children’s speech differs significantly from adult speech due to physiological and cognitive developmental factors. Key differences include higher pitch, a shorter vocal tract, greater formant frequencies, slower speaking rates, and greater variability in pronunciation and articulation. These differences result in acoustic mismatches between children’s and adult speech, making traditional automatic speech recognition models trained on adult speech less effective for children. Additionally, linguistic differences, such as limited vocabulary and evolving grammar, further contribute to this challenge. This paper focuses on the creation of a children’s speech database for the low-resource Slovak language. This database has been used to train acoustic models for the automatic recognition of spontaneous children’s speech in Slovak. In this research, we compared three different approaches to speech recognition, with self-supervised learning achieving results comparable to similar studies in this area, despite using relatively small amounts of training data.

Sťahovanie

PDF (English)

Publikované

31-03-2025

Číslo

Ročník 76 Číslo 1 (2025): Jazykovedný časopis

Rubrika

Štúdie

Licencia

Táto práca je licencovaná pod Medzinárodnou licenciou Creative Commons Attribution-NonCommercial-NoDerivatives 4.0.

Ako citovať

Development of a Database and Models for Children’s Speech in the Slovak Language for Speech-oriented Applications. (2025). Jazykovedný časopis, 76(1), 223-233. https://doi.org/10.2478/jazcas-2025-0020

Stiahnuť citáciu