Is Spoken Hungarian Low-resource?: A Quantitative Survey of Hungarian Speech Data Sets.
Péter MihajlikKatalin MádyAnna KoháriFruzsina Sára FruzsinaGábor KissTekla Etelka GrácziA. Seza DogruözPublished in: LREC/COLING (2024)
Keyphrases
- text to speech
- data sets
- speech recognition
- spoken language
- automatic speech recognition
- language independent
- speech synthesis
- broadcast news
- dynamic time warping
- qualitative and quantitative
- speech signal
- benchmark data sets
- database
- resource allocation
- resource constraints
- cross language
- facial expressions
- real world data sets
- dialogue system
- training set
- data streams
- word processing
- training data
- learning algorithm
- spoken documents
- spoken words