Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks.
Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Ragheb Al-Ghezi, Mietta Lennes, Tamás Grósz, Krister Lindén, Mikko Kurimo
Published in: Lang. Resour. Evaluation (2023)
Keyphrases
- small scale
- conversational speech
- manually annotated
- case study
- real life
- data sets
- language understanding
- web scale
- neural network
- real time
- text classification
- social networks
- automatic speech recognition
- learning algorithm
- machine learning
- training corpus
- supervised machine learning
- million images
- spontaneous speech
- document corpus