CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.
Arnaldo Cândido JúniorEdresson CasanovaAnderson da Silva SoaresFrederico Santos de OliveiraLucas OliveiraRicardo Corso Fernandes JuniorDaniel Peixoto Pinto da SilvaFernando Gorgulho FayetBruno Baldissera CarlottoLucas Rafael Stefanel GrisSandra Maria AluísioPublished in: Lang. Resour. Evaluation (2023)
Keyphrases
- n gram
- speech recognition
- automatic speech recognition
- conversational speech
- brazilian portuguese
- language model
- speech signal
- word error rate
- spontaneous speech
- broadcast news
- speech retrieval
- machine translation
- speech recognizer
- speech synthesis
- hidden markov models
- speech processing
- noisy environments
- keyword spotting
- speech recognizers
- speaker identification
- word recognition
- recognition engine
- speech recognition systems
- isolated word
- handwriting recognition
- speaker independent
- speaker dependent
- speech recognition technology
- natural language
- speech recognition errors
- pattern recognition