CORAA NURC-SP Minimal Corpus: a manually annotated corpus of Brazilian Portuguese spontaneous speech.
Vinícius G. dos SantosCaroline AlvesBruno Baldissera CarlottoBruno A. Papa DiasLucas Rafael Stefanel GrisRenan de Lima IzaiasMaria Luiza Azevedo de MoraisPaula Marin de OliveiraRafael SicoliFlaviane Romani Fernandes SvartmanMarli Quadros LeiteSandra Maria AluísioPublished in: IberSPEECH (2022)
Keyphrases
- annotated corpus
- spontaneous speech
- brazilian portuguese
- named entities
- named entity recognition
- linguistic features
- relation extraction
- human machine interaction
- machine translation
- automatic annotation
- spoken language
- information extraction
- spoken document retrieval
- automatic speech recognition
- hand crafted
- natural language processing
- automatic extraction
- semantic relations
- multi modal
- high level
- bayesian networks
- text mining
- information retrieval
- conditional random fields
- co occurrence