EMALG: An Enhanced Mandarin Lombard Grid Corpus with Meaningful Sentences.
Baifeng LiQingmu LiuYuhong YangHongyang ChenWeiping TuSong LinPublished in: CoRR (2023)
Keyphrases
- sentence level
- lexical features
- linguistic features
- link grammar
- text corpus
- linguistic patterns
- syntactic features
- semantic roles
- multiword
- training corpus
- manually annotated
- natural language
- penn treebank
- keyphrases
- inter annotator agreement
- word sense
- multi document summarization
- tree bank
- probabilistic context free grammars
- noun phrases
- machine translation system
- sentiment classification
- grid computing
- grid points
- sentiment analysis
- information retrieval
- plain text
- grid environment
- word frequency
- word alignment
- speech recognition
- test set
- statistical machine translation
- information extraction
- document level
- text corpora