Exploratory Analysis of Word Use and Sentence Length in the Spoken Dutch Corpus.
Pascal Wiggers, Léon J. M. Rothkrantz
Published in: TSD (2007)
Keyphrases
- exploratory analysis
- sentence level
- training corpus
- text corpus
- sentence pairs
- noun phrases
- parallel corpus
- ambiguous words
- recognizing textual entailment
- sentiment analysis
- probabilistic context-free grammars
- word frequency
- document level
- information visualization
- word pairs
- data visualization
- text classification
- word level
- part of speech
- linguistic features
- statistical machine translation
- speech recognition
- word sense
- data mining
- data analysis
- spontaneous speech
- stop words
- machine translation system
- natural language
- word sense disambiguation
- machine translation
- natural language text
- semantic roles
- automatic speech recognition
- information retrieval
- co-occurrence
- n-gram
- association patterns
- spoken language