Towards High Speed Grammar Induction on Large Text Corpora.
Pieter W. Adriaans, Marten Trautwein, Marco Vervoort
Published in: SOFSEM (2000)
Keyphrases
- text corpora
- grammar induction
- natural language processing
- language processing
- text mining
- computational linguistics
- text analysis
- context free grammars
- statistical machine translation
- text documents
- document collections
- text classification
- machine translation
- training corpus
- text collections
- topic models
- unsupervised methods
- concept hierarchy
- text classifiers
- text categorization
- machine learning
- knowledge representation
- information extraction
- information retrieval systems
- query processing
- natural language
- information retrieval
- text data
- data sets
- WordNet