TICCLops: Text-Induced Corpus Clean-up as online processing system.
Martin Reynaert
Published in: COLING (Demos) (2014)
Keyphrases
- real time
- supervised machine learning
- open domain
- text data
- broad coverage
- text processing
- online learning
- text corpora
- information retrieval
- data processing
- natural language text
- english words
- text corpus
- newspaper articles
- text collections
- plain text
- text mining
- information processing
- text retrieval
- manually annotated
- world knowledge
- lexical features
- noun phrases
- text classification
- recognizing textual entailment
- temporal expressions
- text indexing
- named entity disambiguation
- linguistic information
- document level
- keywords
- textual features
- training corpus
- scientific papers
- spontaneous speech
- wikipedia articles
- free text
- search engine