Testing Structural Properties in Textual Data: Beyond Document Grammars.
Felix Sasaki, Jens Pönninghaus
Published in: Lit. Linguistic Comput. (2003)
Keyphrases
- structural properties
- textual data
- text documents
- natural language processing
- information extraction
- text mining
- structured data
- textual information
- web documents
- text categorization
- information retrieval
- raw data
- document images
- document collections
- text data
- text classification
- information retrieval systems
- WordNet
- keywords
- bag of words
- machine learning
- natural language
- news articles
- topic models
- co-occurrence
- databases
- semantic information
- retrieval systems
- named entities
- XML documents
- high dimensional
- question answering
- metadata
- k-nearest neighbor