Automating XML markup of text documents.
Shazia AkhtarRonan G. ReillyJohn DunnionPublished in: HLT-NAACL (2003)
Keyphrases
- text documents
- markup language
- document structure
- document representation
- text mining
- text categorization
- xml schema
- information extraction
- xml documents
- text classification
- keywords
- topic models
- document clustering
- document classification
- wordnet
- named entities
- bag of words
- text data
- relational databases
- automatic text categorization
- xml data
- text collections
- databases
- structured data
- knowledge discovery
- search engine
- machine learning
- semi structured
- information retrieval
- data mining