Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization.
Jian Huang, Pucktada Treeratpituk, Sarah M. Taylor, C. Lee Giles
Published in: COLING (2010)
Keyphrases
- text categorization
- web documents
- cross document
- information extraction
- coreference resolution
- knn
- text classification
- feature selection
- text documents
- semi structured
- keywords
- web search engines
- contextual information
- k nearest neighbor
- semi supervised learning
- multi document summarization
- relation extraction
- similarity measure
- structured data
- information retrieval
- text mining
- natural language processing
- document representation
- named entities
- term frequency
- data mining
- tf idf
- named entity recognition
- semi supervised
- web content
- n gram
- question answering
- machine learning
- unlabeled data
- distance measure
- related documents