Extracting _Carbon Copy_ Names and Organizations from a Heterogeneous Document Collection.
Kazem Taghva, Russell Beckley, Jeffrey S. Coombs
Published in: ICDAR (2007)
Keyphrases
- document collections
- information retrieval systems
- information retrieval
- document retrieval
- test collection
- text retrieval
- document clustering
- digital libraries
- relevant documents
- named entities
- index terms
- ad hoc retrieval
- document representation
- cross language
- document archives
- result lists
- geographic information retrieval
- xml retrieval
- document set
- similar documents
- information access
- retrieval systems
- keywords