Monotone Increasing Binary Similarity and Its Application to Automatic Document-Acquisition of a Category.
Izumi Suzuki, Yoshiki Mikami, Ario Ohsato
Published in: IEICE Trans. Inf. Syst. (2008)
Keyphrases
- document similarity
- similarity measure
- hamming distance
- semantic similarity
- information retrieval
- document retrieval
- text documents
- document images
- semi-automatic
- content similarity
- web documents
- document collections
- cosine similarity
- information retrieval systems
- euclidean distance
- training documents
- database
- similarity measurement
- tf-idf
- document classification
- document representation
- upper bound
- document clustering
- structural similarity
- binary codes
- retrieval strategies
- text categorization
- relevant documents
- distance function
- test collection