Using neural-network based paragraph embeddings for the calculation of within and between document similarities.
Bart Thijs
Published in: Scientometrics (2020)
Keyphrases
- information retrieval systems
- document collections
- neural network
- document images
- information retrieval
- database
- document level
- document classification
- vector space
- web documents
- retrieval systems
- relevant documents
- data sets
- document representation
- document retrieval
- user queries
- dimensionality reduction
- similarity measure
- euclidean space
- keywords
- vector space model
- structured documents
- machine learning
- document content
- content similarity