Investigating the impact of preprocessing on document embedding: an empirical comparison.
Nourelhouda YahiHacene BelhadefMathieu RochePublished in: Int. J. Data Min. Model. Manag. (2021)
Keyphrases
- preprocessing
- post processing
- keywords
- web documents
- information retrieval
- retrieval systems
- document images
- document retrieval
- document collections
- information retrieval systems
- text documents
- vector space
- preprocessing step
- database
- document classification
- textual content
- information hiding
- document structure
- cf loadingtexthtml