Generating an Overview Report over Many Documents.
Jingwen WangHao ZhangCheng ZhangWenjing YangLiqun ShaoJie WangPublished in: CoRR (2019)
Keyphrases
- document collections
- information retrieval
- xml documents
- information retrieval systems
- text documents
- metadata
- multimedia documents
- keywords
- document retrieval
- text retrieval
- legal documents
- plagiarism detection
- document classification
- relevant documents
- retrieval systems
- web documents
- data sets
- free text
- text analysis
- structured documents
- database
- language model
- digital libraries
- vector space model
- latent semantic analysis
- similarity measure
- document analysis
- textual content