On the Programmatic Generation of Reproducible Documents.
Michael J. KaneXun JiangSimon UrbanekPublished in: J. Stat. Softw. (2022)
Keyphrases
- document collections
- metadata
- web documents
- xml documents
- information retrieval systems
- information retrieval
- legal documents
- text documents
- relevant documents
- document retrieval
- multi document summarization
- document clustering
- web data
- time stamped
- document analysis
- plagiarism detection
- text analysis
- document representation
- vector space model
- language model
- free text
- vector space
- keywords
- semantic information
- document classification
- xml format
- neural network
- generation process
- digital libraries
- text mining