The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology.
Sara GriloMárcia BolrinhaJoão SilvaRui VazAntónio BrancoPublished in: LREC (2020)
Keyphrases
- language technology
- document collections
- digital collections
- digital libraries
- text analysis
- distributed information retrieval
- software internationalisation
- text collections
- resource selection
- natural language processing
- information retrieval systems
- semantic knowledge
- information society
- information retrieval
- computational linguistics
- metadata
- document classification
- web documents
- software engineering
- document clustering
- digital resources
- text documents
- xml documents
- database
- natural language text
- keywords
- human language
- query based sampling
- text corpora