Langforia: Language Pipelines for Annotating Large Collections of Documents.
Marcus KlangPierre NuguesPublished in: COLING (Demos) (2016)
Keyphrases
- metadata
- document collections
- information retrieval
- data collections
- heterogeneous collections
- multilingual documents
- information retrieval systems
- text collections
- digital libraries
- natural language
- document archives
- distributed information retrieval
- text retrieval
- language learning
- document retrieval
- text documents
- parallel corpus
- linguistic analysis
- digital collections
- web documents
- digital objects
- document classification
- relevant documents
- vector space model
- keywords
- effective retrieval
- document clustering
- programming language
- extensible markup language
- indian languages
- database
- bibliographic databases
- text categorization
- logical structure
- test collection
- cultural heritage
- free text
- vector space