Corpus Conversion Service: A Machine Learning Platform to Ingest Documents at Scale.
Peter W. J. StaarMichele DolfiChristoph AuerCostas BekasPublished in: CoRR (2018)
Keyphrases
- learning platform
- word frequencies
- newspaper articles
- document collections
- learning materials
- person names
- text corpus
- learning activities
- information retrieval
- text corpora
- text data
- web documents
- document level
- natural language text
- learning environment
- students learning
- web services
- text collections
- learning games
- training corpus
- similar documents
- service providers
- document corpus
- information retrieval systems
- xml documents
- document clustering
- document retrieval
- training documents
- metadata
- document representation
- multiword
- information extraction
- linguistic information
- free text
- word frequency
- sentence level
- text documents
- retrieval systems
- database
- relevant documents
- learning systems
- java programming
- text classification
- learning objects
- data mining