Corpus Conversion Service: A Machine Learning Platform to Ingest Documents at Scale.
Michele DolfiChristoph AuerPeter W. J. StaarCostas BekasPublished in: ERCIM News (2018)
Keyphrases
- learning platform
- learning activities
- word frequencies
- newspaper articles
- learning environment
- learning materials
- person names
- text corpus
- students learning
- information retrieval systems
- information retrieval
- training corpus
- multiword
- document collections
- document level
- text corpora
- text data
- learning games
- web services
- document retrieval
- text documents
- web documents
- similar documents
- parallel corpora
- natural language text
- metadata
- relevant documents
- topic segmentation
- document corpus
- xml documents
- text collections
- parallel corpus
- document representation
- database
- document clustering
- service providers
- keywords
- sentence level
- linguistic information
- wikipedia articles
- vector space model
- text categorization
- training documents
- writing style
- mobile devices
- learning algorithm
- machine learning