Corpus Conversion Service: A machine learning platform to ingest documents at scale [Poster abstract].
Peter W. J. Staar, Michele Dolfi, Christoph Auer, Costas Bekas
Published in: CoRR (2018)
Keyphrases
- learning platform
- newspaper articles
- learning activities
- word frequencies
- learning environment
- learning materials
- person names
- similar documents
- text collections
- document collections
- training corpus
- document clustering
- information retrieval
- text corpus
- xml documents
- document retrieval
- document level
- students learning
- text documents
- text data
- multiword
- parallel corpus
- learning games
- linguistic information
- wikipedia articles
- web documents
- information retrieval systems
- document representation
- topic segmentation
- relevant documents
- database
- web services
- text corpora
- word frequency
- keywords
- service providers
- natural language text
- multi document summarization
- metadata
- document corpus
- java programming
- training documents
- user queries
- text classification
- learning objects
- parallel corpora
- text mining
- query terms
- vector space model
- word pairs