Mining for Domain-specific Parallel Text from Wikipedia.
Magdalena PlamadaMartin VolkPublished in: BUCC@ACL (2013)
Keyphrases
- domain specific
- text mining
- named entity disambiguation
- world knowledge
- domain independent
- knowledge sources
- natural language text
- short texts
- text retrieval
- semantic information
- knowledge base
- wikipedia pages
- general purpose
- relation extraction
- semi automatically
- knowledge discovery
- text data
- web mining
- parallel processing
- pattern mining
- data mining
- information retrieval
- keywords
- named entities
- document collections
- wikipedia articles
- text classification
- wordnet
- database
- mining algorithm
- web documents
- short text
- semantic relations
- text documents
- sequential patterns
- parallel implementation
- probabilistic topic models
- document corpus
- named entity recognizer