Expanding a multilingual media monitoring and information extraction tool to a new language: Swahili.
Ralf SteinbergerSylvia OmbuyaMijail A. KabadjovBruno PouliquenLeonida Della RoccaJenya BelyaevaMonica de PaolaCamelia IgnatErik Van der GootPublished in: Lang. Resour. Evaluation (2011)
Keyphrases
- information extraction
- natural language
- language specific
- text mining
- real time
- language resources
- question answering
- machine translation
- language independent
- multimedia
- natural language processing
- machine learning
- monitoring system
- digital libraries
- cross language
- parallel corpus
- multimedia content
- precision and recall
- conceptual model
- semi structured
- tool wear
- text documents
- scripting language
- programming language
- information retrieval
- script language
- multi lingual
- textual data
- named entity recognition
- free text
- web mining
- named entities
- co occurrence