A Multilingual Information Extraction Pipeline for Investigative Journalism.
Gregor WiedemannSeid Muhie YimamChris BiemannPublished in: CoRR (2018)
Keyphrases
- information extraction
- natural language processing
- precision and recall
- text mining
- named entity recognition
- free text
- machine learning
- ontology based information extraction
- language resources
- information retrieval
- relational learning
- machine translation
- question answering
- conditional random fields
- named entities
- cross language
- semi structured
- web documents
- processing pipeline
- structured data
- open domain
- relation extraction
- cross language information retrieval
- semantic tagging
- digital libraries
- text processing
- pipeline architecture
- language independent
- cross lingual
- web mining
- text documents
- knowledge base
- hidden markov models
- text summarization
- natural language
- document retrieval
- unstructured text
- multi lingual
- information retrieval systems
- neural network
- multilingual documents
- information extraction systems
- co occurrence
- case study
- textual data
- artificial intelligence