TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text.
Kalina Bontcheva, Leon Derczynski, Adam Funk, Mark A. Greenwood, Diana Maynard, Niraj Aswani
Published in: RANLP (2013)
Keyphrases
- information extraction
- open source
- free text
- text mining
- textual data
- text documents
- text processing
- ontology based information extraction
- information retrieval
- natural language text
- information extraction systems
- web documents
- unstructured text
- natural language processing
- text analysis
- text summarization
- machine learning
- open domain
- named entity recognition
- open source software
- precision and recall
- named entities
- text retrieval
- text data
- source code
- semi structured
- structured data
- text corpora
- social media
- topic detection
- linguistic patterns
- web mining
- keywords
- real world
- text classification
- sentence level
- information retrieval systems
- relation extraction
- question answering
- social networks
- textual information
- conditional random fields