Design and implementation of the Sweble Wikitext parser: unlocking the structured data of Wikipedia.
Hannes Dohrn, Dirk Riehle
Published in: Int. Sym. Wikis (2011)
Keyphrases
- structured data
- semi-structured
- structured information
- XML documents
- unstructured text
- information extraction
- relational data
- keyword search
- textual data
- linked data
- structured databases
- unstructured data
- parse tree
- dependency parsing
- keyword queries
- text data
- databases
- data sources
- relational databases
- natural language
- machine learning