GraWiTas: a Grammar-based Wikipedia Talk Page Parser.
Benjamin CabreraLaura SteinertBjörn RossPublished in: EACL (Software Demonstrations) (2017)
Keyphrases
- link structure
- wikipedia pages
- website
- natural language
- anchor text
- hyperlink structure
- web pages
- natural language processing
- wikipedia articles
- dependency parsing
- wordnet
- knowledge base
- speech understanding
- page content
- semantic relations
- pagerank algorithm
- document collections
- topic distillation
- stochastic context free grammars
- dependency structure
- named entities
- semantic information
- web documents
- wide coverage
- information retrieval
- named entity recognizer
- entity ranking
- context free grammars
- document representation
- link analysis