Lexical Categories for Improved Parsing of Web Data.
Lilja ØvrelidArne SkjærholtPublished in: COLING (Posters) (2012)
Keyphrases
- web data
- web mining
- natural language processing
- semi structured
- web usage mining
- web content
- web information
- web documents
- linguistic analysis
- web sources
- web pages
- incremental mining
- wordnet
- web information extraction
- website
- parse tree
- database
- syntactic categories
- machine learning
- real world
- page contents
- query logs
- web crawling
- information integration
- natural language
- databases