Regular Expression Guided Entity Mention Mining from Noisy Web Data.
Shanshan Zhang, Lihong He, Slobodan Vucetic, Eduard C. Dragut
Published in: EMNLP (2018)
Keyphrases
- web data
- web mining
- regular expressions
- incremental mining
- semistructured data
- pattern matching
- web usage mining
- frequent sequences
- semi structured
- semi structured data
- knowledge discovery
- social network analysis
- page contents
- web content
- query language
- web logs
- data mining techniques
- data mining
- XML schema
- text mining
- social networks
- web pages
- web documents
- sequential pattern mining
- data sets
- user profiles
- machine learning
- knowledge base
- data sources
- query logs
- information extraction
- web search