Web Entity Detection for Semi-structured Text Data Records with Unlabeled Data.
Chunliang LuLidong BingWai LamKi ChanYuan GuPublished in: Int. J. Comput. Linguistics Appl. (2013)
Keyphrases
- semi structured
- data extraction
- unlabeled data
- data records
- web documents
- labeled data
- semi supervised learning
- semi supervised
- structured data
- text mining
- web data
- web pages
- textual data
- text classification
- active learning
- information extraction
- supervised learning
- web data extraction
- training data
- training set
- information integration
- data points
- query result
- learning algorithm
- machine learning
- data model
- named entities
- information retrieval
- web databases
- xml databases
- website
- database
- keyword queries
- multi dimensional
- knowledge base
- web mining
- data distribution
- web search engines
- pairwise
- multi view
- knowledge discovery
- natural language processing
- data mining
- nearest neighbor