Grouping Web Pages about Persons and Organizations for Information Extraction.
Shiren YeTat-Seng ChuaJimin LiuJeremy R. KeiPublished in: ICADL (2002)
Keyphrases
- information extraction
- web pages
- web documents
- web information extraction
- unstructured information
- data extraction
- text mining
- website
- precision and recall
- search engine
- natural language processing
- web mining
- semi structured
- information technology
- named entity recognition
- web page classification
- structured data
- machine learning
- web content mining
- information retrieval
- information systems
- question answering
- free text
- geographical locations
- web search engines
- web search
- perceptual grouping
- google search engine
- web server
- web users
- textual data
- deep web
- extraction rules
- decision making
- end user computing
- web content
- hierarchical structure
- organizational learning
- text documents
- anchor text
- structured information
- natural language
- web logs
- knowledge management
- web data
- named entities
- machine translation