Postal Address Detection from Web Documents.
Can LinQian ZhangXiaofeng MengWenyin LinPublished in: WIRI (2005)
Keyphrases
- web documents
- information extraction
- semi structured
- web pages
- web search engines
- document classification
- web content
- postal address
- html documents
- web data
- document representation
- vector space model
- keywords
- textual information
- topic specific
- machine learning
- natural language processing
- structured documents
- focused crawling
- search engine