Rule-Based HierarchicalRank: An Unsupervised Approach to Visible Tag Extraction from Semi-structured Chinese Text.
Jicheng LeiJiali YuChunhui HeChong ZhangBin GeYiping BaoPublished in: PRICAI (3) (2019)
Keyphrases
- semi structured
- chinese text
- data extraction
- information extraction
- structured data
- web data extraction
- web information extraction
- word segmentation
- data model
- web documents
- information integration
- semi structured data
- semi structured documents
- text mining
- wrapper generation
- web data
- wrapper induction
- free text
- structured knowledge
- expert systems
- knowledge rich
- web data sources
- data sets
- html documents
- automatic extraction
- natural language processing
- keywords
- web mining
- real world
- database