A novel text mining approach for scholar information extraction from web content in Chinese.
Xia Xie, Yu Fu, Hai Jin, Yaliang Zhao, Wenzhi Cao
Published in: Future Gener. Comput. Syst. (2020)
Keyphrases
- web content
- text mining
- information extraction
- web documents
- text summarization
- website
- natural language processing
- traditional Chinese medicine
- named entity recognition
- semi-structured
- text classification
- text documents
- textual data
- named entities
- free text
- precision and recall
- user-generated
- machine learning
- web pages
- unstructured text
- biomedical literature
- web mining
- semantic browsing
- web usage mining
- web users
- web data
- information retrieval
- relation extraction
- word segmentation
- structured data
- topic models
- knowledge discovery
- link analysis
- social media
- web browsing
- conditional random fields
- document clustering
- natural language text
- web information
- data analysis
- open domain
- data mining
- search engine
- digital libraries
- active learning
- web resources
- question answering
- databases