基于词频统计规律的文本数据预处理方法 (Text Data Preprocessing Based on Term Frequency Statistics Rules).
Yunxian Chi, Shuliang Zhao, Yan Luo, Lin Gao, Junpeng Zhao, Chao Li
Published in: 计算机科学 (Computer Science) (2017)
Keyphrases
- co-occurrence
- data preprocessing
- term frequency
- text documents
- WordNet
- topic models
- data mining
- tf-idf
- preprocessing
- text categorization
- text classification
- text mining
- preprocessing step
- feature selection
- average precision
- bag of words
- retrieval model
- keywords
- association rules
- information extraction
- kNN
- web documents
- information retrieval
- document representation
- database
- neural network
- knowledge discovery
- probabilistic model
- search engine
- machine learning