Automated labeling of PDF mathematical exercises with word N-grams VSM classification.
Taisei YamauchiBrendan FlanaganRyosuke NakamotoYiling DaiKyosuke TakamiHiroaki OgataPublished in: Smart Learn. Environ. (2023)
Keyphrases
- n gram
- text classification
- language model
- language independent
- bag of words
- word segmentation
- machine learning
- word level
- language modeling
- character n grams
- feature extraction
- vector space model
- part of speech
- variable length
- decision trees
- term frequency
- co occurrence
- labeled data
- language modelling
- inside outside algorithm
- document representation
- probability density function
- text retrieval
- semi supervised learning
- image classification
- natural language processing