Measuring similarity between Karel programs using character and word n-grams.
Grigori SidorovMartín Ibarra RomeroIlia MarkovRafael Guzmán-CabreraLiliana Chanona-HernándezFrancisco VelasquezPublished in: Program. Comput. Softw. (2017)
Keyphrases
- n gram
- measuring similarity
- similarity measure
- language model
- language independent
- text classification
- bag of words
- language modeling
- word segmentation
- web documents
- word level
- inside outside algorithm
- part of speech
- probabilistic model
- character n grams
- nearest neighbor
- pairwise
- information retrieval
- statistical language modeling
- machine learning