Improving Native Language Identification with TF-IDF Weighting.
Binyam Gebrekidan Gebre, Marcos Zampieri, Peter Wittenburg, Tom Heskes
Published in: BEA@NAACL-HLT (2013)
Keyphrases
- tf idf
- language identification
- weighting scheme
- term weighting
- vector space model
- information retrieval
- text documents
- term frequency
- document clustering
- text categorization
- retrieval model
- inverse document frequency
- speaker identification
- ranking algorithm
- weighting schemes
- document images
- machine learning
- data mining
- document retrieval
- image classification
- semi supervised
- feature selection