Albanian Language Identification in Text Documents.
Klesti HoxhaArtur BaxhakuPublished in: CoRR (2019)
Keyphrases
- text documents
- language identification
- text mining
- text classification
- text categorization
- information extraction
- keywords
- document images
- topic models
- wordnet
- speaker identification
- news articles
- bag of words
- document clustering
- named entities
- text data
- data mining
- information retrieval
- natural language processing
- question answering
- pattern recognition
- search engine
- web search
- image representation
- knn
- multiscale
- knowledge base
- machine learning