LIDE: Language Identification from Text Documents.
Priyank MathurArkajyoti MisraEmrah BudurPublished in: CoRR (2017)
Keyphrases
- text documents
- language identification
- text mining
- keywords
- text classification
- information extraction
- text categorization
- news articles
- wordnet
- topic models
- document images
- speaker identification
- text data
- document clustering
- bag of words
- named entities
- image representation
- computer vision
- natural language processing
- web information retrieval
- pattern recognition
- machine learning
- data analysis
- image classification