Language Variety Identification Using Distributed Representations of Words and Documents.
Marc Franco-SalvadorFrancisco M. Rangel PardoPaolo RossoMariona TauléM. Antònia MartíPublished in: CLEF (2015)
Keyphrases
- distributed representations
- text documents
- indian languages
- keywords
- semantic constraints
- real valued
- neural network
- information retrieval
- document collections
- web documents
- noun phrases
- information retrieval systems
- document retrieval
- document classification
- semantic representation
- n gram
- natural language
- metadata
- natural language text
- database
- text mining
- fuzzy membership functions
- word sense disambiguation
- query processing
- relevant documents
- wordnet
- text classification
- domain specific