Searching for the X-Factor: Exploring Corpus Subjectivity for Word Embeddings.
Maksim TkachenkoChong Cher ChiaHady W. LauwPublished in: ACL (1) (2018)
Keyphrases
- word frequencies
- english words
- training corpus
- sentence level
- noun phrases
- text corpus
- linguistic information
- word pairs
- word sense
- multiword
- natural language text
- unknown words
- lexical features
- co occurrence
- n gram
- parallel corpus
- string matching
- statistical machine translation
- part of speech
- text classification
- sentiment classification
- word recognition
- conversational speech
- related words
- low dimensional
- spontaneous speech
- spoken document retrieval
- world knowledge
- wordnet
- vector space
- sentiment analysis
- manifold learning