COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions.
Amália Mendes, Sandra Antunes, Maria Fernanda Bacelar do Nascimento, João Miguel Casteleiro, Luísa Pereira, Tiago Sá
Published in: LREC (2006)
Keyphrases
- multiword
- lexical units
- wordnet
- lexical database
- context-sensitive
- text clustering
- word sense disambiguation
- language model
- natural language
- part of speech
- natural language processing
- semantic relations
- co-occurrence
- word sense
- semantic similarity
- semantic knowledge
- unknown words
- semantic information
- document representation
- machine learning
- semantic content
- text documents
- knowledge base