Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus.
Samuel ReeseGemma BoledaMontse CuadrosLluís PadróGerman RigauPublished in: LREC (2010)
Keyphrases
- word sense
- cross lingual
- word sense disambiguation
- co occurrence
- cross language
- wordnet
- machine translation
- language independent
- language modeling
- text processing
- text classification
- semantic relations
- word meaning
- unknown words
- language specific
- digital libraries
- tf idf
- language model
- news articles
- transfer learning
- cross language information retrieval
- artificial intelligence
- probabilistic topic models
- information retrieval