Login / Signup

Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus.

Andrey KutuzovMaria Kunilovskaya
Published in: AIST (2017)
Keyphrases
  • training corpus
  • training corpora
  • probabilistic model
  • statistical models
  • sentence level
  • multiword
  • search engine
  • feature selection
  • decision trees
  • co occurrence
  • statistical model
  • text retrieval