Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling.
Piotr KlosowskiPublished in: EURASIP J. Audio Speech Music. Process. (2017)
Keyphrases
- language modelling
- n gram
- statistical analysis
- language specific
- language model
- parallel corpus
- speech recognition
- multiword
- text classification
- statistical machine translation
- language modeling
- natural language
- sentence level
- speech sounds
- context dependent
- bag of words
- co occurrence
- cross lingual
- part of speech
- target language
- machine translation
- ad hoc retrieval
- document ranking
- tf idf
- web documents
- query expansion
- information extraction
- keywords