Quasi-Random Words and Limits of Word Sequences.
Hiêp Hàn, Marcos Kiwi, Matías Pavez-Signé
Published in: LATIN (2020)
Keyphrases
- related words
- n-gram
- english words
- word recognition
- word sense disambiguation
- unknown words
- word frequencies
- word meaning
- word pairs
- word segmentation
- multiword
- linguistic information
- syntactic categories
- variable length
- word spotting
- stop words
- distributional clustering
- word co-occurrence
- text corpus
- word similarity
- hidden markov models
- noun phrases
- linguistic knowledge
- chinese word segmentation
- keywords
- query words
- co-occurrence
- automatic transcription
- natural language text
- handwritten words
- out of vocabulary
- frequency counts
- language specific
- wordnet
- speech recognition systems
- text classification
- word frequency
- text categorization
- language independent
- word level
- pseudorandom
- numeral strings
- punctuation marks
- compound words
- short list
- word meanings
- lexical information
- parallel corpus
- syntactic analysis
- training corpus
- translation model
- handwriting recognition
- language model
- text corpora
- word sense
- semantic similarity
- natural language
- lexical features
- semantic relatedness between words
- information extraction
- latent topics
- historical manuscripts
- concept space
- syntactic information