Quasi-random words and limits of word sequences.
Hiêp HànMarcos KiwiMatías Pavez-SignéPublished in: CoRR (2020)
Keyphrases
- related words
- n gram
- english words
- word recognition
- unknown words
- word meaning
- word sense disambiguation
- word pairs
- word segmentation
- word frequencies
- lexical information
- text corpus
- stop words
- hidden markov models
- multiword
- keywords
- word similarity
- chinese word segmentation
- syntactic categories
- word level
- noun phrases
- linguistic information
- word co occurrence
- word spotting
- spoken document retrieval
- variable length
- word frequency
- linguistic knowledge
- distributional clustering
- pseudorandom
- frequency counts
- handwritten words
- numeral strings
- query words
- lexical features
- speech recognition systems
- compound words
- punctuation marks
- short list
- wordnet
- training corpus
- chinese text
- co occurrence
- handwriting recognition
- word meanings
- latent topics
- information extraction
- semantic relatedness between words
- automatic transcription
- language model
- topic models
- word order
- translation model
- syntactic information
- parallel corpus
- language specific