Hapax remains: Regularity of low-frequency words in authorial texts.
Dan FaltýnekVladimír MatlachPublished in: Digit. Scholarsh. Humanit. (2022)
Keyphrases
- low frequency
- high frequency
- chinese texts
- english words
- frequency domain
- text documents
- wavelet transform
- keywords
- natural language text
- linguistic information
- world knowledge
- wavelet analysis
- wavelet coefficients
- frequency band
- syntactic structures
- subband
- punctuation marks
- low pass
- text corpus
- high frequency components
- discrete wavelet transform
- high resolution
- training corpus
- syntactic analysis
- natural language
- word sense disambiguation
- n gram
- contourlet transform
- original images
- electromagnetic fields
- image compression
- word sense
- wavelet domain
- low and high frequency