Words ranking and Hirsch index for identifying the core of the hapaxes in political texts.
Valerio FiccadentiRoy CerquetiMarcel AusloosGurjeet DhesiPublished in: CoRR (2020)
Keyphrases
- textual features
- english words
- text documents
- keywords
- chinese texts
- ranking algorithm
- index terms
- world knowledge
- short texts
- linguistic information
- short list
- natural language text
- short text
- training corpus
- ranking functions
- n gram
- syntactic analysis
- syntactic structures
- word sense
- database
- related words
- index structure
- text corpus
- semantic categories
- web search
- linguistic analysis
- punctuation marks
- semantic relatedness between words
- word similarity
- document representation
- e government
- document collections
- information extraction
- natural language
- learning to rank
- ranked list