Grammar-compressed Self-index with Lyndon Words.
Kazuya Tsuruta, Dominik Köppl, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
Published in: CoRR (2020)
Keyphrases
- compressed text
- linguistic knowledge
- index terms
- grammar rules
- suffix array
- word order
- data structure
- n gram
- word sense disambiguation
- natural language
- index structure
- pattern matching
- data compression
- keywords
- context free grammars
- dependency structure
- inverted file
- english words
- natural language processing
- database
- index table
- syntactic analysis
- related words
- word recognition
- inverted index
- text classification
- information retrieval