Discovery of Frequent Word Sequences in Text.
Helena Ahonen-Myka
Published in: Pattern Detection and Discovery (2002)
Keyphrases
- keywords
- text corpus
- text input
- English words
- frequent pattern discovery
- English text
- lexical features
- text segments
- syntactic information
- related words
- word level
- noun phrases
- natural language text
- sentence level
- word counts
- string matching
- sentence similarity
- linguistic information
- text retrieval
- word pairs
- frequency counts
- Chinese text
- pattern discovery
- multiword
- sequential patterns
- hidden Markov models
- printed text
- lexical information
- information retrieval
- complex patterns
- frequently occurring
- n-gram
- event sequences
- word sense disambiguation
- word sense
- text corpora
- named entity recognizer
- data mining
- text documents
- historical manuscripts
- document analysis
- printed documents
- automatically discovering
- biological sequences
- text queries
- stop words
- syntactic analysis