Finding Syntactic Structure in Unparsed Corpora The Gsearch Corpus Query System.
Steffan CorleyMartin CorleyFrank KellerMatthew W. CrockerShari TrewinPublished in: Comput. Humanit. (2001)
Keyphrases
- document corpus
- text corpus
- text corpora
- finding similar
- database
- query processing
- response time
- data sources
- query expansion
- database queries
- text data
- query evaluation
- statistical machine translation
- natural language processing
- keywords
- information retrieval
- topic segmentation
- wide coverage
- result set
- query formulation
- document clustering
- range queries
- parallel corpus
- user queries
- data structure
- text documents
- hand crafted
- retrieval systems
- annotated corpus
- information extraction