Word Error Correction of Continuous Speech Recognition Using WEB Documents for Spoken Document Indexing.
Hiromitsu NishizakiYoshihiro SekiguchiPublished in: ICCPOL (2006)
Keyphrases
- web documents
- error correction
- document indexing
- term weighting
- n gram
- information retrieval systems
- vector space model
- keywords
- document retrieval
- error correcting
- semi structured
- information extraction
- error detection
- information retrieval
- web pages
- web search engines
- co occurrence
- html documents
- text retrieval
- language modeling
- tf idf
- document representation
- text categorization
- term frequency
- data mining
- watermarking scheme
- multimedia