Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing.
Ken IchikawaSatoru TsugeNorihide KitaokaKazuya TakedaKenji KitaPublished in: APSIPA (2013)
Keyphrases
- spoken document retrieval
- latent semantic indexing
- information retrieval
- text retrieval
- cross language
- spoken documents
- document representation
- test collection
- vector space model
- document collections
- vector space
- n gram
- singular value decomposition
- out of vocabulary
- broadcast news
- latent semantic analysis
- language model
- retrieval systems
- relevant documents
- information retrieval systems
- document space
- bag of words
- information extraction
- text mining
- named entity recognition
- web documents
- document retrieval
- image retrieval
- co occurrence
- language modeling
- query expansion
- word level
- document clustering