Document and sentence alignment in comparable corpora using bipartite graph matching.
Zeinab RahimiKaveh TaghipourShahram KhadiviNasim AfhamiPublished in: IST (2012)
Keyphrases
- bipartite graph matching
- comparable corpora
- text documents
- bipartite graph
- cross language information retrieval
- parallel corpora
- text summarization
- graph matching
- parallel corpus
- natural language
- information retrieval systems
- sentence level
- information retrieval
- keywords
- source language
- dynamic time warping
- document collections
- shape context
- news articles
- feature selection
- cross lingual
- retrieval systems
- query terms
- document clustering
- part of speech
- text corpora
- document retrieval
- text classification
- semantic information
- user queries
- language model
- translation model
- machine translation
- text collections
- information extraction
- query translation
- text mining
- tf idf