X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents.
Sotaro TakeshitaTommaso GreenNiklas FriedrichKai EckertSimone Paolo PonzettoPublished in: CoRR (2022)
Keyphrases
- cross lingual
- document clustering
- parallel corpora
- parallel corpus
- pseudo feedback
- indian languages
- language modeling
- cross lingual information retrieval
- multi document summarization
- machine translation
- language independent
- cross language
- information retrieval
- document collections
- text classification
- linguistic resources
- word sense
- text documents
- information retrieval systems
- digital libraries
- source language
- clustering algorithm
- document retrieval
- translation model
- relevant documents
- retrieval systems
- web documents
- query expansion
- keywords
- monolingual retrieval
- query translation
- cross language information retrieval
- metadata
- retrieved documents
- latent semantic analysis
- vector space model
- semi supervised learning
- probabilistic topic models
- knowledge discovery
- image retrieval