JDocQA: Japanese Document Question Answering Dataset for Generative Language Models.
Eri Onami, Shuhei Kurita, Taiki Miyanishi, Taro Watanabe
Published in: LREC/COLING (2024)
Keyphrases
- passage retrieval
- language model
- question answering
- information retrieval
- document retrieval
- language modeling framework
- language modeling
- retrieval model
- probabilistic model
- speech recognition
- vector space model
- query expansion
- cross-language
- query terms
- relevance model
- n-gram
- sentence retrieval
- document representation
- generative model
- information extraction
- test collection
- retrieval systems
- question classification
- relevant documents
- document collections
- text retrieval
- tf-idf
- natural language processing
- question answering systems
- natural language
- information retrieval systems
- topic models
- audio-visual
- text summarization
- text mining
- semantic roles
- keywords