GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree.
Zirui ShaoFeiyu GaoZhongda QiHangdi XingJiajun BuZhi YuQi ZhengXiaozhong LiuPublished in: EMNLP (2023)
Keyphrases
- language model
- language modeling
- n gram
- probabilistic model
- information retrieval
- document retrieval
- language modelling
- speech recognition
- retrieval model
- smoothing methods
- context sensitive
- mixture model
- ad hoc information retrieval
- query expansion
- test collection
- web documents
- query terms
- document representation
- pseudo relevance feedback
- statistical language models
- document length
- web resources
- language model for information retrieval
- relevance model
- query specific
- document structure
- document ranking
- translation model
- language models for information retrieval
- web pages