LAMBERT: Layout-Aware Language Modeling for Information Extraction.
Lukasz GarncarekRafal PowalskiTomasz StanislawekBartosz TopolskiPiotr HalamaMichal TurskiFilip GralinskiPublished in: ICDAR (1) (2021)
Keyphrases
- language modeling
- information extraction
- information retrieval
- language model
- natural language processing
- retrieval model
- query expansion
- precision and recall
- probabilistic model
- n gram
- text mining
- semi structured
- named entities
- cross lingual
- test collection
- text documents
- web documents
- machine learning
- question answering
- document retrieval
- machine translation
- improvements in retrieval effectiveness
- web mining
- mixture model
- word segmentation
- search engine
- language modeling approaches
- statistical language models
- relevance model
- knowledge discovery
- information retrieval systems
- retrieval effectiveness
- text classification