From Text to Insight: Large Language Models for Materials Science Data Extraction.
Mara Schilling-WilhelmiMartiño Ríos-GarcíaSherjeel ShabihMaría Victoria GilSantiago MiretChristoph T. KochJosé A. MárquezKevin Maik JablonkaPublished in: CoRR (2024)
Keyphrases
- language model
- data extraction
- materials science
- information retrieval
- language modeling
- html pages
- semi structured
- n gram
- information extraction
- document retrieval
- data integration
- retrieval model
- probabilistic model
- test collection
- text retrieval
- statistical language models
- text mining
- web pages
- query expansion
- scientific data
- query terms
- web documents
- text documents
- keywords
- language models for information retrieval
- smoothing methods
- related fields
- database
- relevance model
- databases
- query interface
- web search engines
- natural language
- bayesian networks
- e learning