MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction.
Tanishq GuptaMohd ZakiN. M. Anoop Krishnan MausamPublished in: CoRR (2021)
Keyphrases
- language model
- text mining
- information extraction
- information retrieval
- language modeling
- retrieval model
- n gram
- query expansion
- speech recognition
- natural language processing
- document retrieval
- probabilistic model
- text documents
- web mining
- textual data
- test collection
- semi structured
- named entities
- textual documents
- document clustering
- ad hoc information retrieval
- machine translation
- query terms
- mixture model
- text classification
- context sensitive
- machine learning
- language modelling
- statistical language models
- language model for information retrieval
- structured data
- topic models
- language models for information retrieval
- smoothing methods
- translation model
- knowledge discovery
- transfer learning
- data mining
- document ranking
- vector space model
- web documents
- relevant documents
- relevance feedback
- information retrieval systems
- active learning
- pseudo relevance feedback
- relevance model
- sentiment analysis
- statistical machine translation
- text categorization
- document collections
- data analysis
- natural language
- search engine