Large language models overcome the challenges of unstructured text data in ecology.
Andry CastroJoão PintoLuís ReinoPavel PipekCésar CapinhaPublished in: Ecol. Informatics (2024)
Keyphrases
- language model
- text data
- structured data
- language modeling
- text mining
- text classification
- document representation
- high dimensional
- document retrieval
- n gram
- information retrieval
- semi structured
- text documents
- probabilistic model
- retrieval model
- test collection
- document collections
- query expansion
- language models for information retrieval
- high dimensional data
- pattern recognition
- query terms
- data sets
- information extraction
- vector space model
- textual data
- text classifiers
- relevance model
- smoothing methods
- decision trees