LangTest: A comprehensive evaluation library for custom LLM and NLP models.
Arshaan NazirThadaka Kalyan ChakravarthyDavid Amore CecchiniRakshit KhajuriaPrikshit SharmaAli Tarik MirikVeysel KocamanDavid TalbyPublished in: Softw. Impacts (2024)
Keyphrases
- comprehensive evaluation
- databases
- natural language processing
- neural network
- systematic evaluation
- website
- expert systems
- experimental evaluation
- knowledge representation
- domain specific
- database
- case study
- multiscale
- prior knowledge
- probabilistic model
- information extraction
- artificial intelligence
- process model
- neural network model
- genetic algorithm