EBeM@IJCAI

Publications

2022

Ricardo Baeza-Yates, Marina Estévez-Almenzar
The Relevance of Non-Human Errors in Machine Learning. EBeM@IJCAI (2022)
Anthony G. Cohn, José Hernández-Orallo, Julius Sechang Mboli, Yael Moros-Daval, Zhiliang Xiang, Lexin Zhou
A Framework for Categorising AI Evaluation Instruments. EBeM@IJCAI (2022)
Vicky Charisi, Natalia Díaz Rodríguez, Barbara Mawhin, Luis Merino
On Young Children's Exploration, Aha! Moments and Explanations in Model Building for Self-Regulated Problem-Solving. EBeM@IJCAI (2022)
Chaina Oliveira, Ricardo B. C. Prudêncio
Item Response Theory to Evaluate Speech Synthesis: Beyond Synthetic Speech Difficulty. EBeM@IJCAI (2022)
Lexin Zhou, Fernando Martínez-Plumed, José Hernández-Orallo, Cèsar Ferri, Wout Schellaert
Reject Before You Run: Small Assessors Anticipate Big Language Models. EBeM@IJCAI (2022)
Jesse Davis, Lotte Bransen, Laurens Devos, Wannes Meert, Pieter Robberechts, Jan Van Haaren, Maaike Van Roy
Evaluating Sports Analytics Models: Challenges, Approaches, and Lessons Learned. EBeM@IJCAI (2022)
Konstantinos Voudouris, Niall Donnelly, Danaja Rutar, Ryan Burnell, John Burden, José Hernández-Orallo, Lucy Cheke
Evaluating Object Permanence in Embodied Agents using the Animal-AI Environment. EBeM@IJCAI (2022)
Victor Vikram Odouard, Melanie Mitchell
Evaluating Understanding on Conceptual Abstraction Benchmarks. EBeM@IJCAI (2022)
Raül Fabra-Boluda, Cèsar Ferri, Fernando Martínez-Plumed, María José Ramírez-Quintana
Robustness Testing of Machine Learning Families using Instance-Level IRT-Difficulty. EBeM@IJCAI (2022)
Yeu-Shin Fu, Wenbo Ge, Jo Plested
FERM: A FEature-space Representation Measure for Improved Model Evaluation. EBeM@IJCAI (2022)

volume 3169, 2022

Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), Vienna, Austria, July 25th, 2022. EBeM@IJCAI 3169 (2022)