Login / Signup
EBeM@IJCAI
2022
2022
2022
Keyphrases
Publications
2022
Ricardo Baeza-Yates
,
Marina Estévez-Almenzar
The Relevance of Non-Human Errors in Machine Learning.
EBeM@IJCAI
(2022)
Anthony G. Cohn
,
José Hernández-Orallo
,
Julius Sechang Mboli
,
Yael Moros-Daval
,
Zhiliang Xiang
,
Lexin Zhou
A Framework for Categorising AI Evaluation Instruments.
EBeM@IJCAI
(2022)
Vicky Charisi
,
Natalia Díaz Rodríguez
,
Barbara Mawhin
,
Luis Merino
On Young Children's Exploration, Aha! Moments and Explanations in Model Building for Self-Regulated Problem-Solving.
EBeM@IJCAI
(2022)
Chaina Oliveira
,
Ricardo B. C. Prudêncio
Item Response Theory to Evaluate Speech Synthesis: Beyond Synthetic Speech Difficulty.
EBeM@IJCAI
(2022)
Lexin Zhou
,
Fernando Martínez-Plumed
,
José Hernández-Orallo
,
Cèsar Ferri
,
Wout Schellaert
Reject Before You Run: Small Assessors Anticipate Big Language Models.
EBeM@IJCAI
(2022)
Jesse Davis
,
Lotte Bransen
,
Laurens Devos
,
Wannes Meert
,
Pieter Robberechts
,
Jan Van Haaren
,
Maaike Van Roy
Evaluating Sports Analytics Models: Challenges, Approaches, and Lessons Learned.
EBeM@IJCAI
(2022)
Konstantinos Voudouris
,
Niall Donnelly
,
Danaja Rutar
,
Ryan Burnell
,
John Burden
,
José Hernández-Orallo
,
Lucy Cheke
Evaluating Object Permanence in Embodied Agents using the Animal-AI Environment.
EBeM@IJCAI
(2022)
Victor Vikram Odouard
,
Melanie Mitchell
Evaluating Understanding on Conceptual Abstraction Benchmarks.
EBeM@IJCAI
(2022)
Raül Fabra-Boluda
,
Cèsar Ferri
,
Fernando Martínez-Plumed
,
María José Ramírez-Quintana
Robustness Testing of Machine Learning Families using Instance-Level IRT-Difficulty.
EBeM@IJCAI
(2022)
Yeu-Shin Fu
,
Wenbo Ge
,
Jo Plested
FERM: A FEature-space Representation Measure for Improved Model Evaluation.
EBeM@IJCAI
(2022)
volume 3169, 2022
Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), Vienna, Austria, July 25th, 2022.
EBeM@IJCAI
3169 (2022)