Results
Found 11 publication records. Showing all 11.
Hits | Authors | Title | Venue | Year
1 | Ricardo Baeza-Yates, Marina Estévez-Almenzar | The Relevance of Non-Human Errors in Machine Learning. | EBeM@IJCAI | 2022
1 | Anthony G. Cohn 0001, José Hernández-Orallo, Julius Sechang Mboli, Yael Moros-Daval, Zhiliang Xiang, Lexin Zhou | A Framework for Categorising AI Evaluation Instruments. | EBeM@IJCAI | 2022
1 | Vicky Charisi, Natalia Díaz Rodríguez, Barbara Mawhin, Luis Merino | On Young Children's Exploration, Aha! Moments and Explanations in Model Building for Self-Regulated Problem-Solving. | EBeM@IJCAI | 2022
1 | Chaina Oliveira, Ricardo B. C. Prudêncio | Item Response Theory to Evaluate Speech Synthesis: Beyond Synthetic Speech Difficulty. | EBeM@IJCAI | 2022
1 | Lexin Zhou, Fernando Martínez-Plumed, José Hernández-Orallo, Cèsar Ferri, Wout Schellaert | Reject Before You Run: Small Assessors Anticipate Big Language Models. | EBeM@IJCAI | 2022
1 | José Hernández-Orallo, Lucy Cheke, Joshua B. Tenenbaum, Tomer D. Ullman, Fernando Martínez-Plumed, Danaja Rutar, John Burden, Ryan Burnell, Wout Schellaert (eds.) | Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), Vienna, Austria, July 25th, 2022. | EBeM@IJCAI | 2022
1 | Jesse Davis, Lotte Bransen, Laurens Devos, Wannes Meert, Pieter Robberechts, Jan Van Haaren, Maaike Van Roy | Evaluating Sports Analytics Models: Challenges, Approaches, and Lessons Learned. | EBeM@IJCAI | 2022
1 | Konstantinos Voudouris, Niall Donnelly, Danaja Rutar, Ryan Burnell, John Burden, José Hernández-Orallo, Lucy Cheke | Evaluating Object Permanence in Embodied Agents using the Animal-AI Environment. | EBeM@IJCAI | 2022
1 | Victor Vikram Odouard, Melanie Mitchell | Evaluating Understanding on Conceptual Abstraction Benchmarks. | EBeM@IJCAI | 2022
1 | Raül Fabra-Boluda, Cèsar Ferri, Fernando Martínez-Plumed, María José Ramírez-Quintana | Robustness Testing of Machine Learning Families using Instance-Level IRT-Difficulty. | EBeM@IJCAI | 2022
1 | Yeu-Shin Fu, Wenbo Ge, Jo Plested | FERM: A FEature-space Representation Measure for Improved Model Evaluation. | EBeM@IJCAI | 2022