Datasets for Math: From AIMO Competitions to Math Copilots for Research

Name: Datasets for Math: From AIMO Competitions to Math Copilots for Research
Start: 2025-02-26T14:00:00+00:00
End: 2025-02-26T15:00:00+00:00
Location: Virtually

CMSA EVENTS: CMSA NEW TECHNOLOGIES IN MATHEMATICS

When: February 26, 2025

2:00 pm - 3:00 pm

Where: Virtually

Speaker: SIMON FRIEDER - University of Oxford

This talk begins with a brief exposition of the AI Mathematical Olympiad (AIMO) on Kaggle, now in its second iteration, outlining datasets and models available to contestants. Taking a broader perspective, I then examine 1) the overarching issues the current datasets suffer from—such as binary evaluation or constrained sets of use cases— and 2) the trajectory they set for competition-style mathematical problem-solving, which is different from mathematical research practice. I argue for a fundamental shift in dataset structure and composition, both for training and evaluation, and introduce the idea of mapping mathematical workflows to data, a key example underscoring the need for this shift. I touch upon new thinking LLMs and their role in redefining LLM math evaluation, highlighting their implications for dataset design. Finally, I propose general improvements to the current state of mathematical datasets, including mathematical adaptations of dataset documentation (e.g., datasheets).

Via Zoom:

https://harvard.zoom.us/j/92220006185?pwd=V3mrb4cNSbgRXtNJtRJkTvWFVhmbI5.1
Password: cmsa