"FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning ..."

Elliot Glazer et al. (2024)

Details and statistics

DOI: 10.48550/ARXIV.2411.04872

access: open

type: Informal or Other Publication

metadata version: 2025-01-01

dblp key: journals/corr/abs-2411-04872

persistent URL: https://dblp.org/rec/journals/corr/abs-2411-04872

Elliot Glazer, Ege Erdil, Tamay Besiroglu, Diego Chicharro, Evan Chen, Alex Gunning, Caroline Falkman Olsson, Jean-Stanislas Denain, Anson Ho, Emily de Oliveira Santos, Olli Järviniemi, Matthew Barnett, Robert Sandler, Matej Vrzala, Jaime Sevilla, Qiuyu Ren, Elizabeth Pratt, Lionel Levine, Grant Barkley, Natalie Stewart, Bogdan Grechuk, Tetiana Grechuk, Shreepranav Varma Enugandla, Mark Wildon:
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI. CoRR abs/2411.04872 (2024)
