🏆 MathBench Leaderboard 🏆
MathBench: Evaluating the Theory and Application Proficiency
of LLMs with a Hierarchical Mathematics Benchmark [ACL 2024 Findings]
📝 Notes
- Models labeled with 🌍 are closed-source; all others are open-source.
- Models labeled with 🧮 are mathematics-specialized; all others are general-purpose chat models.
- Feel free to file a request to add your model to our leaderboard.