🏆 LiveMathBench Leaderboard 🏆
GPassK: Are Your LLMs Capable of Stable Reasoning?
📢 Calling for Evaluation! If you want to see your model on the leaderboard, feel free to contact us!!!
📝 Notes
- Models labeled with 🌍 are Closed-source models, while others are Open-sourced.
- Models labeled with 🧮 are Mathematics-Specialization models.
- Models labeled with 💡 are o1-like models with Long-cot.