🏆 G-Pass@k Leaderboard 🏆
GPassK: Are Your LLMs Capable of Stable Reasoning?
📝 Notes
- Models labeled with 🌍 are Closed-source models, while others are Open-sourced.
- Models labeled with 🧮 are Mathematics-Specialization models.
- Models labeled with 💡 are o1-like models with Long-cot.