Post
1617
o3-mini is slightly better than R1, but lags behind Claude. Sorry folks, no new SOTA ๐
But OAI definitely owns the fashion of API. temperature and top_p are history now, reasoning_effort will be copied by other vendors.
onekq-ai/WebApp1K-models-leaderboard
But OAI definitely owns the fashion of API. temperature and top_p are history now, reasoning_effort will be copied by other vendors.
onekq-ai/WebApp1K-models-leaderboard