انتقل إلى وضع عدم الاتصال باستخدام تطبيق Player FM !
GPT-5 has Arrived
Manage episode 498916367 series 3611272
GPT-5 will change how hundreds of millions of people use AI. Yes, you might have to forgive the chart crimes, the underwhelming livestream and Altman hype… But it’s a good model. I have read the 50 page system card in full, have the benchmark scores, coding tests, and things you might have missed.
https://app.grayswan.ai/ai-explained
Announcement: https://openai.com/index/introducing-gpt-5/
System Card: https://cdn.openai.com/pdf/8124a3ce-ab78-4f06-96eb-49ea29ffb52f/gpt5-system-card-aug7.pdf
Extra Paper: https://cdn.openai.com/pdf/be60c07b-6bc2-4f54-bcee-4141e1d6c69a/gpt-5-safe_completions.pdf
Altman tweet: https://x.com/sama/status/1953551377873117369
Livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo
METR Report: https://metr.github.io/autonomy-evals-guide/gpt-5-report/
ARC-AGI-2: https://x.com/fchollet/status/1953511631054680085
Claude Opus 4.1: https://www.anthropic.com/news/claude-opus-4-1
MMMU: https://mmmu-benchmark.github.io/
Cursor Praise: https://x.com/ryolu_/status/1953531724895596669
36 حلقات
Manage episode 498916367 series 3611272
GPT-5 will change how hundreds of millions of people use AI. Yes, you might have to forgive the chart crimes, the underwhelming livestream and Altman hype… But it’s a good model. I have read the 50 page system card in full, have the benchmark scores, coding tests, and things you might have missed.
https://app.grayswan.ai/ai-explained
Announcement: https://openai.com/index/introducing-gpt-5/
System Card: https://cdn.openai.com/pdf/8124a3ce-ab78-4f06-96eb-49ea29ffb52f/gpt5-system-card-aug7.pdf
Extra Paper: https://cdn.openai.com/pdf/be60c07b-6bc2-4f54-bcee-4141e1d6c69a/gpt-5-safe_completions.pdf
Altman tweet: https://x.com/sama/status/1953551377873117369
Livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo
METR Report: https://metr.github.io/autonomy-evals-guide/gpt-5-report/
ARC-AGI-2: https://x.com/fchollet/status/1953511631054680085
Claude Opus 4.1: https://www.anthropic.com/news/claude-opus-4-1
MMMU: https://mmmu-benchmark.github.io/
Cursor Praise: https://x.com/ryolu_/status/1953531724895596669
36 حلقات
كل الحلقات
×مرحبًا بك في مشغل أف ام!
يقوم برنامج مشغل أف أم بمسح الويب للحصول على بودكاست عالية الجودة لتستمتع بها الآن. إنه أفضل تطبيق بودكاست ويعمل على أجهزة اندرويد والأيفون والويب. قم بالتسجيل لمزامنة الاشتراكات عبر الأجهزة.