Introduction To Mechanistic Interpretability AI Safety Fundamentals: Alignment podcast


Tech Society Philosophy Blue Dot Impact

Content provided by BlueDot Impact. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by BlueDot Impact or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://ar.player.fm/legal.


23d ago 11:45


This introduction covers common mechanistic interpretability concepts to prepare you for the rest of this session's resources.

Original text: https://aisafetyfundamentals.com/blog/introduction-to-mechanistic-interpretability/
Author(s): Sarah Hastings-Woodhouse

A podcast by BlueDot Impact.
Learn more on the AI Safety Fundamentals website.


Chapters

1. Introduction to Mechanistic Interpretability (00:00:00)

2. Why might mechanistic interpretability be useful? (00:01:16)

3. Looking inside neural networks (00:03:34)

4. What makes mechanistic interpretability hard? (00:06:33)

5. Addressing polysemanticity (00:08:34)

85 episodes


