Content provided by a16z and Andreessen Horowitz. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by a16z and Andreessen Horowitz or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://ar.player.fm/legal.
Text to Video: The Next Leap in AI Generation

32:31
 

General Partner Anjney Midha explores the cutting-edge world of text-to-video AI with AI researchers Andreas Blattmann and Robin Rombach.

Released in November, Stable Video Diffusion is their latest open-source generative video model, overcoming challenges in size and dynamic representation.

In this episode, Robin and Andreas share why translating text to video is complex, the key role of datasets, current applications, and the future of video editing.

Topics Covered:

00:00 - Text to Video: The Next Leap in AI Generation

02:41 - The Stable Diffusion backstory

04:25 - Diffusion vs autoregressive models

06:09 - The benefits of single step sampling

09:15 - Why generative video?

11:19 - Understanding physics through AI video

12:20 - The challenge of creating generative video

15:36 - Data set selection and training

17:50 - Structural consistency and 3D objects

19:50 - Incorporating LoRAs

21:24 - How should creators think about these tools?

23:46 - Open challenges in video generation

25:42 - Infrastructure challenges and future research

Resources:

Find Robin on Twitter: https://twitter.com/robrombach

Find Andreas on Twitter: https://twitter.com/andi_blatt

Find Anjney on Twitter: https://twitter.com/anjneymidha

Stay Updated:

Find a16z on Twitter: https://twitter.com/a16z

Find a16z on LinkedIn: https://www.linkedin.com/company/a16z

Subscribe on your favorite podcast app: https://a16z.simplecast.com/

Follow our host: https://twitter.com/stephsmithio

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

 
