18 subscribers
LLM Evaluation: Opik with Gideon Mendels
Gideon Mendels (GitHub: @gidim) is the co-founder and CEO of Comet, the end-to-end model evaluation platform for AI developers. Among the tools in the Comet ecosystem is Opik, an open-source solution for evaluating, testing, and monitoring LLM applications. Opik allows users to log traces and spans, define and compute evaluation metrics, score LLM outputs, compare performance across app versions, and more. Because Opik is a true open-source project, its full feature set is available to anyone, completely free.
Contributor is looking for a community manager! If you want to know more, shoot us an email at eric@scalevp.com.
Subscribe to Contributor on Substack for email notifications!
In this episode we discuss:
How Opik’s popularity blew up beyond the Comet team’s expectations
Why CI/CD is especially important in an end-to-end platform
Gideon’s “severe allergy” to “fake open-source” offerings
Why the number of dedicated machine learning engineers is actually going down
Eric’s thoughts on what it means for venture capital to invest in the LLM space
88 episodes
All episodes
Messages, Not Metadata: Session with Kee Jefferys (39:33)
Moving Money: Formance with Clément Salaün (35:36)
LLM Evaluation: Opik with Gideon Mendels (37:59)
Mobile Observability on OpenTelemetry: Embrace with Hanson Ho (28:43)
No PhD Required: Restate with Stephan Ewen (32:34)
Ground Control: Lunar with Eyal Solomon (27:17)
Metadata Management: DataHub with Shirshanka Das (36:56)
Take Your Own Advice: vlcn with Matt Wonlaw (31:58)
Secret Sauce: Amplication with Yuval Hazaz (31:22)
Robust Observability: OpenTelemetry with Austin Parker (35:01)
Never Build Permissions Again: OPAL with Or Weis (37:15)
Oxygen Deprivation: FerretDB with Peter Farkas (33:55)
The Duke of SQLite: Litestream with Ben Johnson (34:14)
Rust Never Sleeps: Tonic with Lucio Franco (36:24)