19 subscribers
🤖 DeepSeek-V3: A 671B Parameter Mixture-of-Experts Language Model
This episode covers DeepSeek-V3, a 671B parameter Mixture-of-Experts language model. It highlights the model's architecture, including its load balancing and multi-token prediction strategies, and its efficient training process using FP8 precision. Benchmark results show DeepSeek-V3 performing strongly against other open-source models and some closed-source models, particularly on math and code tasks. The source document also gives instructions for running DeepSeek-V3 locally with various frameworks and hardware, including NVIDIA and AMD GPUs and Huawei Ascend NPUs. Licensing and contact information round out the document.
360 episodes
All episodes
🔑 AWS IAM Identity Center and CLI Authentication Guide 12:14
🧊 BigData - Apache Iceberg and Streaming 29:06
📊 BigData Devops - ClickHouse & Grafana: High Cardinality Metrics 13:21
🤔 Distributed Systems - Modern Resiliency 23:52
🤖 StartupTool - RunLLM - AI Support Engineer 19:50
🤔 Cloud - AWS Account Management Strategies 14:35
📚 AI - Reinforcement Learning from Human Feedback 22:11
⚡️ BigData - Apache Kafka Architecture 19:19
🔑 Startup Tool - Hanko: Open-Source Authentication and User Management 17:45
📱 Hyperview: Native Mobile Apps Simplified 20:45
📊 Startup Tool - Formbricks: Open Source Survey Platform 19:18
🛒 Startup Tool - E-Commerce Platform with Medusa 29:05
🤖 DeepSeek-V3: A 671B Parameter Mixture-of-Experts Language Model 30:29
🌐 Startup Tool - Tolgee Open-Source Localization Platform 13:44
🤖 Startup Tool - Postiz: AI-Powered Social Media Scheduling Tool 17:23

