Savvy Painter, hosted by Antrese Wood, offers a treasure trove of insights artists can't afford to miss. Visit https://savvypainter.com Antrese's teachings focus on nurturing a creative mindset and prioritizing mastery over perfection, making it a must-listen resource for artists worldwide. Whether you're an emerging artist looking to hone your skills or an established pro seeking fresh perspectives, the show offers practical advice and inspiration But the real magic happens when you apply A ...
…
continue reading
My life in small chunks, by Colin Walker. I use photography and my love for Fire Safety to help keep me sane. My family (wife and three boys) are my rock. I can't thank them enough for their support.
…
continue reading
Why does Springville, Utah have an art museum? Why doesn't it have your favorite restaurant? What will the city look like in 2050? How can you get a recycling can?The Art Cityscape will give you a fast-paced and unique look at Utah's Art City. We'll answer your questions and tell you what's happening in the city and why.
…
continue reading
WFUV's award-winning, weekly public affairs program. Host George Bodarky covers New York City issues from the humorous to the sobering; whether it's an examination of local hipsters, homelessness or historic architecture. "Cityscape gives me 30 minutes to focus on a particular issue, to really delve into it," says Bodarky. "I love to walk," he says. "I will just walk around Manhattan and discover new neighborhoods, new communities, and to me that's the best thing... Much of what I bring to t ...
…
continue reading
Talking Shot is a relaxed look at the world of Photography and Filmmaking with lots of special guests and a variety of topics. We are frequently joined by special guests, who provide an even wider range of topics. Check out our podcast library and find out a bit more about the hosts at http://www.talkingshot.co.uk
…
continue reading
Your go-to podcast for everything Sims related, including The Sims 4, The Sims Mobile and The Sims FreePlay! Plus, other games such as Life by You! Join Dan of BeyondSims.com and Rachael of rachybop.com as they discuss everything about these games, their lives and more in this regular podcast!
…
continue reading
What does the word 'community' mean to you? An homogenous group of people united by faith, sexuality or another form of identity? Or perhaps it's about the place you grew up, or the people you work with? Recovering Community is a podcast series from the University of Glasgow's School of Social and Political Sciences about community; what it means; how it's formed and how it is rebuilt. Les Back is joined by academics, campaigners, volunteers and artists to talk about how communities respond ...
…
continue reading
Novelist, short story writer, Brooklynite, and host of The 24-Hour Room virtual writers space.
…
continue reading
A daily update on the latest AI Research Papers. We provide a high level overview of a handful of papers each day and will link all papers in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.
…
continue reading
Mythology extends past the Greek and Roman pantheon, and European fairy tales are only one section in the worldwide folklore lexicon. Every other Monday, join your host Gree, long time folk tale fan and short time global studies scholar, as we delve into stories from traditionally overlooked cultures from all over the planet. If you’re interested in hearing about myths and fables and legends from civilizations that have honed storytelling over thousands of years, “Colored Folklore” is the po ...
…
continue reading
Take a look behind the scenes of making Blue Planet II and further uncover our hidden underwater world. With insights from Sir David Attenborough, Becky Ripley and Emily Knight dig deeper into the series to uncover even more about our hidden, underwater world. Plus exclusive interviews from the people who made it, you can expect mind blowing facts, exclusive access and a fascinating look at the deep sea. Presented by Becky Ripley and Emily Knight.
…
continue reading
A podcast about planning South Asian destination weddings. Where we share stories, expert insights, and inspiration about planning South Asian destination weddings anywhere in the world.
…
continue reading
1
How to Cultivate Creative Confidence As an Artist
55:45
55:45
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
55:45
As an artist, you might have been taught (implicitly or explicitly) that your work doesn’t matter. Many artists I’ve worked with have heard it in school, at home, and in the media. Yet, your work as an artist does matter. It can help others feel, connect, and demonstrate the beauty of the world and the human experience. But only when you’re centere…
…
continue reading
1
Unlocking the Secrets of Color: Robert Gamblin and Scott Gellatly Answer Your Questions
1:26:33
1:26:33
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
1:26:33
Have you been experimenting with your paint colors lately? Do you wonder about different colors and the best way to mix them? Well, you’re in luck! Robert Gamblin and product manager Scott Gellatly are here to answer more of your questions in our special color episode! In this episode of The Savvy Painter podcast, you’ll learn about the pigments us…
…
continue reading
1
Oil Painting Q&A: Tips, Tricks, and More with Gamblin Artists Colors
1:36:49
1:36:49
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
1:36:49
You’ve heard that old phrase, “Jack of all trades and master of none,” right? Instead of being a jack of all trades, Robert Gamblin and his team at Gamblin Artists Colors have decided to focus on being a master of one: oil paint products. Their narrow focus has paid off as they display an amazing passion for detail and improvement in their product …
…
continue reading
Your go-to podcast for everything Sims related, including The Sims 4, The Sims Mobile and The Sims FreePlay! Plus, other simulation games such as Paralives! In this episode we dig into all of the latest news from The Sims world from July through to now - including The Sims 5 no longer happening, The Sims 4 Life and Death expansion pack, and so much…
…
continue reading
1
How to Confidently Put Together a Successful Art Show
26:37
26:37
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
26:37
As an artist, you’re always about the art. Even for a show, the focus is on creating pieces, not the other things that go into making the show itself a great event. Then when you realize it’s time to plan everything out, you’re instantly overwhelmed by all that’s involved. What you need is a guide or template that can help you prepare and eliminate…
…
continue reading
1
How to Stop Procrastinating and Finally Price Your Artwork
23:53
23:53
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
23:53
Do you ever feel like pricing your art is a scary mystery? You’re not alone! In the previous episode, you heard me share several pricing techniques you can use for your art. And while you might think there’s a way to discover that one perfect price, here’s a surprising truth: the technique you use doesn’t really matter! In this episode of The Savvy…
…
continue reading
1
Roots and Futures in Sheffield: Growing Heritage Around Communities
33:39
33:39
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
33:39
For this special bonus edition of Recovering Community, Les Back travels south of the border, to Sheffield to look at how rethinking the relationship between heritage and local communities can make them more inclusive, particularly for the most marginalised. Here, the Roots and Futures project is listening to the perspectives of under-served commun…
…
continue reading
1
How to Confidently Price Your Artwork Without Overwhelm
29:34
29:34
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
29:34
Pricing your artwork can be tricky, but it doesn’t have to feel overwhelming. In this episode, I’m breaking down common pricing methods like cost-based and market-based approaches, helping you figure out what works best for you. Whether you’re just starting or more experienced, you’ll walk away with a clearer idea of how to confidently set prices f…
…
continue reading
1
Improving Agent Design, JPEG-LM's Visual Breakthrough, TurboEdit's Real-Time Image Edits, Video Segmentation Advances, LLMs Learning Like Humans, RL Benchmarks
16:00
16:00
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
16:00
xGen-MM (BLIP-3): A Family of Open Large Multimodal ModelsJPEG-LM: LLMs as Image Generators with Canonical Codec RepresentationsAutomated Design of Agentic SystemsTurboEdit: Instant text-based image editingSurgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame PruningFine-tuning Large Language Models with Human-inspired Lea…
…
continue reading
1
Science & Clinical LLMs Leaps, Enhancing Small Model Reasoning, New Frontiers in Controlled Media Generation
14:24
14:24
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
14:24
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryMed42-v2: A Suite of Clinical LLMsMutual Reasoning Makes Smaller LLMs Stronger Problem-SolversControlNeXt: Powerful and Efficient Control for Image and Video GenerationCogVideoX: Text-to-Video Diffusion Models with An Expert TransformerFruitNeRF: A Unified Neural Radiance Fiel…
…
continue reading
1
Multimodal Benchmarks, Visual Task Transfer, and 3D Object Generation
14:15
14:15
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
14:15
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language ModelsLLaVA-OneVision: Easy Visual Task TransferAn Object is Worth 64x64 Pixels: Generating 3D Object via Image DiffusionMedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for MedicineIPAdapter-Instruct: Resolving Ambiguity in Image-based Co…
…
continue reading
1
Image and Video Segmentation with SAM 2, Gemma 2 for Efficient Language Models, Boosting Small Models with Contrastive Fine-Tuning, and MM-Vet v2 Challenges Large Multimodal Models
13:40
13:40
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
13:40
SAM 2: Segment Anything in Images and VideosGemma 2: Improving Open Language Models at a Practical SizeCoarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language ModelImproving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuningOmniParser for Pure Vision Based GUI AgentSF3D: Stable Fast 3D Mesh Reconstructi…
…
continue reading
1
Text-Guided Image Inpainting, AMEX for Mobile GUI Agents, AgentScope's Multi-Agent Simulation
14:29
14:29
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
14:29
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion ModelLAMBDA: A Large Model Based Data AgentAMEX: Android Multi-annotation Expo Dataset for Mobile GUI AgentsBetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth EstimationVery Large-Scale Multi-Agent Simulation in AgentScopeData Mixture Inference: What do BPE Tok…
…
continue reading
1
OpenDevin & AI Software Development, Enhancing Visual Language Models, , DDK: Refining Large Language Model Efficiency through Domain Knowledge
13:45
13:45
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
13:45
OpenDevin: An Open Platform for AI Software Developers as Generalist AgentsVILA^2: VILA Augmented VILAHumanVid: Demystifying Training Data for Camera-controllable Human Image AnimationPERSONA: A Reproducible Testbed for Pluralistic AlignmentSV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View ConsistencyScalify: scale propagation for…
…
continue reading
1
Vocabulary Expansion for Large Models, Big Data Enhancing LMs, 4D Reconstruction Progress, AI Cityscape Generation, DPO Policy Analysis, Expanding Code Models, Multimodal LM Trust Evaluation
14:55
14:55
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
14:55
Scaling Laws with Vocabulary: Larger Models Deserve Larger VocabulariesScaling Retrieval-Based Language Models with a Trillion-Token DatastoreShape of Motion: 4D Reconstruction from a Single VideoStreetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video DiffusionUnderstanding Reference Policies in Direct Preference Opti…
…
continue reading
1
Qwen2 Language Model, Mitigating Privacy Risks in LLMs, Exploring Non-Determinism, Increased Efficiency with Q-Sparse, GRUtopia for Embodied AI
10:38
10:38
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:38
Qwen2 Technical ReportLearning to Refuse: Towards Mitigating Privacy Risks in LLMsThe Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-DeterminismQ-Sparse: All Large Language Models can be Fully Sparsely-ActivatedGRUtopia: Dream General Robots in a City at Scale
…
continue reading
1
Skywork-Math's Reasoning, Video Diffusion Model Innovations, Multimodal Learning, Q-GaLore's Memory Efficiency, MAVIS: Visual Math Instruction
12:11
12:11
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
12:11
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes OnVideo Diffusion Alignment via Reward GradientsMultimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language ModelQ-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank GradientsMAVIS: Math…
…
continue reading
1
Beyond Encoders in Vision-Language Models, Revolutionizing Human-LLM Interaction, and Advancing Knowledge Graphs
12:05
12:05
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
12:05
Unveiling Encoder-Free Vision-Language ModelsFunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMsAriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsRULE: Reliable Multimodal RAG for Factuality in Medical Vision Language ModelsChartGemma: Visual Instruction-…
…
continue reading
1
Diffusion Forcing to Expert Tuning, Structured Planning, Vision-Language Models, and Tabular ML Benchmarks
11:34
11:34
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:34
Diffusion Forcing: Next-token Prediction Meets Full-Sequence DiffusionLet the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language ModelsPlanetarium: A Rigorous Benchmark for Translating Text to Structured Planning LanguagesInternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Co…
…
continue reading
1
Advancing AI's Mathematical Reasoning: WE-MATH, ROS-LLM Framework, Autoregressive Image Generation
10:36
10:36
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:36
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoningMMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient EvaluationLiteSearch: Efficacious Tree Search for LLMWavelets Are All You Need for Autoregressive Image…
…
continue reading
Your go-to podcast for everything Sims related, including The Sims 4, The Sims Mobile and The Sims FreePlay! Plus, other simulation games such as Life By You and Paralives! In this episode we dig into the Lovestruck world of The Sims 4 with a new expansion pack announcement and talk about Life By You being cancelled.…
…
continue reading
1
Persona-Driven Data Synthesis, Enhancing Medical MLLMs, Robot Learning, Knowledge Distillation in LLMs, Text to 3D Gaussian Revolution
11:24
11:24
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:24
Scaling Synthetic Data Creation with 1,000,000,000 PersonasHuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at ScaleLLaRA: Supercharging Robot Learning Data for Vision-Language PolicyDirect Preference Knowledge Distillation for Large Language ModelsGaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enh…
…
continue reading
1
OMG-LLaVA: Unifying Vision and Language Understanding, Step-DPO for LLMs Mathematical Reasoning, MUMU's Multimodal Image Generation
12:15
12:15
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
12:15
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and UnderstandingStep-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMsMUMU: Bootstrapping Multimodal Image Generation from Text-to-Image DataSimulating Classroom Education with LLM-Empowered AgentsSeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval …
…
continue reading
1
FineWeb Datasets, YouDream's 3D Animals, PDE-Solving Breakthrough, Noise-Conditioned Perception Alignment, Language Models' Continual Learning
11:02
11:02
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:02
The FineWeb Datasets: Decanting the Web for the Finest Text Data at ScaleYouDream: Generating Anatomically Controllable Consistent Text-to-3D AnimalsDiffusionPDE: Generative PDE-Solving Under Partial ObservationAligning Diffusion Models with Noise-Conditioned PerceptionUnlocking Continual Learning Abilities in Language Models…
…
continue reading
1
BigCodeBench Challenges, Cambrian-1 Leap, D-MERIT's Evaluation, Long Context Breakthrough in Vision
11:06
11:06
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:06
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationBigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex InstructionsCambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMsEvaluating D-MERIT of Partial-annotation on Information RetrievalLong Context Transfer from Language to Vision…
…
continue reading
1
LongRAG Breakthrough, LLMs as Judges, Transformer Memory Insights, Video Library AI, Democratizing Art Styles
10:14
10:14
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:14
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMsJudging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-JudgesComplexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a TaskTowards Retrieval Augmented Generation over Large Video LibrariesStylebreeder: Exploring …
…
continue reading
1
But do I HAVE to set a goal for my art practice?
27:55
27:55
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
27:55
If you're an artist who has given up on setting goals for your art practice because they never seem to turn out - this episode is for you! I know a lot of artists who are resistant to setting goals. I get it. It seems impossible without sacrificing your creative process. Setting goals is not just about achieving them, it's about who you become in t…
…
continue reading
1
Scaling In-Context Reinforcement Learning, ChartMimic's AI Benchmark, Multimodal Document Comprehension, Long Context Reasoning Challenges
10:36
10:36
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:36
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement LearningMake It Count: Text-to-Image Generation with an Accurate Number of ObjectsChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code GenerationNeedle In A Multimodal HaystackBABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Hay…
…
continue reading
1
Revolutionizing Vision and Language Models: Depth Prediction Breakthroughs, Pixel-Level Transformers, and Robotic Skill Learning
13:20
13:20
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
13:20
Depth Anything V2An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual PixelsTransformers meet Neural Algorithmic ReasonersSamba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingOpenVLA: An Open-Source Vision-Language-Action ModelAlleviating Distortion in Image Generation via Multi-Resolut…
…
continue reading
1
NaRCan Revolutionizes Video Editing, Training-Free Video Generation, Recaptioning Web Images with LLaMA-3, Novel Data Synthesis Approach, Smartphone LLM Inference
11:33
11:33
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:33
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video EditingMotionClone: Training-Free Motion Cloning for Controllable Video GenerationWhat If We Recaption Billions of Web Images with LLaMA-3?Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with NothingPowerInfer-2: Fast Large Language Model I…
…
continue reading
1
Revolutionizing Image Synthesis with TiTok, Multilingual Code Benchmark, Exploring GenAI Prompting Techniques,
10:53
10:53
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:53
An Image is Worth 32 Tokens for Reconstruction and GenerationMcEval: Massively Multilingual Code EvaluationZero-shot Image Editing with Reference ImitationThe Prompt Report: A Systematic Survey of Prompting TechniquesTextGrad: Automatic "Differentiation" via Text
…
continue reading
1
LlamaGen's Image Revolution, Husky: The Multi-Step Reasoner, Vript's Video Breakthrough, VALL-E 2 Achieves Human Parity
10:46
10:46
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:46
Autoregressive Model Beats Diffusion: Llama for Scalable Image GenerationHusky: A Unified, Open-Source Language Agent for Multi-Step ReasoningVript: A Video Is Worth Thousands of WordsLighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View SynthesisVALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text …
…
continue reading
1
Mixture-of-Agents, Benchmarking LLMs, and GenAI Arena Evaluation
11:06
11:06
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:06
Mixture-of-Agents Enhances Large Language Model CapabilitiesWildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the WildCRAG -- Comprehensive RAG BenchmarkGenAI Arena: An Open Evaluation Platform for Generative ModelsLarge Language Model Confidence Estimation via Black-Box Access
…
continue reading
1
Enhancing AI Video and Image Generation, BitsFusion Quantization, Step-aware Optimization, Thought-Augmented Reasoning, and Single Forward Video Generation
11:39
11:39
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:39
ShareGPT4Video: Improving Video Understanding and Generation with Better CaptionsBitsFusion: 1.99 bits Weight Quantization of Diffusion ModelStep-aware Preference Optimization: Aligning Preference with Denoising Performance at Each StepBuffer of Thoughts: Thought-Augmented Reasoning with Large Language ModelsSF-V: Single Forward Video Generation Mo…
…
continue reading
1
AI Papers Podcast Special Edition: Apple Intelligence & Ferret-UI
1:52
1:52
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
1:52
Apple announced new Siri features and Apple Intelligence today, Interestingly, Apple already released a paper, titled "Ferret-UI," on how it all works - a multimodal vision-language model capable of understanding widgets, icons, and text on an iOS mobile screen, and reasoning about their spatial relationships and functional meanings. https://arxiv.…
…
continue reading
1
Block Transformers: Faster Inference, Mobile Device AI Agents, 3D-Image Generation, Low Latency TTS
10:41
10:41
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:41
Block Transformer: Global-to-Local Language Modeling for Fast InferenceParrot: Multilingual Visual Instruction TuningMobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationOuroboros3D: Image-to-3D Generation via 3D-aware Recursive DiffusionLiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autore…
…
continue reading
1
Seed-TTS, Decoding LLMs, Innovations in Text-to-Video, Self-Improving AI Preferences, and Refining Diffusion Models
11:10
11:10
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:10
Seed-TTS: A Family of High-Quality Versatile Speech Generation ModelsTo Believe or Not to Believe Your LLMI4VGen: Image as Stepping Stone for Text-to-Video GenerationSelf-Improving Robust Preference OptimizationGuiding a Diffusion Model with a Bad Version of Itself
…
continue reading
1
MMLU-Pro: Next-Level Language Understanding, Tailored LLMs, High FPS Video Generation Innovation
11:30
11:30
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
11:30
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding BenchmarkLearning Temporally Consistent Video Depth from Video Diffusion PriorsShow, Don't Tell: Aligning Language Models with Demonstrated FeedbackArtificial Generational Intelligence: Cultural Accumulation in Reinforcement LearningZeroSmooth: Training-free Diffuser Adaptati…
…
continue reading
1
Transformers and State-Space Models Unite, Multi-modal LLM Benchmark, Perplexity in Data Pruning, Advancing 4D Content Generation
10:23
10:23
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:23
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space DualityVideo-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video AnalysisPerplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference ModelsKaleido Diffusion: Improving Conditional Diffusion Models with Au…
…
continue reading
1
DITTO-2 Speeds Up Music AI, GECO's Quick 3D Generation, PLA4D's 4D Advances, DevEval's Real-World Code Benchmark, Parrot's LLM Application Efficiency
10:47
10:47
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:47
AI Papers Podcast for 06/04/2024 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music GenerationGECO: Generative Image-to-3D within a SECOndPLA4D: Pixel-Level Alignments for Text-to-4D Gaussian SplattingDevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code RepositoriesParrot: Efficient Serving of LLM-b…
…
continue reading
1
Boosting Text Retrieval with CLIP Models, Rethinking Retrieval Augmented Generation, and Deciphering Human Behavior through MotionLLM
10:42
10:42
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
10:42
AI Papers Podcast for 06/03/2024 Jina CLIP: Your CLIP Model Is Also Your Text RetrieverSimilarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered ThoughtsMotionLLM: Understanding Human Behaviors from Human Motions and VideosXwin-LM: Strong and Scalable Alignment Practice for LLMsMOFA-Video: Controllable Image Animati…
…
continue reading
1
Bilingual LLM Transparency, T2V-Turbo's Video Generation, LLMs Surpassing Human Theory of Mind Performance, Advancements in LLM Attribution
8:47
8:47
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
8:47
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model SeriesT2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward FeedbackLLMs achieve adult human performance on higher-order theory of mind tasksNearest Neighbor Speculative Decoding for LLM Generation and AttributionZipper: A Multi-Tower Decoder Ar…
…
continue reading
1
Phased Consistency Model, 2-Stage Backpropagation, and the Future of 4D World Reconstruction
8:09
8:09
التشغيل لاحقا
التشغيل لاحقا
قوائم
إعجاب
احب
8:09
Phased Consistency Model2BP: 2-Stage BackpropagationGFlow: Recovering 4D World from Monocular VideoInstruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction TuningLLaMA-NAS: Efficient Neural Architecture Search for Large Language Models
…
continue reading