Published on February 4, 2025
DeepDive into DeepSeek R1 – Part 2: Enhancing Performance with DeepSeek R1
LLM, DeepSeek, DeepSeek-R1, Paper-study
This post explores how DeepSeek R1 improves upon DeepSeek R1 Zero by addressing key challenges through strategic fine-tuning and reinforcement learning.

Published on February 2, 2025
DeepDive into DeepSeek R1 – Part 1: Exploring the R1 Zero
LLM, DeepSeek, DeepSeek-R1, Paper-study
A deep dive into DeepSeek R1 Zero, exploring its architecture, training methodology, and breakthrough achievements in reasoning capabilities.