Published on February 4, 2025
DeepDive into DeepSeek R1 – Part 2: Enhancing Performance with DeepSeek R1
Tags: LLM, DeepSeek, DeepSeek-R1, Paper-study
This post explores how DeepSeek R1 improves upon DeepSeek R1 Zero by addressing key challenges through strategic fine-tuning and reinforcement learning.
Published on February 2, 2025
DeepDive into DeepSeek R1 – Part 1: Exploring the R1 Zero
Tags: LLM, DeepSeek, DeepSeek-R1, Paper-study
A deep dive into DeepSeek R1 Zero, exploring its architecture, training methodology, and breakthrough achievements in reasoning capabilities.
Published on September 9, 2024
Paper study - Faithful Chain of Thought Reasoning
Tags: LLM, Prompt-Engineering, Paper-study
A summary of the prompt engineering paper Faithful Chain of Thought Reasoning by Lyu et al.
Published on September 5, 2024
Prompt engineering techniques
Tags: LLM, Prompt-Engineering
This post outlines key prompt engineering techniques to optimize LLM performance and improve model accuracy.