- Published on
This post explores how DeepSeek R1 improves upon DeepSeek R1 Zero by addressing key challenges through strategic fine-tuning and reinforcement learning.
I successfully completed both Bachelor's and Master's degrees in Computer Engineering at the Georgia Institute of Technology.
I work fulltime as a ML/Backend Engineer at Earthmera Co.
I built this blog to further study front-end development and share my thoughts on various topics, as well as what I've learned from work, study, and my projects.