Projects

Reading presentation archive.

I am grateful to Prof. Hyunwoo Kim for recommending papers and providing valuable feedback on my presentations.

paper review talks

Illusion of Thinking and ORCA
Dec 29, 2025
Review and discussion on reasoning illusions and ORCA-style approaches.
- Paper 1: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
- Paper 2: Orca: Progressive Learning from Complex Explanation Traces of GPT-4
DeepSeek R1 and DAPO
Jan 12, 2026
Technical review on DeepSeek-R1 and DAPO objectives.
- Paper 1: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Paper 2: DAPO: An Open-Source LLM Reinforcement Learning System at Scale
- Paper 3: GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
OLMo1 to OLMo3
Jan 26, 2026
Architecture and training evolution review across OLMo generations.
- Paper 1: OLmo: Accelerating the Science of Language Models
- Paper 2: Olmo3
Dive into RL
Feb 9, 2026
Reinforcement learning fundamentals and practical insights.
- Paper 1: ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
- Paper 2: Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
- Paper 3: JustRL: Scaling a 1.5B LLM with a Simple RL Recipe