Projects
Reading presentation archive.
I am grateful to Prof. Hyunwoo Kim for recommending papers and providing valuable feedback on my presentations.
paper review talks
-
Illusion of Thinking and ORCA
Dec 29, 2025
Review and discussion on reasoning illusions and ORCA-style approaches.- Paper 1: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
- Paper 2: Orca: Progressive Learning from Complex Explanation Traces of GPT-4
-
DeepSeek R1 and DAPO
Jan 12, 2026
Technical review on DeepSeek-R1 and DAPO objectives.- Paper 1: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Paper 2: DAPO: An Open-Source LLM Reinforcement Learning System at Scale
- Paper 3: GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
-
OLMo1 to OLMo3
Jan 26, 2026
Architecture and training evolution review across OLMo generations.- Paper 1: OLmo: Accelerating the Science of Language Models
- Paper 2: Olmo3
-
Dive into RL
Feb 9, 2026
Reinforcement learning fundamentals and practical insights.- Paper 1: ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
- Paper 2: Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
- Paper 3: JustRL: Scaling a 1.5B LLM with a Simple RL Recipe