Gemini-deep-think

Published on
2026년 5월 14일
추론 모델(reasoning models) 2026 가이드 — o3·o4·DeepSeek R1·Claude Thinking·Gemini Deep Think·QwQ 심층 비교
reasoning-models o3 o4 deepseek-r1 claude-thinking gemini-deep-think qwq rlvr test-time-compute llm
o1이 2024년 9월에 test-time compute라는 새로운 축을 열고 1년 반이 지났다. 2026년 현재 '추론 모델(reasoning model)'은 별도의 모델군이 아니라, 모든 프론티어 모델의 한 상태(mode)가 됐다. OpenAI o3·o3-pro·o4, DeepSeek R1·R1-0528·V3.1 reasoner, Anthropic Claude Sonnet 4.5·Opus 4.5의 extended thinking, Google Gemini 2.5 Pro·Deep Think, Alibaba Qwen QwQ·QwQ-Plus, xAI Grok 3·4 Heavy thinking — 여섯 가족의 추론 모드를 thinking budget·AIME·SWE-bench·도구 사용·가격까지 한눈에 정리한다. RLVR(verifiable rewards) 레시피, 추론 모델이 진짜로 필요한 순간, 그리고 빠른 비추론 모델이 더 나은 순간.

추론 모델(reasoning models) 2026 가이드 — o3·o4·DeepSeek R1·Claude Thinking·Gemini Deep Think·QwQ 심층 비교