Chaos and Order

https://www.youngju.dev/blog
Slowly and correctly. AI Researcher & DevOps Engineer Youngju's tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.
ko
fjvbn2003@gmail.com (Youngju Kim)
Sat, 16 May 2026 00:00:00 GMT

Top LLM Papers 2024-2026 - Llama, DeepSeek, Qwen, Mistral, Phi, RLHF, DPO, CoT, RAG, FlashAttention, vLLM Reading List
https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.en
A curated reading list of 30+ must-read LLM papers for engineers building with LLMs in 2024-2026. Covers foundation models (Llama 3/4, DeepSeek-V3/R1, Qwen3, Mistral, Phi-4, Gemma 3), training innovations (MoE, MLA, GQA), post-training (RLHF, DPO, ORPO, KTO), reasoning (CoT, ToT, GRPO), agents (ReAct, SWE-Agent), retrieval (RAG, GraphRAG, ColBERT), efficiency (FlashAttention 1/2/3, vLLM PagedAttention, SGLang), evaluation (MMLU, GSM8K, SWE-Bench, OSWorld), safety, and Korean and Japanese models. Each paper is paired with its arXiv ID and a one-paragraph explanation of why it matters.
Sat, 16 May 2026 00:00:00 GMT
fjvbn2003@gmail.com (Youngju Kim)
llm, papers, llama, deepseek, qwen, mistral, phi, rlhf, dpo, chain-of-thought, rag, flashattention, vllm, foundation-models, mixture-of-experts

LLM Paper Curation 2024-2026 - Llama · DeepSeek · Qwen · Mistral · Phi · RLHF · DPO · CoT · RAG · FlashAttention · vLLM In-Depth Guide
https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.ja
A curated list of 30+ must-read papers from 2024-2026 for engineers who build and operate LLMs. Covers foundation models (Llama 3/4, DeepSeek-V3/R1, Qwen3, Mistral, Phi-4, Gemma 3), training innovations (MoE, MLA, GQA), post-training (RLHF, DPO, ORPO, KTO), reasoning (CoT, ToT, GRPO), agents (ReAct, SWE-Agent), retrieval (RAG, GraphRAG, ColBERT), efficiency (FlashAttention 1/2/3, vLLM PagedAttention, SGLang), evaluation (MMLU, GSM8K, SWE-Bench, OSWorld), safety, and Korean and Japanese models, with each paper's arXiv ID and a one-paragraph summary of why it matters.
Sat, 16 May 2026 00:00:00 GMT
fjvbn2003@gmail.com (Youngju Kim)
llm, papers, llama, deepseek, qwen, mistral, phi, rlhf, dpo, chain-of-thought, rag, flashattention, vllm, foundation-models, mixture-of-experts

LLM Paper Curation 2024-2026 - Llama · DeepSeek · Qwen · Mistral · Phi · RLHF · DPO · CoT · RAG · FlashAttention · vLLM In-Depth Guide
https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive
A curated list of 30+ must-read papers from 2024-2026 for engineers who build and operate LLMs. Spans foundation models (Llama 3/4, DeepSeek-V3/R1, Qwen3, Mistral, Phi-4, Gemma 3), training innovations (MoE, MLA, GQA), post-training (RLHF, DPO, ORPO, KTO), reasoning (CoT, ToT, GRPO), agents (ReAct, SWE-Agent), retrieval (RAG, GraphRAG, ColBERT), efficiency (FlashAttention 1/2/3, vLLM PagedAttention, SGLang), evaluation (MMLU, GSM8K, SWE-Bench, OSWorld), safety, and Korean and Japanese models, summarizing each paper's arXiv ID and "why it matters" in one paragraph.
Sat, 16 May 2026 00:00:00 GMT
fjvbn2003@gmail.com (Youngju Kim)
llm, papers, llama, deepseek, qwen, mistral, phi, rlhf, dpo, chain-of-thought, rag, flashattention, vllm, foundation-models, mixture-of-experts