
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Sat, 16 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/phi/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.en</guid>
    <title>Top LLM Papers 2024-2026 - Llama, DeepSeek, Qwen, Mistral, Phi, RLHF, DPO, CoT, RAG, FlashAttention, vLLM Reading List</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.en</link>
    <description>A curated reading list of 30+ must-read LLM papers for engineers building with LLMs in 2024-2026. Covers foundation models (Llama 3/4, DeepSeek-V3/R1, Qwen3, Mistral, Phi-4, Gemma 3), training innovations (MoE, MLA, GQA), post-training (RLHF, DPO, ORPO, KTO), reasoning (CoT, ToT, GRPO), agents (ReAct, SWE-Agent), retrieval (RAG, GraphRAG, ColBERT), efficiency (FlashAttention 1/2/3, vLLM PagedAttention, SGLang), evaluation (MMLU, GSM8K, SWE-Bench, OSWorld), safety, and Korean and Japanese models — each paper paired with its arXiv ID and a one-paragraph why-it-matters.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>papers</category><category>llama</category><category>deepseek</category><category>qwen</category><category>mistral</category><category>phi</category><category>rlhf</category><category>dpo</category><category>chain-of-thought</category><category>rag</category><category>flashattention</category><category>vllm</category><category>foundation-models</category><category>mixture-of-experts</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.ja</guid>
    <title>LLM論文キュレーション 2024-2026 - Llama・DeepSeek・Qwen・Mistral・Phi・RLHF・DPO・CoT・RAG・FlashAttention・vLLM 詳細ガイド</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive.ja</link>
    <description>LLMを構築し運用するエンジニアのための2024-2026必読論文30+本キュレーション。基盤モデル(Llama 3/4、DeepSeek-V3/R1、Qwen3、Mistral、Phi-4、Gemma 3)、学習革新(MoE、MLA、GQA)、ポストトレーニング(RLHF、DPO、ORPO、KTO)、推論(CoT、ToT、GRPO)、エージェント(ReAct、SWE-Agent)、検索(RAG、GraphRAG、ColBERT)、効率(FlashAttention 1/2/3、vLLM PagedAttention、SGLang)、評価(MMLU、GSM8K、SWE-Bench、OSWorld)、安全性、韓国・日本モデルまで — 各論文のarXiv IDと「なぜ重要か」を一段落で整理。</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>papers</category><category>llama</category><category>deepseek</category><category>qwen</category><category>mistral</category><category>phi</category><category>rlhf</category><category>dpo</category><category>chain-of-thought</category><category>rag</category><category>flashattention</category><category>vllm</category><category>foundation-models</category><category>mixture-of-experts</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive</guid>
    <title>LLM 논문 큐레이션 2024-2026 - Llama · DeepSeek · Qwen · Mistral · Phi · RLHF · DPO · CoT · RAG · FlashAttention · vLLM 심층 가이드</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-llm-papers-llama-deepseek-qwen-mistral-phi-rlhf-cot-rag-flashattention-vllm-2026-deep-dive</link>
    <description>LLM을 만들고 운영하는 엔지니어를 위한 2024-2026 필독 논문 30+편 큐레이션. 파운데이션 모델(Llama 3/4, DeepSeek-V3/R1, Qwen3, Mistral, Phi-4, Gemma 3), 학습 혁신(MoE, MLA, GQA), 포스트-트레이닝(RLHF, DPO, ORPO, KTO), 추론(CoT, ToT, GRPO), 에이전트(ReAct, SWE-Agent), 검색(RAG, GraphRAG, ColBERT), 효율(FlashAttention 1/2/3, vLLM PagedAttention, SGLang), 평가(MMLU, GSM8K, SWE-Bench, OSWorld), 안전성, 한국·일본 모델까지 — 각 논문의 arXiv ID와 &quot;왜 중요한지&quot;를 한 단락으로 정리.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>papers</category><category>llama</category><category>deepseek</category><category>qwen</category><category>mistral</category><category>phi</category><category>rlhf</category><category>dpo</category><category>chain-of-thought</category><category>rag</category><category>flashattention</category><category>vllm</category><category>foundation-models</category><category>mixture-of-experts</category>
  </item>

    </channel>
  </rss>
