
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Sat, 16 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/acme/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive.en</guid>
    <title>Modern Reinforcement Learning Ecosystems 2026 Deep Dive - RLlib (Anyscale) · Stable-Baselines3 · Tianshou · CleanRL · OpenSpiel (DeepMind) · Gymnasium (Farama) · Acme · PufferLib · Pearl (Meta) · TorchRL</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive.en</link>
    <description>A complete deep dive into the reinforcement learning ecosystem as of May 2026. We cover general-purpose RL libraries (RLlib, Stable-Baselines3, Tianshou, CleanRL, TorchRL, PFRL, TF-Agents), the DeepMind stack (OpenSpiel, Acme, Haiku/RLax/Distrax), Meta Pearl, environment standards (Gymnasium, PettingZoo, ALE, MuJoCo, Brax, Isaac Lab), LLM-RL with PPO/GRPO/TRL, and the algorithm landscape (PPO, GRPO, SAC, Decision Transformer, MuZero, Diffusion Policy). We also visit Korean and Japanese RL research from KAIST/SNU/NCSOFT/Krafton to Preferred Networks and Sony AI.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>reinforcement-learning</category><category>rllib</category><category>stable-baselines-3</category><category>tianshou</category><category>cleanrl</category><category>openspiel</category><category>gymnasium</category><category>acme</category><category>pufferlib</category><category>pearl-meta</category><category>torchrl</category><category>english</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive.ja</guid>
    <title>モダン強化学習エコシステム 2026 完全ガイド - RLlib (Anyscale) · Stable-Baselines3 · Tianshou · CleanRL · OpenSpiel (DeepMind) · Gymnasium (Farama) · Acme · PufferLib · Pearl (Meta) · TorchRL 徹底解説</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive.ja</link>
    <description>2026年5月時点の強化学習(RL)エコシステムを端から端まで見渡す。汎用RLライブラリ(RLlib, Stable-Baselines3, Tianshou, CleanRL, TorchRL, PFRL, TF-Agents)、DeepMindスタック(OpenSpiel, Acme, Haiku/RLax/Distrax)、Meta Pearl、環境標準(Gymnasium, PettingZoo, ALE, MuJoCo, Brax, Isaac Lab)、LLM-RLのPPO/GRPO/TRL、アルゴリズム地形(PPO, GRPO, SAC, Decision Transformer, MuZero, Diffusion Policy)を1本にまとめる。KAIST/SNU/NCSOFT/KraftonとPreferred Networks/Sony AIといった韓国・日本の研究も合わせて押さえる。</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>reinforcement-learning</category><category>rllib</category><category>stable-baselines-3</category><category>tianshou</category><category>cleanrl</category><category>openspiel</category><category>gymnasium</category><category>acme</category><category>pufferlib</category><category>pearl-meta</category><category>torchrl</category><category>日本語</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive</guid>
    <title>모던 강화학습 생태계 2026 완벽 가이드 - RLlib (Anyscale) · Stable-Baselines3 · Tianshou · CleanRL · OpenSpiel (DeepMind) · Gymnasium (Farama) · Acme · PufferLib · Pearl (Meta) · TorchRL 심층 분석</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-modern-reinforcement-learning-ecosystems-2026-rllib-stable-baselines-3-tianshou-cleanrl-openspiel-gymnasium-acme-pufferlib-deep-dive</link>
    <description>2026년 5월 기준 강화학습(RL) 생태계를 처음부터 끝까지 본다. 범용 RL 라이브러리(RLlib, Stable-Baselines3, Tianshou, CleanRL, TorchRL, PFRL, TF-Agents), DeepMind 스택(OpenSpiel, Acme, Haiku/RLax/Distrax), Meta Pearl, 환경 표준(Gymnasium, PettingZoo, ALE, MuJoCo, Brax, Isaac Lab), LLM-RL의 PPO/GRPO/TRL, 그리고 알고리즘 지형(PPO, GRPO, SAC, Decision Transformer, MuZero, Diffusion Policy)까지 한 글에서 정리한다. KAIST/SNU/NCSOFT/Krafton과 Preferred Networks/Sony AI 같은 한국·일본 연구도 함께 본다.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>reinforcement-learning</category><category>rllib</category><category>stable-baselines-3</category><category>tianshou</category><category>cleanrl</category><category>openspiel</category><category>gymnasium</category><category>acme</category><category>pufferlib</category><category>pearl-meta</category><category>torchrl</category>
  </item>

    </channel>
  </rss>
