
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 16 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/bandwidth/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth.en</guid>
    <title>The Memory Wall and HBM — The Real Bottleneck That Divides AI Performance</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth.en</link>
    <description>In an era where compute is cheap and data movement is expensive, the real bottleneck of AI performance is memory. From the memory-wall concept to HBM generations, the roofline model and arithmetic intensity, the KV cache, and how quantization saves bandwidth, all from a developer view.</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>memory-wall</category><category>hbm</category><category>bandwidth</category><category>roofline</category><category>inference</category><category>ai-hardware</category><category>quantization</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth.ja</guid>
    <title>メモリウォールとHBM — AI性能を分ける本当のボトルネック</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth.ja</link>
    <description>演算が安くなりデータ移動が高くなった時代、AI性能の本当のボトルネックはメモリです。メモリウォールの概念からHBM世代、rooflineモデルと算術強度、KVキャッシュ、量子化による帯域幅削減まで開発者目線で整理します。</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>memory-wall</category><category>hbm</category><category>bandwidth</category><category>roofline</category><category>inference</category><category>ai-hardware</category><category>quantization</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth</guid>
    <title>메모리 월과 HBM — AI 성능을 가르는 진짜 병목</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-memory-wall-hbm-bandwidth</link>
    <description>연산은 싸지고 데이터 이동은 비싸진 시대, AI 성능의 진짜 병목은 메모리입니다. 메모리 월 개념부터 HBM 세대, roofline 모델과 산술 강도, KV 캐시, 양자화로 대역폭을 절감하는 법까지 개발자 관점에서 정리합니다.</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>memory-wall</category><category>hbm</category><category>bandwidth</category><category>roofline</category><category>inference</category><category>ai-hardware</category><category>quantization</category>
  </item>

    </channel>
  </rss>
