
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 17 Mar 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/gpu-programming/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-03-17-cuda-gpu-programming-advanced-guide</guid>
    <title>CUDA GPU 프로그래밍 심화: Warp 최적화, Tensor Core, Triton 커널 작성까지</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-03-17-cuda-gpu-programming-advanced-guide</link>
    <description>CUDA 메모리 계층, Warp 최적화, Tensor Core WMMA API, Flash Attention 구현, Triton 커스텀 커널 작성까지 AI 모델 학습 가속화를 위한 GPU 프로그래밍 심화 가이드입니다.</description>
    <pubDate>Tue, 17 Mar 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>cuda</category><category>gpu-programming</category><category>tensorcore</category><category>triton</category><category>flash-attention</category><category>nccl</category><category>2026-03</category>
  </item>

    </channel>
  </rss>
