
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 16 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/accelerators/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators.en</guid>
    <title>Edge AI and the NPU — On-Device Inference Accelerators</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators.en</link>
    <description>We lay out why edge AI runs inference on the device instead of the cloud (latency, privacy, cost) and the concept of the NPU that makes it possible. From Apple Neural Engine, Qualcomm, Edge TPU, and ARM Ethos to model compression, on-device LLMs, runtimes (TFLite/ONNX/CoreML), and cloud-edge hybrids, this is a developer starting guide.</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>edge-ai</category><category>npu</category><category>on-device</category><category>inference</category><category>quantization</category><category>mobile</category><category>accelerators</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators.ja</guid>
    <title>エッジ AI と NPU — オンデバイス推論アクセラレータ</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators.ja</link>
    <description>推論をクラウドではなく機器で直接回すエッジ AI の理由(遅延・プライバシー・コスト)と、それを可能にする NPU の概念を整理します。Apple Neural Engine、Qualcomm、Edge TPU、ARM Ethos からモデル軽量化、オンデバイス LLM、ランタイム(TFLite/ONNX/CoreML)、クラウド-エッジのハイブリッドまで、開発者の始め方ガイドを収めました。</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>edge-ai</category><category>npu</category><category>on-device</category><category>inference</category><category>quantization</category><category>mobile</category><category>accelerators</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators</guid>
    <title>엣지 AI와 NPU — 온디바이스 추론 가속기</title>
    <link>https://www.youngju.dev/blog/gpu-cuda/2026-06-16-edge-ai-npu-accelerators</link>
    <description>추론을 클라우드가 아니라 기기에서 직접 돌리는 엣지 AI의 이유(지연·프라이버시·비용)와, 이를 가능케 하는 NPU의 개념을 정리합니다. Apple Neural Engine, Qualcomm, Edge TPU, ARM Ethos부터 모델 경량화, 온디바이스 LLM, 런타임(TFLite/ONNX/CoreML), 클라우드-엣지 하이브리드까지 개발자 시작 가이드를 담았습니다.</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>edge-ai</category><category>npu</category><category>on-device</category><category>inference</category><category>quantization</category><category>mobile</category><category>accelerators</category>
  </item>

    </channel>
  </rss>
