
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 17 Mar 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/llm-deployment/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/mlops/2026-03-17-ai-model-deployment-serving-guide</guid>
    <title>AI 모델 배포 &amp; 서빙 완전 가이드: Triton, vLLM, BentoML, Kubernetes까지</title>
    <link>https://www.youngju.dev/blog/mlops/2026-03-17-ai-model-deployment-serving-guide</link>
    <description>Docker GPU 컨테이너, Kubernetes HPA, NVIDIA Triton, vLLM LLM 서빙, BentoML, Ray Serve까지 AI 모델 프로덕션 배포 완전 가이드입니다.</description>
    <pubDate>Tue, 17 Mar 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>model-serving</category><category>triton</category><category>vllm</category><category>bentoml</category><category>kubernetes</category><category>llm-deployment</category><category>2026-03</category>
  </item>

    </channel>
  </rss>
