
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Sat, 16 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/mediapipe/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive.en</guid>
    <title>Computer Vision Frameworks 2026 - OpenCV 4, MediaPipe, Detectron2, YOLO v11, MMDetection, SAM 2, Grounding DINO Deep Dive</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive.en</link>
    <description>The 2026 computer vision stack is no longer about &quot;touching pixels&quot;. OpenCV 4.10 has made ONNX inference table stakes, MediaPipe Studio reduces mobile real-time pipelines to one line, YOLO v11 bundles NAS, segmentation, and pose estimation into a single model from Ultralytics, SAM 2 tracks video masks in real time, and Grounding DINO 1.6 plus Florence-2 made &quot;drawing boxes from text&quot; the standard for open-vocabulary detection. This article walks through the 2026 CV stack end-to-end - OpenCV, MediaPipe, Detectron3, the YOLO family, MMDetection, SAM 2, Grounding DINO, Florence-2, YOLO-World, VLMs (GPT-4o, Claude 3.5, Gemini 2.0, Qwen2-VL, InternVL 2.5), 3D vision (DUSt3R, MASt3R, VGGT), Depth Anything v3, DINOv3, and embedded inference (ONNX Runtime, TensorRT, OpenVINO, CoreML) in one breath.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>computer-vision</category><category>opencv</category><category>mediapipe</category><category>detectron2</category><category>yolo</category><category>mmdetection</category><category>sam</category><category>grounding-dino</category><category>vlm</category><category>segmentation</category><category>english</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive.ja</guid>
    <title>コンピュータビジョン・フレームワーク2026完全ガイド - OpenCV 4・MediaPipe・Detectron2・YOLO v11・MMDetection・SAM 2・Grounding DINO徹底解説</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive.ja</link>
    <description>2026年のコンピュータビジョン・スタックはもはや「ピクセルを触る仕事」ではない。OpenCV 4.10はONNX推論を基本機能として取り込み、MediaPipe Studioはモバイル実時間パイプラインを一行に縮め、YOLO v11はUltralyticsからNAS・セグメンテーション・姿勢推定までを一つのモデルに束ね、SAM 2は動画マスクを実時間で追跡し、Grounding DINO 1.6とFlorence-2は「テキストから箱を描く」オープン語彙検出を標準にした。本記事はOpenCV・MediaPipe・Detectron3・YOLO系・MMDetection・SAM 2・Grounding DINO・Florence-2・YOLO-World・VLM（GPT-4o・Claude 3.5・Gemini 2.0・Qwen2-VL・InternVL 2.5）・3Dビジョン（DUSt3R・MASt3R・VGGT）・Depth Anything v3・DINOv3・組込推論（ONNX Runtime・TensorRT・OpenVINO・CoreML）まで、2026年CVスタック全体を一息で整理する。</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>computer-vision</category><category>opencv</category><category>mediapipe</category><category>detectron2</category><category>yolo</category><category>mmdetection</category><category>sam</category><category>grounding-dino</category><category>vlm</category><category>segmentation</category><category>日本語</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive</guid>
    <title>컴퓨터 비전 프레임워크 2026 완벽 가이드 - OpenCV 4 · MediaPipe · Detectron2 · YOLO v11 · MMDetection · SAM 2 · Grounding DINO 심층 분석</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-computer-vision-frameworks-2026-opencv-4-mediapipe-detectron2-yolo-v11-mmdetection-sam-2-grounding-dino-deep-dive</link>
    <description>2026년의 컴퓨터 비전 스택은 더 이상 &quot;픽셀을 만지는 일&quot;이 아니다. OpenCV 4.10이 ONNX 추론을 기본기로 받아들이고, MediaPipe Studio가 모바일 실시간 파이프라인을 한 줄로 줄이고, YOLO v11이 Ultralytics에서 NAS·세그멘테이션·자세 추정까지 한 모델에 묶고, SAM 2가 비디오 마스크를 실시간으로 추적하고, Grounding DINO 1.6과 Florence-2가 &quot;텍스트로 박스를 그리는&quot; 오픈-보캐브 검출을 표준으로 만들었다. 이 글은 OpenCV·MediaPipe·Detectron3·YOLO 계열·MMDetection·SAM 2·Grounding DINO·Florence-2·YOLO-World·VLM(GPT-4o·Claude 3.5·Gemini 2.0·Qwen2-VL·InternVL 2.5)·3D 비전(DUSt3R·MASt3R·VGGT)·Depth Anything v3·DINOv3·임베디드 추론(ONNX Runtime·TensorRT·OpenVINO·CoreML)까지 2026년 컴퓨터 비전 스택 전체를 한 호흡으로 정리한다.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>computer-vision</category><category>opencv</category><category>mediapipe</category><category>detectron2</category><category>yolo</category><category>mmdetection</category><category>sam</category><category>grounding-dino</category><category>vlm</category><category>segmentation</category>
  </item>

    </channel>
  </rss>
