
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Fri, 26 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/token-pruning/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion.en</guid>
    <title>Multimodal Tokenization and Fusion — Turning Images and Audio Into Tokens</title>
    <link>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion.en</link>
    <description>A deep look at how images, audio, and video become tokens and get woven into one sequence with text. We cover patch and VQ image tokenization, discrete-codec audio tokenization, frame sampling, interleaving and separator tokens, token explosion and compression, and context cost in multimodal LLMs.</description>
    <pubDate>Fri, 26 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>multimodal</category><category>tokenization</category><category>vision-language</category><category>q-former</category><category>token-pruning</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion.ja</guid>
    <title>マルチモーダルのトークナイゼーションと融合 — 画像・音声をトークンへ</title>
    <link>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion.ja</link>
    <description>画像・音声・動画をどうトークンへ変え、テキストと一つの列へ編むのかを整理します。パッチおよびVQベースの画像トークン化、離散コーデックによる音声トークン化、フレームサンプリング、インターリービングと区切りトークン、トークン爆発と圧縮、コンテキストコストまで、マルチモーダルLLMの入力構成を深く扱います。</description>
    <pubDate>Fri, 26 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>multimodal</category><category>tokenization</category><category>vision-language</category><category>q-former</category><category>token-pruning</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion</guid>
    <title>멀티모달 토크나이제이션과 융합 — 이미지·오디오를 토큰으로</title>
    <link>https://www.youngju.dev/blog/llm/2026-06-26-multimodal-tokenization-and-fusion</link>
    <description>이미지와 오디오, 비디오를 어떻게 토큰으로 바꾸고 텍스트와 하나의 시퀀스로 엮는지 정리합니다. 패치·VQ 기반 이미지 토큰화, 이산 코덱 오디오 토큰화, 프레임 샘플링, 인터리빙과 구분 토큰, 토큰 폭증과 압축, 컨텍스트 비용까지 멀티모달 LLM의 입력 구성을 깊이 다룹니다.</description>
    <pubDate>Fri, 26 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>llm</category><category>multimodal</category><category>tokenization</category><category>vision-language</category><category>q-former</category><category>token-pruning</category>
  </item>

    </channel>
  </rss>
