
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 30 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/neural-codec/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation.en</guid>
    <title>SOTA Music and Audio Generation — Neural Codecs and Generative Models</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation.en</link>
    <description>A lineage-focused overview from audio representations (waveform, spectrogram, neural codec) to autoregressive audio language models, diffusion-based audio, and text-to-music conditioning. We analyze the principles of the EnCodec, MusicGen, and AudioLM families, along with commercial models and evaluation and copyright issues, from an architectural perspective.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>audio-generation</category><category>music-generation</category><category>neural-codec</category><category>audio-language-model</category><category>generative-ai</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation.ja</guid>
    <title>SOTA音楽・オーディオ生成の分析 — ニューラルコーデックと生成モデル</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation.ja</link>
    <description>オーディオ表現(波形・スペクトログラム・ニューラルコーデック)から自己回帰オーディオ言語モデル、拡散ベースのオーディオ、テキスト音楽条件付けまで、系譜を中心に整理します。EnCodec、MusicGen、AudioLM系列の原理と商用モデル、評価・著作権の論点をアーキテクチャの観点から分析します。</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>audio-generation</category><category>music-generation</category><category>neural-codec</category><category>audio-language-model</category><category>generative-ai</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation</guid>
    <title>SOTA 음악·오디오 생성 분석 — 뉴럴 코덱과 생성 모델</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-music-audio-generation</link>
    <description>오디오 표현(파형·스펙트로그램·뉴럴 코덱)부터 오토리그레시브 오디오 언어모델과 확산 기반 오디오, 텍스트-음악 조건화까지 계보 중심으로 정리합니다. EnCodec, MusicGen, AudioLM 계열의 원리와 상용 모델, 평가·저작권 쟁점을 아키텍처 관점에서 분석합니다.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>audio-generation</category><category>music-generation</category><category>neural-codec</category><category>audio-language-model</category><category>generative-ai</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.en</guid>
    <title>SOTA Speech Recognition and Synthesis — From Whisper to Codec Language Models</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.en</link>
    <description>We survey recent trends in speech recognition (ASR) and synthesis (TTS). From HMM to CTC/attention, Whisper large-scale weak supervision, and from Tacotron to neural vocoders and codec language models, we trace the lineage and architectural principles.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.ja</guid>
    <title>SOTA音声認識・合成分析 — Whisperからコーデック言語モデルまで</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.ja</link>
    <description>音声認識(ASR)と音声合成(TTS)の最新の流れを整理します。HMMからCTC/アテンション、Whisperの大規模弱教師あり学習、そしてTacotronからニューラルボコーダとコーデック言語モデルまで、系譜とアーキテクチャの原理を見ていきます。</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts</guid>
    <title>SOTA 음성 인식·합성 분석 — Whisper에서 코덱 언어모델까지</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts</link>
    <description>음성 인식(ASR)과 음성 합성(TTS)의 최신 흐름을 정리합니다. HMM에서 CTC/어텐션, Whisper의 대규모 약지도 학습, 그리고 Tacotron에서 뉴럴 보코더와 코덱 언어모델까지 계보와 아키텍처 원리를 살펴봅니다.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

    </channel>
  </rss>
