
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>천천히 올바르게. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 30 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/text-to-speech/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.en</guid>
    <title>SOTA Speech Recognition and Synthesis — From Whisper to Codec Language Models</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.en</link>
    <description>We survey recent trends in speech recognition (ASR) and synthesis (TTS). From HMM to CTC/attention, Whisper large-scale weak supervision, and from Tacotron to neural vocoders and codec language models, we trace the lineage and architectural principles.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.ja</guid>
    <title>SOTA音声認識・合成分析 — Whisperからコーデック言語モデルまで</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts.ja</link>
    <description>音声認識(ASR)と音声合成(TTS)の最新の流れを整理します。HMMからCTC/アテンション、Whisperの大規模弱教師あり学習、そしてTacotronからニューラルボコーダとコーデック言語モデルまで、系譜とアーキテクチャの原理を見ていきます。</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts</guid>
    <title>SOTA 음성 인식·합성 분석 — Whisper에서 코덱 언어모델까지</title>
    <link>https://www.youngju.dev/blog/ai-papers/2026-06-30-sota-speech-recognition-tts</link>
    <description>음성 인식(ASR)과 음성 합성(TTS)의 최신 흐름을 정리합니다. HMM에서 CTC/어텐션, Whisper의 대규모 약지도 학습, 그리고 Tacotron에서 뉴럴 보코더와 코덱 언어모델까지 계보와 아키텍처 원리를 살펴봅니다.</description>
    <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>ai-papers</category><category>speech-recognition</category><category>text-to-speech</category><category>whisper</category><category>neural-codec</category><category>audio-lm</category>
  </item>

    </channel>
  </rss>
