
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>Slowly and correctly. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Fri, 15 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/assemblyai/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive.en</guid>
    <title>Voice AI in 2026 — ElevenLabs / Cartesia / Sesame / Whisper Turbo / Deepgram / Parakeet Deep Dive</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive.en</link>
    <description>Whisper Large v3 Turbo got 8x faster in October 2024, Cartesia (built by the Mamba authors) hit sub-90ms TTS, and Brendan Iribe&#39;s Sesame launched &quot;voice presence.&quot; By May 2026, the TTS, STT, and realtime-agent axes have all exploded at once. This guide covers ElevenLabs V3, NVIDIA Parakeet 1.1, VOICEVOX, F5-TTS, and Vapi/Retell: who does what well, and how to choose.</description>
    <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>voice-ai</category><category>tts</category><category>stt</category><category>asr</category><category>elevenlabs</category><category>cartesia</category><category>sesame</category><category>whisper</category><category>deepgram</category><category>parakeet</category><category>assemblyai</category><category>voicevox</category><category>vapi</category><category>retell</category><category>2026</category><category>deep-dive</category><category>english</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive.ja</guid>
    <title>Voice AI 2026: ElevenLabs / Cartesia / Sesame / Whisper Turbo / Deepgram / Parakeet Complete Guide</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive.ja</link>
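    <description>In October 2024 Whisper Large v3 Turbo became 8x faster, Cartesia, launched by the Mamba authors, shipped sub-90ms TTS, and Sesame (Brendan Iribe) burst onto the scene with &quot;voice presence.&quot; As of May 2026, the three axes of voice AI (TTS, STT, and realtime agents) have exploded at the same time. From ElevenLabs V3 to NVIDIA Parakeet 1.1, VOICEVOX, F5-TTS, and Vapi/Retell, this guide sorts out who is strong at what and how to choose.</description>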
    <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>voice-ai</category><category>tts</category><category>stt</category><category>asr</category><category>elevenlabs</category><category>cartesia</category><category>sesame</category><category>whisper</category><category>deepgram</category><category>parakeet</category><category>assemblyai</category><category>voicevox</category><category>vapi</category><category>retell</category><category>2026</category><category>deep-dive</category><category>日本語</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive</guid>
    <title>Voice AI 2026: ElevenLabs / Cartesia / Sesame / Whisper Turbo / Deepgram / Parakeet In-Depth Guide</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-15-voice-ai-2026-elevenlabs-cartesia-sesame-whisper-deepgram-parakeet-deep-dive</link>
    <description>In October 2024 Whisper Large v3 Turbo became 8x faster, Cartesia, in the hands of the Mamba authors, built 90ms TTS, and Sesame&#39;s Brendan Iribe arrived with &quot;voice presence.&quot; In 2026, all three axes of voice AI (TTS, STT, and realtime agents) have exploded. From ElevenLabs V3 to NVIDIA Parakeet 1.1, VOICEVOX, F5-TTS, and Vapi/Retell, this guide sums up who does what well and which to pick.</description>
    <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>voice-ai</category><category>tts</category><category>stt</category><category>asr</category><category>elevenlabs</category><category>cartesia</category><category>sesame</category><category>whisper</category><category>deepgram</category><category>parakeet</category><category>assemblyai</category><category>voicevox</category><category>vapi</category><category>retell</category><category>2026</category><category>deep-dive</category>
  </item>

    </channel>
  </rss>
