
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>Slowly, and correctly. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Tue, 16 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/text/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8.en</guid>
    <title>Unicode and UTF-8: The Text Encoding Minefield</title>
    <link>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8.en</link>
    <description>Bytes, code points, and grapheme clusters are three different layers. Why the reported length of a single emoji is misleading, how UTF-8, UTF-16, and UTF-32 differ, NFC/NFD normalization and the é problem, ZWJ-joined emoji, surrogate pairs, and why naive string reversal breaks. A deep tour of the text encoding minefield, with special attention to Korean and Japanese.</description>
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>unicode</category><category>text</category><category>fundamentals</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8.ja</guid>
    <title>Unicode and UTF-8: The Character Encoding Minefield</title>
    <link>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8.ja</link>
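    <description>Bytes, code points, and grapheme clusters are separate layers. Why the length of a single emoji lies, how UTF-8, UTF-16, and UTF-32 differ, NFC/NFD normalization and the é problem, ZWJ-joined emoji, surrogate pairs, and why naive string reversal breaks. A deep dive into the character encoding minefield from the perspective of Korean and Japanese users.</description>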
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>unicode</category><category>text</category><category>fundamentals</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8</guid>
    <title>Unicode and UTF-8: The Text Minefield</title>
    <link>https://www.youngju.dev/blog/2026-06-16-unicode-and-utf8</link>
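    <description>Bytes, code points, and grapheme clusters are different layers. Why the length of a single emoji lies, how UTF-8, UTF-16, and UTF-32 differ, NFC/NFD normalization and the é problem, ZWJ-joined emoji, surrogate pairs, and why reversing a string breaks. A deep dig into the text encoding minefield from the perspective of Korean (Hangul) and Japanese users.</description>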
    <pubDate>Tue, 16 Jun 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>unicode</category><category>text</category><category>fundamentals</category>
  </item>

    </channel>
  </rss>
