
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Chaos and Order</title>
      <link>https://www.youngju.dev/blog</link>
      <description>Slowly and correctly. AI Researcher &amp; DevOps Engineer Youngju&#39;s tech blog. GPU/CUDA, LLM, MLOps, Kubernetes AI workloads, distributed training, and data engineering.</description>
      <language>ko</language>
      <managingEditor>fjvbn2003@gmail.com (Youngju Kim)</managingEditor>
      <webMaster>fjvbn2003@gmail.com (Youngju Kim)</webMaster>
      <lastBuildDate>Sat, 16 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.youngju.dev/tags/reader-lm/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive.en</guid>
    <title>Web Scraping &amp; Crawling Tools in 2026 — Scrapy / Playwright / Puppeteer / Crawlee (Apify) / Firecrawl / Jina Reader / Stagehand AI Deep Dive</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive.en</link>
    <description>A single-pass tour of the 2026 web scraping ecosystem. We cover the classics (Scrapy, Playwright, Puppeteer, Selenium), modern frameworks (Crawlee, Apify), proxy clouds (Bright Data, Oxylabs, Smartproxy), API services (ScrapingBee, Browserless, ZenRows), LLM-friendly tooling (Firecrawl, Jina AI Reader, Diffbot), AI agent browsers (Stagehand, Browser Use, AnchorBrowser), stealth stacks (puppeteer-extra-stealth, undetected-chromedriver, Camoufox), robots.txt and crawl ethics, plus Korean and Japanese case studies — with a decision matrix at the end.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>web-scraping</category><category>crawling</category><category>scrapy</category><category>playwright</category><category>puppeteer</category><category>selenium</category><category>crawlee</category><category>apify</category><category>bright-data</category><category>oxylabs</category><category>smartproxy</category><category>scrapingbee</category><category>browserless</category><category>zenrows</category><category>firecrawl</category><category>jina-ai-reader</category><category>diffbot</category><category>cheerio</category><category>beautifulsoup</category><category>lxml</category><category>jsoup</category><category>goutte</category><category>stagehand</category><category>browserbase</category><category>browser-use</category><category>anchorbrowser</category><category>puppeteer-extra-stealth</category><category>undetected-chromedriver</category><category>camoufox</category><category>reader-lm</category><category>2026</category><category>deep-dive</category><category>english</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive.ja</guid>
    <title>Web Scraping &amp; Crawling Tools 2026: Scrapy / Playwright / Puppeteer / Crawlee (Apify) / Firecrawl / Jina Reader / Stagehand AI Complete Guide</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive.ja</link>
    <description>A one-pass overview of the 2026 web scraping ecosystem. It covers the classics (Scrapy, Playwright, Puppeteer, Selenium), modern frameworks such as Crawlee and Apify, the proxy clouds Bright Data, Oxylabs, and Smartproxy, the API services ScrapingBee, Browserless, and ZenRows, the LLM-friendly tools Firecrawl, Jina AI Reader, and Diffbot, the AI agents Stagehand, Browser Use, and AnchorBrowser, the stealth stacks puppeteer-extra-stealth, undetected-chromedriver, and Camoufox, robots.txt and crawl ethics, and case studies from Japan and Korea, along with a decision matrix for what to choose where.</description>
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>web-scraping</category><category>crawling</category><category>scrapy</category><category>playwright</category><category>puppeteer</category><category>selenium</category><category>crawlee</category><category>apify</category><category>bright-data</category><category>oxylabs</category><category>smartproxy</category><category>scrapingbee</category><category>browserless</category><category>zenrows</category><category>firecrawl</category><category>jina-ai-reader</category><category>diffbot</category><category>cheerio</category><category>beautifulsoup</category><category>lxml</category><category>jsoup</category><category>goutte</category><category>stagehand</category><category>browserbase</category><category>browser-use</category><category>anchorbrowser</category><category>puppeteer-extra-stealth</category><category>undetected-chromedriver</category><category>camoufox</category><category>reader-lm</category><category>2026</category><category>deep-dive</category><category>日本語</category>
  </item>

  <item>
    <guid>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive</guid>
    <title>Web Scraping &amp; Crawling Tools 2026: Scrapy / Playwright / Puppeteer / Crawlee (Apify) / Firecrawl / Jina Reader / Stagehand AI In-Depth Guide</title>
    <link>https://www.youngju.dev/blog/culture/2026-05-16-web-scraping-crawling-tools-2026-scrapy-playwright-puppeteer-crawlee-apify-firecrawl-jina-stagehand-deep-dive</link>
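    <description>A single-pass summary of the 2026 web scraping ecosystem. It covers classics such as Scrapy, Playwright, Puppeteer, and Selenium, modern frameworks such as Crawlee and Apify, proxy clouds such as Bright Data, Oxylabs, and Smartproxy, API services such as ScrapingBee, Browserless, and ZenRows, LLM-friendly tools such as Firecrawl, Jina AI Reader, and Diffbot, AI agent browsers such as Stagehand, Browser Use, and AnchorBrowser, stealth stacks such as puppeteer-extra-stealth, undetected-chromedriver, and Camoufox, and robots.txt, crawl ethics, and Korean and Japanese case studies, together with a map of the tools and criteria for choosing among them.</description>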
    <pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate>
    <author>fjvbn2003@gmail.com (Youngju Kim)</author>
    <category>web-scraping</category><category>crawling</category><category>scrapy</category><category>playwright</category><category>puppeteer</category><category>selenium</category><category>crawlee</category><category>apify</category><category>bright-data</category><category>oxylabs</category><category>smartproxy</category><category>scrapingbee</category><category>browserless</category><category>zenrows</category><category>firecrawl</category><category>jina-ai-reader</category><category>diffbot</category><category>cheerio</category><category>beautifulsoup</category><category>lxml</category><category>jsoup</category><category>goutte</category><category>stagehand</category><category>browserbase</category><category>browser-use</category><category>anchorbrowser</category><category>puppeteer-extra-stealth</category><category>undetected-chromedriver</category><category>camoufox</category><category>reader-lm</category><category>2026</category><category>deep-dive</category>
  </item>

    </channel>
  </rss>
