Split View: 오픈소스 커뮤니티 소식 모음: 2026년 3월 주목할 프로젝트들

오픈소스 커뮤니티 소식 모음: 2026년 3월 주목할 프로젝트들

개요
1. AI 에이전트 혁명
2. 연구 자동화의 부상
- Karpathy의 autoresearch — 3일 만에 22,983 스타
3. 멀티모달 및 비디오 AI
- Google Gemini Embedding 2 — 통합 멀티모달 임베딩
- LTX 2.3 (Lightricks) — 오픈소스 4K 비디오 모델
4. 개발자 도구
- Biome — ESLint + Prettier의 완전한 대체재
- Ladybird Browser — 처음부터 새로 만든 독립 브라우저
5. 보안 및 OSINT
- Shadowbroker — 15개 실시간 데이터 소스 통합 OSINT 도구
6. 커뮤니티 이벤트
- Google Summer of Code 2026 — 22번째 해, 지원 마감 3월 31일
7. 트렌드 분석: 2026년 1분기 GitHub AI 보고서
퀴즈
References

개요

2026년 3월, 오픈소스 생태계는 유례없는 속도로 진화하고 있습니다. AI 에이전트 프레임워크의 폭발적 성장, 연구 자동화 도구의 등장, 그리고 멀티모달 모델의 고도화가 동시에 이루어지고 있습니다. GitHub의 2026년 AI 보고서에 따르면, 2026년 1분기에만 267개의 새 AI 모델이 출시되었으며, 그 대부분이 오픈소스이거나 특화된 모델입니다.

이번 글에서는 이 달 가장 주목받은 프로젝트들을 한눈에 정리해 드립니다.

1. AI 에이전트 혁명

OpenClaw — 가장 빠르게 성장한 오픈소스 프로젝트

2026년 초, GitHub 역사상 가장 빠른 성장세를 기록한 프로젝트 중 하나가 등장했습니다. OpenClaw는 불과 몇 주 만에 9,000개의 스타에서 210,000개 이상으로 폭등했습니다.

OpenClaw는 자신의 하드웨어에서 실행되는 개인 AI 어시스턴트로, 클라우드 의존 없이 로컬 환경에서 강력한 자동화를 구현합니다. 특히 다양한 메시징 플랫폼과의 연동이 강점입니다.

지원 플랫폼:

WhatsApp, Telegram, Slack, Discord
Signal, iMessage, Microsoft Teams

주요 활용 사례:

개발자 워크플로우 자동화
개인 생산성 향상
웹 스크래핑 및 데이터 수집

# OpenClaw 로컬 설치 예시 (Ollama 활용)
git clone https://github.com/openclaw/openclaw
cd openclaw
ollama pull llama3.2
python setup.py --platform slack,telegram
python run.py --model llama3.2 --port 8080

OpenClaw의 성공 비결은 간단합니다. 프라이버시를 중시하는 사용자들이 자신의 하드웨어에서 AI를 직접 실행하면서, 이미 사용 중인 메시징 도구와 자연스럽게 연결할 수 있다는 점입니다.

ByteDance DeerFlow 2.0 — 오픈소스 SuperAgent 하네스

ByteDance가 공개한 DeerFlow 2.0은 "SuperAgent 하네스"로 불리는 오픈소스 에이전트 프레임워크입니다. 연구, 코드 작성, 콘텐츠 생성을 하나의 파이프라인에서 처리할 수 있습니다.

핵심 기능:

샌드박스 환경에서의 안전한 코드 실행
메모리 시스템을 통한 장기적 컨텍스트 유지
다양한 외부 도구 및 API 연동
스킬 라이브러리와 서브에이전트 조율

# DeerFlow 2.0 사용 예시
from deerflow import SuperAgent, Sandbox

agent = SuperAgent(
    model="deepseek-r2",
    sandbox=Sandbox(type="docker"),
    memory=True,
    skills=["web_search", "code_exec", "doc_gen"]
)

result = agent.run(
    task="최신 오픈소스 트렌드를 조사하고 보고서를 작성해줘",
    output_format="markdown"
)

Alibaba Page Agent — 자연어로 웹 인터페이스 제어

Alibaba가 공개한 Page Agent는 JavaScript 기반의 인-페이지 GUI 에이전트입니다. 스크린샷이 아닌 텍스트 기반으로 웹 인터페이스를 분석하므로, 가볍고 정확합니다.

자연어 명령만으로 웹 양식을 작성하고, 버튼을 클릭하고, 복잡한 UI 흐름을 자동화할 수 있습니다.

// Page Agent 사용 예시
import { PageAgent } from '@alibaba/page-agent'

const agent = new PageAgent({ model: 'qwen-max' })

// 자연어 명령으로 웹 제어
await agent.execute('검색창에 "오픈소스 AI 에이전트"를 입력하고 첫 번째 결과를 클릭해줘')
await agent.execute('연락처 양식을 작성하고 제출해줘')

NousResearch Hermes Agent — 자기 학습 AI 에이전트

Hermes Agent는 NousResearch가 공개한 자기 학습 AI 에이전트입니다. 반복적인 작업에서 패턴을 자동으로 파악하고 "스킬"로 저장하여, 유사한 상황에서 재사용합니다.

사용자가 별도로 프로그래밍하지 않아도, 에이전트가 스스로 최적화된 워크플로우를 구축해 나갑니다. 이는 AI 에이전트가 단순한 도구를 넘어 학습하는 동반자가 되는 방향을 제시합니다.

Nvidia NemoClaw — 기업용 AI 에이전트 플랫폼

Nvidia가 발표한 NemoClaw는 기업 환경에 특화된 오픈소스 AI 에이전트 플랫폼입니다. 보안, 규정 준수, 확장성을 고려한 엔터프라이즈 도구들을 포함합니다. Nvidia의 GPU 인프라와 긴밀하게 통합되어 고성능 추론을 지원합니다.

2. 연구 자동화의 부상

Karpathy의 autoresearch — 3일 만에 22,983 스타

Andrej Karpathy가 공개한 autoresearch는 GitHub에서 3일 만에 22,983개의 스타를 획득했습니다. Hacker News에서도 198포인트를 기록하며 큰 주목을 받았습니다.

이 도구는 연구 쿼리가 주어지면 웹 검색, 논문 수집, 요약, 인사이트 생성을 자동으로 수행합니다. "연구 자동화 운동"의 출발점이 된 프로젝트로 평가받습니다.

# autoresearch 사용 예시
pip install autoresearch
autoresearch --query "transformer architecture improvements 2025-2026" \
             --sources arxiv,semantic_scholar,web \
             --output report.md \
             --depth 3

학계와 산업계 모두에서 반향을 일으키고 있으며, 유사한 연구 자동화 도구들의 등장을 촉진하고 있습니다.

3. 멀티모달 및 비디오 AI

Google Gemini Embedding 2 — 통합 멀티모달 임베딩

3월 15일 Google이 공개한 Gemini Embedding 2는 텍스트, 이미지, 비디오, 오디오, 문서를 단일 임베딩 공간에서 처리하는 통합 멀티모달 모델입니다.

기존에는 각 모달리티별로 별도의 임베딩 모델을 사용해야 했지만, Gemini Embedding 2는 이를 하나로 통합합니다. 이는 멀티모달 검색, 추천 시스템, RAG(검색 증강 생성) 파이프라인을 크게 단순화합니다.

LTX 2.3 (Lightricks) — 오픈소스 4K 비디오 모델

LTX 2.3은 Lightricks가 공개한 220억 파라미터의 오픈소스 비디오 생성 모델입니다.

주요 사양:

파라미터 수: 22B
해상도: 네이티브 4K
프레임레이트: 50 FPS
오디오: 동기화된 오디오 생성 지원
세로 모드: 1080x1920 지원 (모바일 콘텐츠)

오픈소스 비디오 생성 분야에서 품질과 성능의 새로운 기준을 제시합니다.

4. 개발자 도구

Biome — ESLint + Prettier의 완전한 대체재

Biome은 JavaScript/TypeScript 생태계의 두 가지 핵심 도구인 ESLint(린팅)와 Prettier(포매팅)를 하나로 통합한 툴체인입니다.

성능 비교:

항목	ESLint	Prettier	Biome
린팅 속도	기준값	-	100배 빠름
포매팅	-	기준값	25배 빠름
설정 파일	`.eslintrc` + `.prettierrc`	-	`biome.json` 하나
언어 지원	JS/TS	다수	JS/TS/JSON/CSS

// biome.json 설정 예시
{
  "$schema": "https://biomejs.dev/schemas/1.9.0/schema.json",
  "linter": {
    "enabled": true,
    "rules": {
      "recommended": true
    }
  },
  "formatter": {
    "enabled": true,
    "indentStyle": "space",
    "indentWidth": 2
  }
}

# Biome으로 린트 + 포맷 한 번에
npx @biomejs/biome check --apply ./src

Rust로 작성되어 기존 Node.js 기반 도구 대비 압도적인 성능을 보여줍니다.

Ladybird Browser — 처음부터 새로 만든 독립 브라우저

Ladybird는 Chromium, WebKit, Gecko 등 기존 브라우저 엔진을 전혀 사용하지 않고 처음부터 새로 작성한 독립 오픈소스 브라우저입니다.

성능, 보안, 프라이버시에 집중
빅테크 의존도 없는 독립적인 웹 엔진
SerenityOS 프로젝트에서 파생되어 발전

현재 개발 중이지만, 웹 브라우저 독점 구조에 도전하는 의미 있는 프로젝트로 커뮤니티의 주목을 받고 있습니다.

5. 보안 및 OSINT

Shadowbroker — 15개 실시간 데이터 소스 통합 OSINT 도구

Shadowbroker는 15개의 실시간 데이터 소스를 통합한 OSINT(오픈소스 인텔리전스) 도구로, Hacker News에서 304포인트를 기록했습니다.

주요 기능:

기업 전용 항공기(비즈니스 제트) 추적
위성 궤도 데이터 분석
해상 선박 추적
기업 정보 및 인프라 매핑

보안 연구자, 저널리스트, 기업 인텔리전스 팀에 특히 유용합니다. 단, 윤리적이고 합법적인 목적으로만 사용해야 합니다.

6. 커뮤니티 이벤트

Google Summer of Code 2026 — 22번째 해, 지원 마감 3월 31일

Google Summer of Code(GSoC) 2026이 22번째 해를 맞이했습니다.

참여 오픈소스 단체: 185개
지원 마감: 2026년 3월 31일
대상: 18세 이상 오픈소스 기여 희망자

GSoC는 매년 수천 명의 개발자들에게 오픈소스 프로젝트에 실질적으로 기여할 기회를 제공합니다. 관심 있는 분들은 3월 31일 전에 지원하세요.

7. 트렌드 분석: 2026년 1분기 GitHub AI 보고서

GitHub의 2026년 AI 보고서는 현재 오픈소스 트렌드를 명확히 보여줍니다.

주요 통계:

2026년 1분기 신규 AI 모델: 267개
오픈소스 또는 특화 모델 비율: 과반수

3대 주도 트렌드:

로컬 추론(Local Inference): 클라우드 없이 자신의 하드웨어에서 AI를 실행하는 수요 급증
워크플로우 자동화: 반복 작업을 AI로 대체하는 파이프라인 구축 도구
에이전트 프레임워크: 여러 AI 모델과 도구를 조율하는 멀티에이전트 시스템

이 데이터는 오픈소스 AI가 단순한 실험을 넘어 실제 프로덕션 환경에 깊숙이 침투하고 있음을 보여줍니다.

퀴즈

Q1. OpenClaw이 가장 빠르게 성장한 오픈소스 프로젝트 중 하나인 이유는?

정답: 자체 하드웨어에서 실행되는 개인 AI 어시스턴트로, WhatsApp, Telegram, Slack 등 주요 메시징 플랫폼과 연동되어 실용성이 높기 때문입니다.

설명: 2026년 초 9,000개에서 210,000개 이상의 스타로 급증했으며, 프라이버시와 로컬 실행을 중시하는 사용자들에게 강한 호소력을 가집니다.

Q2. Biome이 ESLint + Prettier 조합 대비 가지는 가장 큰 기술적 장점은?

정답: Rust로 작성되어 ESLint 대비 약 100배의 린팅 속도를 제공하며, 두 도구를 하나의 설정 파일로 통합 관리할 수 있습니다.

설명: 기존에는 .eslintrc와 .prettierrc를 별도로 관리해야 했지만, Biome은 biome.json 하나로 린팅과 포매팅을 모두 처리합니다.

Q3. Karpathy의 autoresearch 도구의 핵심 기능은 무엇이며, 얼마나 빠르게 인기를 얻었나요?

정답: 연구 쿼리가 주어지면 웹 검색, 논문 수집, 요약, 인사이트 생성을 자동으로 수행하는 연구 자동화 도구로, 3일 만에 22,983개의 GitHub 스타를 획득했습니다.

설명: Hacker News에서도 198포인트를 기록하며 "연구 자동화 운동"의 시발점이 된 것으로 평가받습니다.

Q4. LTX 2.3이 오픈소스 비디오 생성 분야에서 주목받는 이유는?

정답: 22B 파라미터 규모의 오픈소스 모델로, 네이티브 4K 해상도에서 50 FPS, 동기화된 오디오까지 지원하여 오픈소스 비디오 생성의 새로운 기준을 제시했기 때문입니다.

설명: 세로 모드(1080x1920)도 지원하여 모바일 콘텐츠 제작에도 활용할 수 있습니다.

Q5. Google Summer of Code 2026에 대해 올바른 설명은?

정답: 2026년이 22번째 해이며, 185개 오픈소스 단체가 참여하고, 지원 마감은 2026년 3월 31일입니다.

설명: GSoC는 매년 전 세계 개발자들에게 오픈소스 프로젝트에 유급으로 기여할 기회를 제공하는 Google의 대표적인 오픈소스 지원 프로그램입니다.

References

Open Source Community Digest: Noteworthy Projects of March 2026

Overview
1. The AI Agent Revolution
2. The Rise of Research Automation
- Karpathy's autoresearch — 22,983 Stars in 3 Days
3. Multimodal and Video AI
- Google Gemini Embedding 2 — Unified Multimodal Embeddings
- LTX 2.3 by Lightricks — Open-Source 4K Video Generation
4. Developer Tooling
- Biome — The Complete ESLint + Prettier Replacement
- Ladybird Browser — A Truly Independent Web Browser
5. Security and OSINT
- Shadowbroker — 15 Real-Time Data Sources in One OSINT Tool
6. Community Events
- Google Summer of Code 2026 — 22nd Year, Apply by March 31
7. Trend Analysis: GitHub AI Report Q1 2026
Quiz
References

Overview

March 2026 has been a remarkable month for the open source ecosystem. AI agent frameworks are proliferating at an unprecedented pace, research automation tools are gaining traction, and multimodal models continue to push boundaries. According to GitHub's 2026 AI Report, 267 new AI models were released in Q1 2026 alone — the majority open-source or domain-specialized.

This digest covers the most compelling projects and news from this month.

1. The AI Agent Revolution

OpenClaw — Fastest-Growing Open Source Project

One of the most dramatic GitHub growth stories in recent memory: OpenClaw surged from 9,000 to 210,000+ stars within just a few weeks.

OpenClaw is a personal AI assistant that runs entirely on your own hardware — no cloud dependency required. It delivers powerful automation while keeping your data local and private. What sets it apart is seamless integration with the messaging platforms people already use every day.

Supported platforms:

WhatsApp, Telegram, Slack, Discord
Signal, iMessage, Microsoft Teams

Key use cases:

Developer workflow automation
Personal productivity management
Web scraping and data collection

# OpenClaw local setup using Ollama
git clone https://github.com/openclaw/openclaw
cd openclaw
ollama pull llama3.2
python setup.py --platform slack,telegram
python run.py --model llama3.2 --port 8080

The recipe for OpenClaw's success is straightforward: privacy-conscious users want to run AI on their own machines, connected to the communication tools they already rely on. The project delivers that in a polished, accessible package.

ByteDance DeerFlow 2.0 — Open-Source SuperAgent Harness

ByteDance released DeerFlow 2.0, described as an open-source "SuperAgent harness" capable of handling research, code generation, and content creation within a unified pipeline.

Core capabilities:

Safe code execution inside sandboxed environments
Long-term context retention via memory systems
Integration with diverse external tools and APIs
Skills library and sub-agent orchestration

# DeerFlow 2.0 usage example
from deerflow import SuperAgent, Sandbox

agent = SuperAgent(
    model="deepseek-r2",
    sandbox=Sandbox(type="docker"),
    memory=True,
    skills=["web_search", "code_exec", "doc_gen"]
)

result = agent.run(
    task="Research the latest open source trends and write a summary report",
    output_format="markdown"
)

DeerFlow 2.0 represents ByteDance's bet that the future of AI lies in composable, auditable agent pipelines rather than monolithic models.

Alibaba Page Agent — Control Web UIs with Natural Language

Alibaba's Page Agent is a JavaScript-based in-page GUI agent that takes a unique approach: instead of relying on screenshots (computationally expensive and fragile), it uses text-based DOM analysis to understand and interact with web interfaces.

// Page Agent usage example
import { PageAgent } from '@alibaba/page-agent'

const agent = new PageAgent({ model: 'qwen-max' })

// Control web interfaces with natural language
await agent.execute('Type "open source AI agents" in the search bar and click the first result')
await agent.execute('Fill out the contact form and submit it')

Being text-based makes it dramatically lighter than screenshot-based alternatives, while maintaining high accuracy on structured web content.

NousResearch Hermes Agent — Self-Learning AI Agent

Hermes Agent from NousResearch introduces a compelling concept: an AI agent that learns from repetition. As the agent encounters recurring tasks, it automatically identifies patterns, encodes them as "skills," and reuses those skills in future similar situations.

No manual programming of new routines is required — the agent optimizes its own behavior over time. This represents a meaningful step toward AI agents as genuine learning companions rather than static tools.

Nvidia NemoClaw — Enterprise AI Agent Platform

Nvidia announced NemoClaw, an open-source AI agent platform purpose-built for enterprise environments. It includes agentic tooling with security, compliance, and scalability in mind, tightly integrated with Nvidia's GPU infrastructure for high-performance inference.

2. The Rise of Research Automation

Karpathy's autoresearch — 22,983 Stars in 3 Days

Andrej Karpathy published autoresearch, which reached 22,983 GitHub stars within just three days. It also scored 198 points on Hacker News, demonstrating strong community interest.

Given a research query, the tool automatically performs web searches, collects relevant papers, generates summaries, and surfaces actionable insights. It is widely credited with sparking the current research automation movement.

# autoresearch usage example
pip install autoresearch
autoresearch --query "transformer architecture improvements 2025-2026" \
             --sources arxiv,semantic_scholar,web \
             --output report.md \
             --depth 3

Both academia and industry are taking notice, and a wave of similar tools is already emerging in its wake.

3. Multimodal and Video AI

Google Gemini Embedding 2 — Unified Multimodal Embeddings

Released on March 15, Gemini Embedding 2 is Google's unified multimodal embedding model that represents text, images, video, audio, and documents all within a single shared embedding space.

Previously, developers needed separate embedding models for each modality. Gemini Embedding 2 collapses this into a single model, significantly simplifying multimodal search, recommendation systems, and RAG (Retrieval-Augmented Generation) pipelines.

LTX 2.3 by Lightricks — Open-Source 4K Video Generation

Lightricks released LTX 2.3, a 22-billion-parameter open-source video generation model that sets a new quality bar for the field.

Key specifications:

Parameters: 22B
Resolution: Native 4K
Frame rate: 50 FPS
Audio: Synchronized audio generation
Portrait mode: 1080x1920 (optimized for mobile content)

This is a landmark release for open-source video generation — matching or exceeding proprietary offerings in key metrics while being fully open.

4. Developer Tooling

Biome — The Complete ESLint + Prettier Replacement

Biome consolidates two essential JavaScript/TypeScript tools — ESLint (linting) and Prettier (formatting) — into a single, blazing-fast toolchain.

Performance comparison:

Tool	Linting	Formatting	Config
ESLint + Prettier	baseline	baseline	Two separate config files
Biome	100x faster	25x faster	Single `biome.json`

// biome.json configuration example
{
  "$schema": "https://biomejs.dev/schemas/1.9.0/schema.json",
  "linter": {
    "enabled": true,
    "rules": {
      "recommended": true
    }
  },
  "formatter": {
    "enabled": true,
    "indentStyle": "space",
    "indentWidth": 2
  }
}

# Lint + format in a single command
npx @biomejs/biome check --apply ./src

Written in Rust, Biome achieves its performance advantage by operating at native speed rather than through Node.js. For large codebases, the difference is dramatic.

Ladybird Browser — A Truly Independent Web Browser

Ladybird is an open-source web browser built entirely from scratch — no Chromium, no WebKit, no Gecko. Every component is original code, developed with a focus on performance, security, and privacy.

Ladybird originated from the SerenityOS project and has grown into a standalone effort to create a genuine alternative to the current browser duopoly. While still in active development, it represents one of the most ambitious open-source engineering projects of this era.

5. Security and OSINT

Shadowbroker — 15 Real-Time Data Sources in One OSINT Tool

Shadowbroker is an OSINT (Open-Source Intelligence) tool that integrates 15 real-time data sources into a single interface. It scored an impressive 304 points on Hacker News.

Capabilities include:

Corporate jet and private aircraft tracking
Satellite orbit data and analysis
Maritime vessel tracking
Corporate infrastructure mapping

It is particularly valuable for security researchers, investigative journalists, and corporate intelligence teams. As with all OSINT tools, ethical and lawful use is essential.

6. Community Events

Google Summer of Code 2026 — 22nd Year, Apply by March 31

Google Summer of Code (GSoC) 2026 marks its 22nd year.

Participating organizations: 185 open source communities
Application deadline: March 31, 2026
Eligibility: Anyone 18 or older interested in contributing to open source

GSoC provides developers worldwide with a structured, paid opportunity to contribute meaningfully to open source projects under the guidance of experienced mentors. If you've been considering participating, the deadline is approaching.

7. Trend Analysis: GitHub AI Report Q1 2026

GitHub's 2026 AI Report paints a clear picture of where the open source world is heading.

Key figures:

New AI models in Q1 2026: 267
Share that are open-source or specialized: majority

Three leading trends:

Local inference: Demand for running AI on personal hardware, without cloud dependency, is growing rapidly — OpenClaw being the most visible example.
Workflow automation: Tools that replace repetitive tasks with AI-powered pipelines are proliferating across every domain.
Agent frameworks: Multi-agent systems that orchestrate several AI models and external tools are becoming the dominant architectural pattern.

The data confirms what practitioners have been sensing: open source AI is no longer experimental. It is production-ready and deeply embedded in how developers work.

Quiz

Q1. How many GitHub stars did OpenClaw reach, and what is its primary differentiator from cloud-based AI assistants?

Answer: OpenClaw surpassed 210,000 stars (up from 9,000) within weeks. Its primary differentiator is that it runs entirely on the user's own hardware, ensuring full data privacy without cloud dependency.

Explanation: Integration with everyday messaging platforms (WhatsApp, Telegram, Slack, Discord, Signal, iMessage, Teams) made it immediately practical for a broad audience.

Q2. What makes Biome significantly faster than the ESLint + Prettier combination?

Answer: Biome is written in Rust, enabling it to operate at native performance rather than through the Node.js runtime. It achieves approximately 100x faster linting compared to ESLint.

Explanation: In addition to speed, Biome simplifies project configuration by replacing two separate config files (.eslintrc and .prettierrc) with a single biome.json.

Q3. What novel capability does NousResearch Hermes Agent introduce compared to conventional AI agents?

Answer: Hermes Agent is a self-learning agent that automatically identifies patterns in repetitive tasks, encodes them as reusable "skills," and applies those skills in similar future situations — without manual reprogramming.

Explanation: This shifts AI agents from static tools that execute fixed instructions to dynamic systems that improve through use.

Q4. What distinguishes Alibaba's Page Agent from screenshot-based web automation agents?

Answer: Page Agent uses text-based DOM analysis rather than screenshots, making it significantly lighter and more accurate on structured web content while consuming far fewer computational resources.

Explanation: Screenshot-based agents must process image data at every step, which is computationally expensive and prone to errors from visual noise. Text-based DOM analysis is more deterministic and efficient.

Q5. According to the GitHub AI Report, what are the three dominant trends in open source AI for Q1 2026?

Answer: (1) Local inference — running AI on personal hardware without cloud dependency; (2) Workflow automation — replacing repetitive tasks with AI pipelines; (3) Agent frameworks — multi-agent systems orchestrating multiple AI models and tools.

Explanation: 267 new AI models were released in Q1 2026, the majority open-source or domain-specialized, reflecting how rapidly this space is diversifying beyond large general-purpose models.