Triton

All Posts

Published on
2026년 5월 16일
리버스 엔지니어링 도구 2026 — Ghidra / IDA Pro / Binary Ninja / radare2 / Frida / x64dbg / angr 심층 가이드
reverse-engineering ghidra ida-pro binary-ninja radare2 cutter x64dbg ollydbg hopper plasma angr frida cheat-engine wireshark pwntools pwndbg gef capstone keystone unicorn qemu klee triton 2026 deep-dive
2026년 리버스 엔지니어링 도구 지형도를 디스어셈블러(Ghidra/IDA Pro/Binary Ninja/radare2+Cutter/Hopper), 디버거(x64dbg/OllyDbg/WinDbg/Pwndbg/GEF), 동적 계측(Frida/Cheat Engine/Wireshark), 심볼릭 실행(angr/Triton/KLEE), 그리고 엔진 레이어(Capstone/Keystone/Unicorn/QEMU)로 정밀하게 분해한다. NSA가 2019년에 풀어버린 Ghidra가 어떻게 IDA Pro의 가격 독점을 무너뜨렸는지, Binary Ninja의 BNIL/HLIL이 왜 모던 RE의 표준이 되었는지, Frida가 모바일 RE를 어떻게 재정의했는지, angr와 KLEE의 심볼릭 실행이 실제로 어디까지 작동하는지, 그리고 LLM-assisted RE — Binary Ninja AI / IDA Pro Decompiler AI / Sidekick — 가 2026년에 어디까지 진짜로 작동하는지. CTF 도구 체인(Pwntools + Pwndbg + GEF)과 한국(KAIST 해킹동아리·KISA)·일본(AIST·JPCERT·FFRI セキュリティ) 보안 연구 생태계까지 한 번에 정리한다.
Published on
2026년 4월 15일
MLOps 완전 가이드 — 모델 서빙·Feature Store·Drift·A/B 테스트·GPU 경제학 (Season 2 Ep 7, 2025)
mlops model-serving feature-store drift-detection ab-testing gpu-economics vllm triton mlflow ray kubernetes season-2
모델을 학습하는 것과 프로덕션에서 운영하는 것은 완전히 다른 게임이다. Serving(TorchServe·Triton·vLLM·TGI), Feature Store(Feast·Tecton), Training Infra(Ray·Determined), Experiment Tracking(MLflow·W&B), Data/Concept Drift 감지, Model A/B 테스트와 Shadow Deployment, 그리고 GPU 경제학(on-demand·spot·자체 구매)까지 — "논문에서 프로덕션까지의 거리"를 메우는 실전 MLOps 한 편. Season 2의 일곱 번째.
Published on
2026년 3월 21일
토스뱅크 ML Engineer (MLOps) 합격 완벽 가이드: MLFlow부터 LLM 플랫폼까지 기술스택 총정리
mlops ml-platform tossbank kubernetes mlflow airflow kubeflow triton scylladb feature-store llm gpu career interview 2026-03 2026-03-21
토스뱅크 ML Platform Team의 MLOps Engineer JD를 완전 분석합니다. MLFlow, Airflow, JupyterHub, Kubeflow, Triton Inference Server, ScyllaDB Feature Store, LLM 플랫폼까지 — 합격을 위한 기술스택 딥다이브, 면접 예상 질문 30선, 6개월 학습 로드맵.
Published on
2026년 3월 17일
PyTorch 내부 구조 & 고급 최적화: autograd, torch.compile, FSDP, Triton까지
PyTorch torch.compile FSDP Triton 혼합정밀도 분산학습 2026-03 2026-03-17
PyTorch autograd 엔진, torch.compile() TorchInductor 최적화, FSDP 분산 학습, gradient checkpointing, 커스텀 CUDA 연산까지 PyTorch 완전 정복 가이드입니다.
Published on
2026년 3월 17일
CUDA GPU 프로그래밍 심화: Warp 최적화, Tensor Core, Triton 커널 작성까지
CUDA GPU프로그래밍 TensorCore Triton FlashAttention NCCL 2026-03 2026-03-17
CUDA 메모리 계층, Warp 최적화, Tensor Core WMMA API, Flash Attention 구현, Triton 커스텀 커널 작성까지 AI 모델 학습 가속화를 위한 GPU 프로그래밍 심화 가이드입니다.
Published on
2026년 3월 17일
AI 모델 배포 & 서빙 완전 가이드: Triton, vLLM, BentoML, Kubernetes까지
모델서빙 Triton vLLM BentoML Kubernetes LLM배포 2026-03 2026-03-17
Docker GPU 컨테이너, Kubernetes HPA, NVIDIA Triton, vLLM LLM 서빙, BentoML, Ray Serve까지 AI 모델 프로덕션 배포 완전 가이드입니다.
Published on
2026년 3월 17일
AI 모델 서빙과 추론 최적화 완전 가이드: vLLM, TensorRT, Triton, Ollama
mlops model-serving vllm tensorrt triton inference optimization 2026-03 2026-03-17
AI 모델을 프로덕션에서 효율적으로 서빙하는 완전 가이드. vLLM, TensorRT, NVIDIA Triton Inference Server, Ollama, 양자화(INT8/INT4), 배치 처리, 지연 최적화까지 실전 예제로 마스터합니다.
Published on
2026년 3월 8일
NVIDIA Triton Inference Server 프로덕션 가이드: GPU 모델 서빙 최적화 전략
ai-platform triton inference-server gpu model-serving nvidia 2026-03 2026-03-08
NVIDIA Triton Inference Server를 활용한 GPU 모델 서빙 최적화 가이드. Dynamic Batching, Model Ensemble, TensorRT 통합, 멀티 모델 서빙, Kubernetes 배포, 성능 프로파일링과 프로덕션 트러블슈팅까지 다룹니다.
Published on
2026년 3월 1일
Kubernetes ML 모델 서빙: KServe와 NVIDIA Triton 완전 분석
mlops kubernetes model-serving kserve triton
KServe와 NVIDIA Triton 공식 문서를 기반으로 Kubernetes 환경에서의 ML 모델 서빙 아키텍처를 체계적으로 분석한다.

Triton

triton (9)