필사 모드: Uptime Monitoring & Status Pages in 2026 — Better Stack / Pingdom / Uptime Robot / Uptime Kuma / Checkly / Statuspage Deep Dive
English> "Downtime is unavoidable. The only real downtime is downtime that goes unmeasured." — Charity Majors, co-founder of Honeycomb, 2024
Uptime monitoring is the tool that answers the most basic question: "Is your service alive?" Since SiteUptime first appeared in 1998, the essence has stayed the same for nearly 30 years, but the scope of what we monitor has exploded — from a simple HTTP 200 response to SSL certificate expiration, DNS propagation, TLD expiration, API response bodies, multi-step synthetic user flows, and missing cron jobs.
As of May 2026, the uptime monitoring market falls into four major camps: **(1) classic SaaS** (Pingdom, Uptime Robot, Updown.io, Better Stack), **(2) APM-integrated synthetic monitoring** (Datadog Synthetic, New Relic Synthetics, Checkly), **(3) open-source self-hosted** (Uptime Kuma, Gatus, OneUptime, Cachet), and **(4) status pages** (Atlassian Statuspage, Instatus, Better Stack Status Pages). This post walks through the flagship product in each category, the pricing and feature differences, and real-world cases from Korean and Japanese companies.
1. The 2026 Uptime Monitoring Map — SaaS / OSS / APM Integrations / Status Pages
In 2026 the uptime monitoring landscape can be sorted into four buckets.
| Category | Flagship products | Core value | Price range |
|---|---|---|---|
| Classic SaaS | Pingdom, Uptime Robot, Updown.io, Statuscake, Hyperping, Freshping, Site24x7 | HTTP/TCP/ICMP checks from external vantage points with alerts | Free ~ $50/mo |
| Next-gen integrated SaaS | Better Stack, Cronitor, OneUptime | Logs + uptime + on-call + status pages unified | $30 ~ $200/mo |
| APM synthetic monitoring | Datadog Synthetic, New Relic Synthetics, Checkly, Pingdom | Multi-step user flows, API chains, browser automation | $50 ~ $500+/mo |
| Cron monitoring | Cronitor, Healthchecks.io | Detect silence when scheduled jobs fail to run | Free ~ $80/mo |
| OSS self-hosted | Uptime Kuma, Gatus, OneUptime, Cachet | Self-host with a single Docker command, data sovereignty | $0 (infra cost) |
| Status pages | Atlassian Statuspage, Instatus, Better Stack, Hund | Communicate incidents transparently to customers | $29 ~ $1,500+/mo |
The categories overlap. Better Stack is SaaS but bundles status pages and on-call; OneUptime is open source but also offers SaaS and bundles status pages; Checkly began with synthetic monitoring but expanded into API monitoring and local Playwright execution.
The single biggest trend in 2026 is **"single-product consolidation."** Whereas teams used to glue together Pingdom (uptime) + PagerDuty (on-call) + Statuspage (announcements) + Datadog (metrics), Better Stack and OneUptime put all four into one dashboard. That consolidated package is rapidly gaining share in the SMB segment.
2. Better Stack — BetterUptime+Logtail Merged in August 2023
Better Stack began in 2018 in Slovakia as BetterUptime, an uptime monitoring SaaS. In 2021 it launched a separate log management product called Logtail, then in August 2023 it merged the two products under the single **Better Stack** brand and raised an 18 million dollar Series A that same year. As of 2025 it has about 80 employees, tens of thousands of customers, and roughly 20 million dollars in ARR.
Better Stack's core differentiator is unifying **"uptime + logs + alerts + status pages + on-call"** in a single dashboard. Areas that were separate in competing products are now merged.
- Uptime checks (parity with the old BetterUptime, Pingdom)
- Log collection and SQL search (parity with the old Logtail, Datadog Logs)
- Alert routing and schedules (a "PagerDuty Lite")
- Status page hosting (an alternative to Statuspage)
- Incident management (an "incident.io Lite")
Pricing starts with a free plan (10 monitors, 3-minute checks) and climbs through **Freelancer ($29/mo)**, **Small Team ($99/mo)**, and **Business ($299/mo)**. Offering more features at roughly half the price of classic Pingdom plans is the biggest strength.
Another strength is the **detailed incident page**. When an uptime check fails, an incident is auto-generated containing traceroute, DNS queries, SSL certificate state, HTTP headers, response body, and related logs from the same time window — all on one page. The company markets this as a direct contributor to reducing mean time to recovery (MTTR).
Example Better Stack health check (UI or API)
name: api.example.com health
url: https://api.example.com/health
method: GET
expected_status: 200
expected_body: '"status":"ok"'
check_frequency: 30 # seconds
regions:
- us-east
- eu-west
- ap-northeast
notifications:
- type: slack
channel: '#oncall-platform'
- type: pagerduty
service_id: PAB123
ssl_certificate_check: true
domain_expiration_check: true
An API and Terraform provider are available, so monitoring configuration can be managed as code. The `betterstack-elements` SDK that went GA in 2025 lets you embed status pages directly as React or Vue components.
Downsides are (1) it is not a full APM in the league of Datadog or New Relic, (2) log retention is limited to 30–90 days even on expensive plans, and (3) data residency is US and EU only. Korean and Japanese customers sometimes hesitate to adopt it for residency reasons.
3. Pingdom — The Classic Under SolarWinds
Pingdom started in 2005 in Sweden and is one of the oldest uptime monitoring SaaS products in the market. It went through a stagnant period after SolarWinds acquired it in 2014, but the phrase "monitored by Pingdom" is so iconic that it is practically a synonym for uptime monitoring. Even today it is used by more than 700,000 websites worldwide.
The core product splits into two:
- **Pingdom Uptime**: HTTP/HTTPS, TCP, UDP, DNS, SMTP, POP3, IMAP checks from 100+ external vantage points
- **Pingdom Real User Monitoring (RUM)**: page-load timing from real users via a JavaScript snippet
Pricing is **Starter ($15/mo, 10 checks)**, **Professional ($39/mo, 50 checks)**, **Advanced ($89/mo)**, and RUM starts separately at $20/mo. Synthetic transactions cost extra at roughly $15–$20 per transaction per month, so a full feature set quickly climbs into hundreds of dollars per month.
Pingdom REST API example — list checks
curl -X GET https://api.pingdom.com/api/3.1/checks \
-H "Authorization: Bearer $PINGDOM_API_TOKEN"
Create a check
curl -X POST https://api.pingdom.com/api/3.1/checks \
-H "Authorization: Bearer $PINGDOM_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"name": "api.example.com",
"host": "api.example.com",
"type": "http",
"resolution": 1,
"url": "/health",
"encryption": true
}'
Pingdom's biggest strengths are **(1) brand recognition** and **(2) 100+ vantage points**. Especially for global services, when you need to validate "can users in Southeast Asia reach our service," Pingdom offers overwhelmingly more measurement locations than competitors.
Weaknesses are **(1) UI/UX still feels like the 2010s**, **(2) pricing is high relative to competitors**, and **(3) new feature velocity is slow**. As the SolarWinds parent spread resources across other products (Loggly, AppOptics, Papertrail), standalone Pingdom innovation stalled. A price hike followed the 2024 private-equity acquisition of SolarWinds, the free trial was cut from 30 to 14 days, and overall cost efficiency degraded.
For a fresh adoption in 2026, most teams should first evaluate Better Stack, Checkly, or Uptime Robot before Pingdom. That said, if you already live inside the SolarWinds ecosystem, the Pingdom + AppOptics + Loggly bundle discount can be attractive.
4. Uptime Robot — The Undisputed King of the Free 50-Monitor Tier
Uptime Robot started in 2010 in Turkey. The **"50 free monitors with a 5-minute interval"** plan has become the price-comparison anchor for every other product, and as of 2026 it serves over 2 million users and 10 million active monitors worldwide.
Pricing tiers:
- **Free**: 50 monitors, 5-minute interval, basic alerts
- **Solo ($7/mo)**: 50 monitors, 1-minute interval, partial SMS/voice
- **Team ($14/mo)**: 100 monitors, 30-second interval
- **Enterprise ($59/mo)**: 200 monitors, 30-second interval, priority support
The reason the free tier is so generous is not ad monetization but a focus on **"Solo conversion rate."** Clear value gaps like the 1-minute interval and SMS alerts produce about a 5% conversion to paid plans among free users.
Uptime Robot v3 API — list monitors
curl -X POST https://api.uptimerobot.com/v3/getMonitors \
-H "Authorization: Bearer $UPTIMEROBOT_API_KEY" \
-d "format=json"
Create a monitor
curl -X POST https://api.uptimerobot.com/v3/newMonitor \
-H "Authorization: Bearer $UPTIMEROBOT_API_KEY" \
-d "type=1&url=https://example.com&friendly_name=Production%20Web"
The appeal of Uptime Robot is that it is **"the easiest first choice for solo developers and side projects."** You can register 50 monitors within five minutes of signup, with no credit card and no time limit. A free status page (`stats.uptimerobot.com/xxx`) is included, so small projects need nothing else.
Limits are **(1) no multi-step synthetic monitoring**, **(2) no log or metric integration**, and **(3) no built-in on-call** (you have to pair with PagerDuty or OpsGenie). It is best understood as a tool laser-focused on one thing: confirming HTTP 200. When the business grows enough to need multi-step checks or incident management, the typical migration path is to Better Stack or Checkly.
5. Updown.io — French Minimalist Paid
Updown.io was launched in 2014 by French solo developer Adrien Rey-Jarthon. Twelve years later in 2026 it is still operated by 1–2 people, yet handles around 50,000 active monitors reliably.
Every design decision in Updown.io reduces to a single word: **"simplicity."**
- Usage-based pricing (billed by check count, not number of monitors)
- 1 check = 0.000125 EUR, so 1-minute interval for a year ≈ 65 EUR
- Free credit of 50,000 checks (about 35 days at 1-minute intervals)
- Single-page SPA UI, no multi-language support, no mobile app
These constraints make the pricing overwhelmingly cheap. Running 100 monitors at a 1-minute interval for a month costs around 5 EUR — roughly one tenth of Pingdom.
Updown.io API — list all checks
curl -X GET https://updown.io/api/checks \
-H "X-API-KEY: $UPDOWN_API_KEY"
Add a check
curl -X POST https://updown.io/api/checks \
-H "X-API-KEY: $UPDOWN_API_KEY" \
-d "url=https://example.com&period=60&enabled=true"
Updown.io has a passionate base among European indie developers and small startups. Many well-known OSS maintainers on GitHub have publicly stated they use Updown.io for the API monitoring of their projects.
Downsides include (1) limited vantage points in Korea and Japan (as of 2026: 4 in Europe, 3 in the US, 2 in Asia), (2) no multi-step or synthetic transactions, and (3) English-mostly support docs. If you run a global SaaS, the regional diversity of vantage points may be insufficient.
6. Cronitor and Healthchecks.io — The Two Giants of Cron Monitoring
A different category often confused with uptime monitoring is **"cron job monitoring."** This is the tool that detects **"the silence of a scheduled task that did not run."** A regular HTTP check confirms the service is alive, while cron monitoring confirms whether "today's backup ran at 3 a.m."
The two flagships here are Cronitor and Healthchecks.io.
Healthchecks.io
Started in 2015 by Latvian developer Pēteris Caune as a solo project, it is an open-source cron monitoring SaaS. The free plan is generous (20 checks) and the BSD-licensed source code can be self-hosted. Pricing tiers are **Hobbyist ($5/mo, 100)**, **Business ($25/mo, 500)**, and **Plus ($80/mo, 1,500)**.
The core pattern is the **dead man's switch**. You ping a URL at the start and end of a job, and if the ping does not arrive within the expected window, an alert fires.
crontab example — hourly backup
0 * * * * /usr/local/bin/backup.sh && curl -fsS --retry 3 https://hc-ping.com/your-uuid
Report start/end/failure (recommended)
0 * * * * curl -fsS --retry 3 https://hc-ping.com/your-uuid/start && /usr/local/bin/backup.sh && curl -fsS --retry 3 https://hc-ping.com/your-uuid || curl -fsS --retry 3 https://hc-ping.com/your-uuid/fail
Cronitor
Cronitor began in 2014 in San Francisco. Pricing is higher than Healthchecks.io (starting at $24/mo), but it offers **(1) full uptime check integration**, **(2) metric and log collection**, **(3) cron job statistics**, and **(4) automatic discovery of AWS Lambda and Kubernetes CronJobs**.
The `cronitor` CLI wraps existing cron commands and tracks them automatically.
Cronitor CLI install
brew install cronitor
Auto-discover crontab entries
cronitor discover
Command-wrapper form
0 * * * * cronitor exec backup-job /usr/local/bin/backup.sh
The selection criterion is simple. **"Pure cron monitoring only — Healthchecks.io. Uptime/metrics/logs integration too — Cronitor."** Healthchecks.io is also attractive to security-conscious organizations because it can be self-hosted.
7. OneUptime — Open Source All-in-One Monitoring Platform
OneUptime was launched in 2021 in the United States as an **open-source all-in-one monitoring platform**. Apache 2.0 licensed with the full source on GitHub, it also offers a cloud SaaS (pricing $0–$50 per user per month).
OneUptime's ambition is to be **"the open-source alternative to Better Stack + Statuspage + Datadog + PagerDuty + Cachet"**. In a single dashboard it covers:
- Uptime monitoring (HTTP, TCP, ICMP, DNS, SSL expiration)
- Synthetic monitoring (Playwright-based)
- Status page hosting
- Incident management + on-call rotation
- Log collection and search
- APM (distributed tracing)
- User and session monitoring
Self-hosting is a single Docker Compose command.
OneUptime self-hosting
git clone https://github.com/OneUptime/oneuptime.git
cd oneuptime
cp config.example.env config.env
After editing config.env
docker compose up -d
Strengths include **(1) ability to retain all data via self-hosting**, **(2) consolidated monitoring/on-call/status pages on one platform**, and **(3) Apache 2.0 commercial-use freedom**.
Weaknesses are **(1) installation and operational complexity** (requires PostgreSQL, ClickHouse, and Redis), **(2) UI polish lower than commercial SaaS**, **(3) sparse documentation**, and **(4) still-small community** (around 6,000 GitHub stars). As of 2026, OneUptime is the most ambitious player in the "open source unified monitoring" segment, but also walks the toughest path.
8. Statuscake / Hyperping / Freshping / Site24x7 — The Mid-Price Contenders
These four products target the gap between Pingdom and Uptime Robot in the mid-price band. The short version of each:
Statuscake (UK)
Started in 2012 in the UK. The free plan is generous (10 monitors, 5-minute interval) and paid plans start at **Superior ($24/mo)**. Popular among UK and EU users for data residency reasons. SSL certificate monitoring and domain expiration checks come bundled at the same price.
Hyperping (France)
Started in 2017 in France. The differentiators are a **"developer-friendly UI"** and **free status-page hosting**. Pricing starts at $19/mo, sitting between Updown.io and Pingdom. A Korean UI added in late 2024 boosted Asian uptake.
Freshping (Freshworks)
A free add-on product from Freshworks (Freshdesk, Freshservice). With **50 free monitors at 1-minute intervals**, it is a direct competitor to Uptime Robot. Attractive to organizations using the Freshworks ecosystem (e.g., automatic Freshdesk ticket creation), but feature-thin as a standalone product.
Site24x7 (Zoho)
Zoho's (India) comprehensive monitoring suite. Covers uptime, server, network, cloud, APM, logs, real user monitoring, and status pages — one of the broadest products available. Starts at **$10/mo**, but a full feature set climbs into hundreds of dollars per month. Often evaluated as a low-cost Datadog alternative.
| Product | Origin | Starting price | Notes |
|---|---|---|---|
| Statuscake | UK | $24/mo | EU data residency, SSL/domain expiration |
| Hyperping | France | $19/mo | Developer UI, free status page, Korean support |
| Freshping | India | Free 50 | Freshworks integration |
| Site24x7 | India | $10/mo | Zoho full stack, Datadog alternative |
9. New Relic Synthetics + Datadog Synthetic — APM-Integrated Synthetics
Where traditional uptime monitoring asks "does this single URL respond," **synthetic monitoring** verifies "does the real user's multi-step flow work end to end?" For example, running the 5-step flow "home → login → product search → add to cart → checkout → confirmation" every 5 minutes in an actual headless browser.
The two giants here are Datadog Synthetic Monitoring and New Relic Synthetics.
Datadog Synthetic Monitoring
Datadog's synthetic monitoring splits into two:
- **API Tests**: single HTTP/gRPC/WebSocket checks
- **Browser Tests**: multi-step user flow automation via headless Chrome
Pricing is usage-based: **API Tests 10,000/month at $5**, **Browser Tests 1,000/month at $12**. Decisive for organizations already on Datadog APM/Logs/Infrastructure because metrics and synthetic data show up in the same dashboard.
// Example Datadog Synthetic Browser Test script (recorded in UI, then exported)
await page.goto('https://example.com')
await page.click('button[data-test="login"]')
await page.fill('input[name="email"]', 'test@example.com')
await page.fill('input[name="password"]', 'password123')
await page.click('button[type="submit"]')
await page.waitForSelector('.dashboard')
New Relic Synthetics
Similar to Datadog, but it offered a **"Scripted Browser"** capability for multi-step browser automation earlier. Scripts are written in Selenium WebDriver style and can be executed inside corporate networks via Private Minion (a self-hosted agent).
Pricing now uses the new usage model: **first 100 GB/month free**, then $0.30 per GB after that. Synthetic checks count against the GB usage, and the bundle becomes most efficient when you use the full observability package (APM + Logs + Synthetics + RUM).
Comparison with Checkly
Checkly (covered later) carves out the synthetic monitoring portion of Datadog/New Relic and makes it more developer-friendly. Pricing is about one third of Datadog, but without APM/Logs integration, so organizations that already have an observability stack often pay the premium for Datadog/New Relic for unified tooling.
10. Atlassian Statuspage — The Status Page Market Standard
A status page is the official page that transparently communicates service incidents to customers. After Stripe's hand-built status.stripe.com became the design template in 2012, Statuspage.io productized that design as a SaaS and Atlassian acquired it in 2016, rebranding it as **Atlassian Statuspage**.
As of 2026, more than 90% of major global SaaS — GitHub, AWS, Cloudflare, OpenAI, Vercel, Slack, Shopify, and others — use Atlassian Statuspage. The mental model "status page = Atlassian Statuspage" is so entrenched that it is the de facto standard.
Pricing tiers: **Free** (1 page, 2 team members, 100 subscribers), **Starter ($29/mo)**, **Growth ($99/mo)**, **Business ($299/mo)**, **Enterprise ($1,500+/mo)**. Enterprise adds SSO, audit logs, custom domains, and SLA guarantees.
Key building blocks of a status page:
- **Components**: monitored items (API, Web, Database, Email, ...)
- **Component status**: Operational, Degraded, Partial Outage, Major Outage
- **Incidents**: ongoing incidents and their update history
- **Scheduled Maintenance**: announced maintenance windows
- **Subscribers**: receive incident notifications via email, SMS, Slack, RSS, webhooks
- **Uptime history**: 90-day uptime chart per component
Atlassian Statuspage API — create an incident
curl -X POST \
"https://api.statuspage.io/v1/pages/$PAGE_ID/incidents" \
-H "Authorization: OAuth $STATUSPAGE_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"incident": {
"name": "API latency observed",
"status": "investigating",
"impact_override": "minor",
"body": "We are investigating elevated API latency.",
"component_ids": ["abc123"],
"components": {"abc123": "degraded_performance"}
}
}'
Strengths: (1) market-standard position (users are visually used to it), (2) integration with Jira/Opsgenie/Confluence, (3) Atlassian's stable SLA. Weaknesses: (1) expensive with frequent price hikes, (2) slow feature velocity, (3) UI feels heavy for new users.
11. Instatus / Cachet — Status Page Alternatives
The market reacted to the price and weight of Atlassian Statuspage with alternatives like Instatus and Cachet.
Instatus
Launched in 2020 in the UK, Instatus positions itself as **"Statuspage at one third the price with a 10x faster UI."** Pricing starts **free (1 page, unlimited subscribers)** then climbs to **Starter ($20/mo)** and **Pro ($50/mo)**.
Differentiators:
- **Sub-second page load** (vs. Atlassian Statuspage at 3–5 seconds)
- Unlimited subscribers (Statuspage Starter caps at 1,000)
- Custom domain even on the free plan
- Incident template library (writing guides per incident type)
// Instatus API — create an incident
fetch('https://api.instatus.com/v1/{pageId}/incidents', {
method: 'POST',
headers: {
'Authorization': `Bearer ${process.env.INSTATUS_API_KEY}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
name: 'API latency increased',
status: 'INVESTIGATING',
notify: true,
components: ['comp_abc123'],
statuses: [{id: 'comp_abc123', status: 'PARTIALOUTAGE'}],
started: new Date().toISOString(),
}),
})
Notable customer stories include OpenAI evaluating Statuspage alternatives and migrating some subsidiary pages to Instatus, and Vercel using Instatus for select regional status pages.
Cachet (OSS)
Cachet is an open-source status page built in 2014 by UK developer James Brooks using PHP/Laravel. BSD-3 licensed with the source on GitHub, it sits at around 17,000 stars as of 2026.
Self-hosting is the core path and can be set up in five minutes via Docker.
Cachet self-hosted via Docker
docker run -d --name=cachet \
-p 80:80 \
-e APP_KEY=$(php -r "echo 'base64:'.base64_encode(random_bytes(32));") \
-e DB_DRIVER=pgsql \
-e DB_HOST=postgres \
-e DB_DATABASE=cachet \
-e DB_USERNAME=cachet \
-e DB_PASSWORD=secret \
cachethq/docker:latest
Cachet's limits are (1) maintainer activity dropped after 2020 and updates slowed, (2) heavy PHP dependency, and (3) somewhat dated UI. For new adoption in 2026, the **Uptime Kuma** described next is usually a more modern alternative.
Other — Hund
Hund is a newer SaaS competitor in the status page market, with strengths in incident automation and component grouping. Pricing starts at $59/mo — more than Instatus but less than Atlassian Statuspage. It often shows up as the comparison choice for SMBs evaluating both options.
12. Uptime Kuma — The Most Popular OSS Self-Hosted via Docker
Uptime Kuma was launched in 2021 by Hong Kong developer Louis Lam as a solo side project. As of May 2026 it sits at roughly 65,000 GitHub stars and is the **de facto standard for self-hosted uptime monitoring**.
Installation is a single Docker command.
Self-host Uptime Kuma — single Docker container
docker run -d --restart=always -p 3001:3001 \
-v uptime-kuma:/app/data \
--name uptime-kuma \
louislam/uptime-kuma:1
Docker Compose version
docker-compose.yml
version: '3.3'
services:
uptime-kuma:
image: louislam/uptime-kuma:1
container_name: uptime-kuma
volumes:
- ./uptime-kuma-data:/app/data
ports:
- "3001:3001"
restart: always
The appeal is **(1) every feature in one container**, **(2) Pingdom-quality UI polish**, **(3) 90+ notification channels** (Slack, Discord, Telegram, email, SMS, webhook, ntfy, Gotify, and more), **(4) built-in status page hosting**, and **(5) a very active community**.
Supported check types are wide:
| Check type | Description |
|---|---|
| HTTP(s) | URL with response code and keyword matching |
| TCP Port | Port-open check |
| Ping | ICMP ping |
| DNS | DNS record response |
| Docker Container | Docker container state |
| Steam Game Server | Game server |
| Push (Heartbeat) | Cron monitoring |
| MQTT | IoT message broker |
| Microsoft SQL Server, MySQL, PostgreSQL, Redis | Database connection |
| Kafka Producer | Message queue |
| gRPC | gRPC endpoint |
Status pages are one-click, and custom domains and password protection are supported. The **Uptime Kuma 2.0** beta released in 2024 added PostgreSQL backend support, multi-instance synchronization, and data retention policies.
Limits are **(1) single-instance SPOF** (being addressed in 2.0), **(2) no multi-vantage-point** (only your own server checks), and **(3) sparse enterprise features** (SSO, audit log, etc.). Still, for solo developers, homelabs, and SMBs the value-per-dollar is overwhelming.
13. Gatus — Go-Based Monitoring with a Single YAML File
Gatus is a **Go-based open-source monitoring tool** built in 2019 by Canadian developer Tristan Webber. A direct competitor to Uptime Kuma, but with a completely different design philosophy.
- **Uptime Kuma**: UI-first, click to configure, store in a database
- **Gatus**: a single YAML file for everything, GitOps-friendly, code-reviewable
In Gatus, every monitor is defined in a single YAML file.
config.yaml — full Gatus configuration
storage:
type: postgres
path: "postgres://user:password@localhost:5432/gatus?sslmode=disable"
endpoints:
- name: api-health
group: production
url: https://api.example.com/health
interval: 30s
conditions:
- "[STATUS] == 200"
- "[RESPONSE_TIME] < 500"
- "[BODY].status == ok"
alerts:
- type: slack
send-on-resolved: true
- name: database-tcp
group: production
url: tcp://db.internal:5432
interval: 1m
conditions:
- "[CONNECTED] == true"
- name: certificate-check
url: https://example.com
interval: 1h
conditions:
- "[CERTIFICATE_EXPIRATION] > 168h"
alerting:
slack:
webhook-url: "https://hooks.slack.com/services/..."
default-alert:
description: "Service down"
send-on-resolved: true
failure-threshold: 3
Strengths: **(1) the YAML file can be Git-versioned**, **(2) first-class Helm chart for Kubernetes**, **(3) selectable PostgreSQL/SQLite/in-memory backends**, **(4) Prometheus metrics export**, and **(5) maintenance window support**.
Gatus via Docker
docker run -d \
-p 8080:8080 \
-v $(pwd)/config:/config \
--name gatus \
twinproduction/gatus:latest
Kubernetes via Helm
helm repo add twin https://twin.github.io/helm-charts
helm install gatus twin/gatus
Limits: **(1) the UI is plain** (not as flashy as Uptime Kuma), **(2) no multi-user/team management**, and **(3) no synthetic monitoring**. But for DevOps/SRE teams that believe in **"monitoring as code,"** it is the cleanest fit.
14. Checkly — The Rising Star of Playwright-Based Synthetic Monitoring
Checkly was launched in 2018 in Berlin, Germany. It positions itself as **"synthetic monitoring for developers,"** with Playwright-based code-first monitoring as the core differentiator. In 2023 it raised a 53 million dollar Series B, and as of 2026 it has about 100 employees and is growing fast.
The biggest innovation in Checkly is **"Monitoring as Code."** Every monitor is defined in TypeScript/JavaScript and managed via Git.
// __checks__/api-health.check.ts
new ApiCheck('api-health-check', {
name: 'API Health Check',
alertChannels: [],
degradedResponseTime: 1000,
maxResponseTime: 5000,
request: {
method: 'GET',
url: 'https://api.example.com/health',
assertions: [
AssertionBuilder.statusCode().equals(200),
AssertionBuilder.jsonBody('$.status').equals('ok'),
AssertionBuilder.responseTime().lessThan(500),
],
},
})
// __checks__/checkout-flow.spec.ts (Playwright browser test)
test('checkout flow', async ({ page }) => {
await page.goto('https://example.com')
await page.click('text=Sign in')
await page.fill('[name=email]', 'test@example.com')
await page.fill('[name=password]', process.env.TEST_PASSWORD!)
await page.click('button[type=submit]')
await expect(page).toHaveURL(/dashboard/)
})
Deployment is via CLI.
Checkly CLI
npm install -g checkly
checkly login
checkly deploy
Preview deploys per PR in CI/CD
checkly trigger --record
Strengths: **(1) standardizes on Playwright** (no proprietary DSL to learn), **(2) GitOps-friendly**, **(3) next-gen UI**, **(4) automatic integration with Vercel/Netlify deploys**, and **(5) Datadog/Slack/PagerDuty integrations**.
Pricing: **Free** (5 checks, 10,000 API calls/mo), **Hobby ($14/mo)**, **Team ($80/mo)**, **Enterprise ($300+/mo)**. About one third the price of Datadog Synthetic, making it attractive to startups assembling their first observability stack.
Limits: **(1) no APM/Logs integration** (unlike Datadog), **(2) only 5 vantage points** (fewer than Pingdom), and **(3) no Korean or Japanese UI**. Still, in 2026 it is the **fastest-growing product in the synthetic monitoring category**.
15. Korea and Japan — Real Operational Cases at Toss, Kakao, Mercari, and NTT
What uptime monitoring tools do the major Korean and Japanese internet companies actually run? Based on public materials and engineering blogs:
Korea — Toss, Kakao
**Toss (Viva Republica)** revealed its in-house monitoring system **TIMS (Toss Infrastructure Monitoring System)** at the SLASH 2024 conference. On a Prometheus + Grafana + Loki metric/log foundation, external uptime checks use Datadog Synthetic and internal cron monitoring is a custom system. The status page is a custom Next.js app styled with the Toss design system — no external SaaS used.
**Kakao** operates the largest monitoring infrastructure in Korea. After the 2022 SK C&C data center fire, several things changed:
- Expanded multi-cloud vantage points (checks now run from AWS, GCP, and Naver Cloud)
- Launched the dedicated KakaoTalk status page at `status.kakao.com`
- Integrated push notifications inside KakaoTalk to inform users directly during incidents
- Built an internal incident management tool after evaluating and rejecting Atlassian Statuspage
Japan — Mercari, NTT
**Mercari** disclosed its monitoring stack on its SRE blog in 2023:
- Metrics: Datadog + Prometheus
- Synthetics: Datadog Synthetic + custom Playwright scripts
- Status page: Atlassian Statuspage (`status.mercari.com`)
- On-call: PagerDuty
- Auto-incident creation: Datadog to Statuspage automated workflow
**NTT**, true to its position as Japan's telecom infrastructure operator, runs an in-house monitoring system. External SaaS is only a secondary tool; core infrastructure depends on Zabbix-based custom monitoring and NTT's internal NMS (Network Management System). External status pages still rely on the traditional approach where operators update them manually.
Tool selection trends in Korea and Japan
| Scale | Korea | Japan |
|---|---|---|
| Solo/side | Uptime Robot, Uptime Kuma | Mackerel, Uptime Kuma |
| Startup | Datadog, Better Stack | Mackerel, Datadog |
| Mid-market | Datadog, Pingdom | Datadog, New Relic |
| Enterprise | In-house + Datadog | In-house + Datadog |
The interesting wrinkle in the Japanese market is that **Mackerel** (a Japan-grown monitoring SaaS by Hatena) remains strong in the SMB segment. Native Japanese UI/docs, JPY billing, and tight integrations with Japan-local SaaS (the Japan offices of Slack, Microsoft Teams Japan, etc.) are its key strengths.
16. Who Should Pick What — Solo / Startup / OSS / Enterprise
Among the 20+ products we reviewed, which one should you pick? Recommendations by user type:
Solo developers / side projects
- **First pick: Uptime Robot free plan** (50 checks, free forever)
- Self-hosting preference: **Uptime Kuma** (one Docker line)
- Add cron monitoring: **Healthchecks.io free** (20 checks)
- Need a status page: **Instatus free** or Uptime Kuma's built-in
Seed to Series A startup
- One tool that does it all: **Better Stack** ($29 ~ $99/mo)
- Need synthetic monitoring: **Checkly** ($14 ~ $80/mo) + Uptime Robot
- First observability stack: **Datadog 14-day free**, then evaluate cost
- Status page: **Instatus** ($20/mo) or built into Better Stack
Series B+ growth-stage
- APM + synthetics integration: **Datadog Synthetic** ($30 ~ $300/mo)
- Or as a Datadog alternative: leverage **New Relic's 100 GB free**
- Incident management: **PagerDuty** + **incident.io**
- Status page: **Atlassian Statuspage** (market standard)
- Cron: **Cronitor** (auto-detects Lambda/K8s)
Enterprise
- Observability core: **Datadog** or **New Relic** (enterprise license)
- Synthetics: built into Datadog/NR + **Checkly** Enterprise (CI/CD integration)
- Status page: **Atlassian Statuspage Enterprise** ($1,500+/mo) + Confluence integration
- On-call: **PagerDuty Enterprise** or **Opsgenie**
- Data residency concerns: in Korea/Japan, self-built + Datadog as a supplement
Security/regulated industries (finance, healthcare, government)
- Self-hosting is mandatory: combine **OneUptime** self-hosted, **Uptime Kuma**, and **Gatus**
- Metrics/logs: build **Prometheus + Grafana + Loki** yourself
- Status page: **Cachet** or a static page built with Next.js/Hugo
Conclusion
In 2026 uptime monitoring evolved along two axes: **"integrated product vs. specialized product + glue tooling."** From solo developers to seed-stage startups, a single integrated product like Uptime Robot, Better Stack, or Uptime Kuma is enough. From Series B onward synthetic monitoring becomes mandatory, and at enterprise scale the typical pattern is unifying the whole observability stack on a full-stack platform like Datadog or New Relic.
The single most important principle is one line. **"Unmonitored downtime is real downtime."** Whichever tool you pick, the priority is to adopt one and start collecting data.
References
- Better Stack — https://betterstack.com/
- Better Stack merger announcement (Aug 2023) — https://betterstack.com/community/news/introducing-better-stack/
- Pingdom — https://www.pingdom.com/
- SolarWinds Pingdom — https://www.solarwinds.com/pingdom
- Uptime Robot — https://uptimerobot.com/
- Updown.io — https://updown.io/
- Cronitor — https://cronitor.io/
- Healthchecks.io — https://healthchecks.io/
- Healthchecks GitHub (OSS) — https://github.com/healthchecks/healthchecks
- OneUptime — https://oneuptime.com/
- OneUptime GitHub — https://github.com/OneUptime/oneuptime
- Statuscake — https://www.statuscake.com/
- Hyperping — https://hyperping.com/
- Freshping — https://www.freshworks.com/website-monitoring/
- Site24x7 — https://www.site24x7.com/
- Datadog Synthetic Monitoring — https://www.datadoghq.com/product/synthetic-monitoring/
- New Relic Synthetics — https://newrelic.com/platform/synthetic-monitoring
- Checkly — https://www.checklyhq.com/
- Checkly CLI — https://www.checklyhq.com/docs/cli/
- Atlassian Statuspage — https://www.atlassian.com/software/statuspage
- Statuspage API — https://developer.statuspage.io/
- Instatus — https://instatus.com/
- Cachet — https://cachethq.io/
- Cachet GitHub — https://github.com/CachetHQ/Cachet
- Hund — https://hund.io/
- Uptime Kuma — https://uptime.kuma.pet/
- Uptime Kuma GitHub — https://github.com/louislam/uptime-kuma
- Gatus GitHub — https://github.com/TwiN/gatus
- Standard Webhooks — https://www.standardwebhooks.com/
- Mercari Engineering Blog — https://engineering.mercari.com/
- Toss SLASH 2024 — https://toss.tech/slash-24
- Mackerel (Hatena) — https://mackerel.io/
현재 단락 (1/399)
Uptime monitoring is the tool that answers the most basic question: "Is your service alive?" Since S...