Logs, metrics, traces, and alerts — built into every service. No Datadog bill. No agent to install. No configuration required.
Everything you need to understand your production system. No third-party tools required.
Every stdout and stderr captured, timestamped, and searchable. Filter by service, level, time range, or text search. Stream in real-time or query historical.
rf logs --service api --level error --since 1hCPU, memory, request rate, latency percentiles, error rate, queue depth — all collected automatically. Push custom metrics from your app for business KPIs.
rf metrics --service api --period 24hSee how a request flows through your system. Which service took 680ms of an 820ms request? OpenTelemetry-compatible distributed tracing across all services.
rf traces --service api --min-duration 500msThreshold alerts, anomaly detection, deployment notifications, and uptime monitoring. Route to Slack, email, PagerDuty, or webhooks. Escalation policies.
rf alerts create --metric error_rate --threshold "> 2%" --notify slackrf metrics shows all services at a glance. rf logs streams live or searches historical. rf traces shows request flow. No dashboards to set up.
Output JSON from your app and RaidFrame parses it automatically. Fields become searchable and filterable. No logging library required.
Errors are automatically grouped, deduplicated, and linked to the deployment that introduced them. See frequency, affected endpoints, and stack traces.
Export metrics to Grafana via the Prometheus-compatible endpoint. Build custom dashboards if you need them. rf metrics prometheus-endpoint to enable.
Set alerts and go build. Don't watch dashboards.
Trigger when a metric crosses a threshold. CPU > 85% for 3 minutes. Error rate > 2% for 5 minutes. P99 latency > 1 second.
RaidFrame learns your traffic patterns and alerts on deviations. No threshold to set — unusual spikes or drops detected automatically.
Auto-notify on deploy start, success, failure, and rollback. Know immediately when a deploy goes out and if it caused problems.
Monitor external endpoints every 30 seconds. Alert when your site goes down before users report it. rf alerts create --type uptime.
Route alerts to Slack, email, PagerDuty, OpsGenie, or custom webhooks. Different channels for different severity levels.
Notify Slack immediately. If not acknowledged in 15 minutes, page on-call via PagerDuty. After 30 minutes, email leadership.
| Feature | Datadog | New Relic | RaidFrame |
|---|---|---|---|
| Pricing | $23/host/mo + $0.10/GB logs | $0.30/GB ingested | Included |
| Logs | Separate product | Included (100GB free) | Included |
| Metrics | $23/host/mo | Included | Included |
| Tracing (APM) | $36/host/mo | $0.30/GB | Included |
| Alerting | Included | Included | Included |
| Setup | Install agent per host | Install agent | Automatic |
| 4 hosts, APM, 50GB logs | ~$236/mo | ~$100/mo | $0 |
Custom metrics appear alongside system metrics. Use them as auto-scaling triggers or alert conditions.
For most teams, yes. Logs, metrics, traces, and alerts are included at no extra cost. No per-host pricing. No agent installation.
3 days on Starter, 30 days on Pro, 90 days on Enterprise. Export to S3-compatible storage for longer retention.
No. Metrics, logs, and traces are collected automatically. No APM agent, no sidecar container, no instrumentation for basic monitoring.
Yes. OpenTelemetry-compatible. Trace context is injected automatically. Use the OTEL SDK for richer trace spans.
Yes. Prometheus-compatible metrics endpoint. Connect Grafana and build custom dashboards with PromQL.
Negligible. Log collection adds less than 1ms per request. Metrics are sampled. Tracing overhead is under 2%.
Built-in observability. No Datadog bill required.