Monitoring Router Performance

Learn how to track, analyze, and optimize router performance in production applications.

Overview

Effective monitoring is essential for understanding how smart routing behaves in production. This guide covers tracking routing decisions, analyzing performance metrics, and optimizing based on real usage data.

What you’ll learn:
  • Tracking routing decisions and model selection
  • Measuring routing latency and overhead
  • Analyzing cost patterns
  • Identifying routing failures
  • Building monitoring dashboards
  • Optimizing based on metrics

Basic Monitoring

Tracking Selected Models

Track which models the router selects for your requests:
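As a minimal sketch of this idea, the tracker below counts selections per model and reports each model's share of traffic. It assumes the router's response exposes the selected model name as a string; `ModelSelectionTracker` and the model names are hypothetical and not part of any router SDK.

```python
from collections import Counter
from typing import Dict


class ModelSelectionTracker:
    """Counts how often each model is chosen by the router."""

    def __init__(self) -> None:
        self.counts = Counter()

    def record(self, model: str) -> None:
        """Record one routed request; `model` is the name the router picked."""
        self.counts[model] += 1

    def distribution(self) -> Dict[str, float]:
        """Return each model's share of all recorded requests (0.0 to 1.0)."""
        total = sum(self.counts.values())
        if total == 0:
            return {}
        return {model: count / total for model, count in self.counts.items()}


# Usage: in a real application the name would be read from the router's response.
tracker = ModelSelectionTracker()
for selected in ["model-a", "model-a", "model-b", "model-a"]:
    tracker.record(selected)
print(tracker.distribution())
```

Keeping the tracker separate from the request code makes it easy to swap the in-memory counter for a metrics backend later.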

Advanced Monitoring

Comprehensive Metrics Collection

Collect detailed metrics for analysis:
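One possible shape for such a collector is sketched below: it stores one record per routed request and summarizes request volume, error rate, and latency percentiles. The record fields and class names are assumptions for illustration, and the percentile uses the simple nearest-rank method rather than any particular monitoring tool's definition.

```python
import math
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class RoutingRecord:
    """One routed request (hypothetical fields)."""
    model: str
    latency_ms: float
    success: bool = True


def percentile(values: List[float], pct: float) -> float:
    """Nearest-rank percentile for pct in (0, 100]."""
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


@dataclass
class MetricsCollector:
    records: List[RoutingRecord] = field(default_factory=list)

    def add(self, record: RoutingRecord) -> None:
        self.records.append(record)

    def summary(self) -> Dict[str, Optional[float]]:
        """Summarize volume, error rate, and P50/P95/P99 latency."""
        if not self.records:
            return {"requests": 0, "error_rate": 0.0,
                    "p50_ms": None, "p95_ms": None, "p99_ms": None}
        latencies = [r.latency_ms for r in self.records]
        errors = sum(1 for r in self.records if not r.success)
        return {
            "requests": len(self.records),
            "error_rate": errors / len(self.records),
            "p50_ms": percentile(latencies, 50),
            "p95_ms": percentile(latencies, 95),
            "p99_ms": percentile(latencies, 99),
        }


# Usage with made-up latencies; real values would be timed around the router call.
collector = MetricsCollector()
collector.add(RoutingRecord("model-a", 120.0))
collector.add(RoutingRecord("model-b", 340.0, success=False))
print(collector.summary())
```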

Cost Tracking

Monitoring Routing Costs

Track estimated costs by model and over time:
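The sketch below shows one way to estimate and aggregate costs, assuming token counts are available for each request. The price table is a placeholder with invented numbers, not real pricing, and `CostTracker` is a hypothetical helper.

```python
from collections import defaultdict
from datetime import date
from typing import Optional

# Placeholder prices in USD per 1M tokens as (input, output); not real pricing.
PRICES = {
    "model-small": (0.50, 1.50),
    "model-large": (5.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


class CostTracker:
    """Accumulates estimated cost per model and per calendar day."""

    def __init__(self) -> None:
        self.by_model = defaultdict(float)
        self.by_day = defaultdict(float)

    def record(self, model: str, input_tokens: int, output_tokens: int,
               day: Optional[date] = None) -> float:
        cost = estimate_cost(model, input_tokens, output_tokens)
        self.by_model[model] += cost
        self.by_day[day or date.today()] += cost
        return cost


# Usage with made-up token counts.
costs = CostTracker()
costs.record("model-small", input_tokens=1_200, output_tokens=400)
costs.record("model-large", input_tokens=3_000, output_tokens=900)
print(dict(costs.by_model))
```

These figures are estimates only; actual billing may differ from anything computed from token counts.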

Integration with Monitoring Tools

Prometheus Metrics Export

Export metrics for Prometheus monitoring:
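As a dependency-free sketch, the function below renders counters in the Prometheus text exposition format, which a scrape endpoint could return. The metric names are assumptions chosen for illustration; in practice an official client library would normally handle registration and serving.

```python
from typing import Dict


def escape_label(value: str) -> str:
    """Escape backslash, double quote, and newline in a label value."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_prometheus(selection_counts: Dict[str, int], error_count: int) -> str:
    """Render routing counters as Prometheus text exposition format."""
    lines = [
        "# HELP router_model_selected_total Requests routed to each model.",
        "# TYPE router_model_selected_total counter",
    ]
    for model, count in sorted(selection_counts.items()):
        label = escape_label(model)
        lines.append(f'router_model_selected_total{{model="{label}"}} {count}')
    lines += [
        "# HELP router_errors_total Routing requests that failed.",
        "# TYPE router_errors_total counter",
        f"router_errors_total {error_count}",
    ]
    return "\n".join(lines) + "\n"


# Usage: serve this string from a /metrics endpoint for Prometheus to scrape.
print(render_prometheus({"model-a": 3, "model-b": 1}, error_count=0))
```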

DataDog Integration

Send routing metrics to DataDog:
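A minimal sketch using only the standard library: it formats metrics as DogStatsD datagrams and sends them over UDP to a locally running Datadog Agent. The metric names and tags are hypothetical, and the host and port assume the Agent's default DogStatsD listener on `127.0.0.1:8125`. The official Datadog client library is the more usual choice in production.

```python
import socket
from typing import Dict, Optional


def format_dogstatsd(name: str, value: float, metric_type: str,
                     tags: Optional[Dict[str, str]] = None) -> str:
    """Build a DogStatsD datagram: name:value|type|#tag:value,...

    metric_type is e.g. "c" (count), "g" (gauge), or "ms" (timer).
    """
    datagram = f"{name}:{value}|{metric_type}"
    if tags:
        tag_text = ",".join(f"{key}:{val}" for key, val in sorted(tags.items()))
        datagram += f"|#{tag_text}"
    return datagram


def send_metric(datagram: str, host: str = "127.0.0.1", port: int = 8125) -> None:
    """Send one datagram over UDP; delivery is fire-and-forget."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(datagram.encode("utf-8"), (host, port))


# Usage: build datagrams for one routed request (hypothetical metric names).
latency = format_dogstatsd("router.latency", 184, "ms", {"model": "model-a"})
count = format_dogstatsd("router.requests", 1, "c", {"model": "model-a"})
print(latency)
print(count)
# send_metric(latency)  # uncomment when a Datadog Agent is listening locally
```

Because UDP is fire-and-forget, a missing Agent does not block or fail the request path, which is usually the desired behavior for metrics.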

Alerting and Notifications

Set Up Alerts for Routing Issues

Monitor critical metrics and send alerts:
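The sketch below compares current metrics against thresholds and hands any resulting messages to a notifier. The 2-second latency limit follows the checklist in this guide; the error-rate and budget limits are assumed values to adjust, and `send` stands in for whatever channel is used (email, chat webhook, pager).

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class AlertThresholds:
    max_p95_latency_ms: float = 2000.0  # alert above 2 s routing time
    max_error_rate: float = 0.05        # assumed: 5% of requests
    max_daily_cost_usd: float = 100.0   # assumed daily budget


def check_alerts(p95_latency_ms: float, error_rate: float,
                 daily_cost_usd: float,
                 thresholds: AlertThresholds) -> List[str]:
    """Return one message per threshold that is currently exceeded."""
    alerts = []
    if p95_latency_ms > thresholds.max_p95_latency_ms:
        alerts.append(f"High routing latency: P95 is {p95_latency_ms:.0f} ms")
    if error_rate > thresholds.max_error_rate:
        alerts.append(f"High error rate: {error_rate:.1%} of requests failed")
    if daily_cost_usd > thresholds.max_daily_cost_usd:
        alerts.append(f"Budget exceeded: ${daily_cost_usd:.2f} spent today")
    return alerts


def notify(alerts: List[str], send: Callable[[str], None] = print) -> None:
    """Deliver each alert through `send`; defaults to printing."""
    for message in alerts:
        send(message)


# Usage with made-up metric values.
notify(check_alerts(2500.0, 0.01, 12.40, AlertThresholds()))
```

Running a check like this on a schedule, rather than on every request, keeps alerting overhead out of the routing path.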

Best Practices

Monitoring Checklist

Essential Metrics
  • Model selection distribution
  • Routing latency (P50, P95, P99)
  • Error rates and types
  • Request volume over time
  • Cost per model and total

Performance Monitoring
  • Track latency trends
  • Monitor for latency regressions
  • Alert on high latency (>2s routing time)
  • Compare routed vs. fixed model performance

Cost Monitoring
  • Track daily/weekly spending
  • Monitor cost per model
  • Alert on budget thresholds
  • Analyze cost optimization opportunities

Error Monitoring
  • Track error rates by pool
  • Monitor routing failures
  • Set up alerts for error spikes
  • Log errors for debugging

Avoid
  • Monitoring without actionable alerts
  • Ignoring cost patterns
  • Missing latency regressions
  • Not tracking model distribution

Next Steps