Production
Deploy, operate, and scale StrataRouter in production environments.
Guides
Deployment
Deploy to Docker, Kubernetes, AWS, GCP, or Azure with ready-to-use manifests.
Configuration
Production-grade configuration: timeouts, concurrency, retry policies, and resource limits.
Monitoring
Prometheus dashboards, OpenTelemetry traces, SLO tracking, and alerting setup.
Performance Tuning
Optimize HNSW parameters, cache hit rates, and thread pool size for your workload.
Scaling
Horizontal and vertical scaling strategies to handle 10K–100K+ requests per second.
Security
TLS, API key management, network policies, and security hardening best practices.
High Availability
Design for 99.99%+ uptime with active-active deployment, failover, and circuit breakers.
Troubleshooting
Common production issues, diagnostic commands, and resolution playbooks.
Quick Reference
| Metric | Value |
|---|---|
| P99 Latency | < 10 ms routing, ~50 ms with provider |
| Throughput | 18K req/s single node (4 cores) |
| Cache Hit Rate | 85%+ in typical production workloads |
| Uptime Target | 99.99% with HA deployment |
| Memory (1K routes) | ~64 MB base footprint |
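
For rough capacity planning, the figures in the table can be turned into a back-of-the-envelope estimate. The sketch below is illustrative only: it assumes throughput scales linearly with node count, which the table does not claim, and the function names and the 30% headroom default are hypothetical choices, not StrataRouter settings.

```python
import math

# Figures taken from the Quick Reference table.
SINGLE_NODE_RPS = 18_000   # throughput of one 4-core node
UPTIME_TARGET = 0.9999     # 99.99% with HA deployment


def nodes_needed(target_rps: float, headroom: float = 0.3) -> int:
    """Estimate the node count for target_rps.

    `headroom` is the fraction of each node's capacity kept in reserve.
    Assumes linear scaling across nodes (an assumption, not a measurement).
    """
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    usable_rps = SINGLE_NODE_RPS * (1 - headroom)
    return math.ceil(target_rps / usable_rps)


def downtime_budget_minutes(uptime: float, days: float = 365.0) -> float:
    """Allowed downtime in minutes over `days` for a given uptime fraction."""
    return (1 - uptime) * days * 24 * 60


print(nodes_needed(100_000, headroom=0.0))                # 6
print(nodes_needed(100_000))                              # 8
print(round(downtime_budget_minutes(UPTIME_TARGET), 1))   # 52.6
```

Under these assumptions, 100K requests per second needs at least 6 nodes with no reserve capacity, or 8 with 30% headroom, and a 99.99% target leaves roughly 52.6 minutes of downtime per year. Measure your own workload before sizing a cluster from these numbers.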