Skip to content

STRATAROUTER

Production

raman-intel/stratarouter

Production¶

Deploy, operate, and scale StrataRouter in production environments.

Guides¶

Deploy to Docker, Kubernetes, AWS, GCP, or Azure with ready-to-use manifests.

Production-grade configuration: timeouts, concurrency, retry policies, and resource limits.

Prometheus dashboards, OpenTelemetry traces, SLO tracking, and alerting setup.

Performance Tuning

Optimize HNSW parameters, cache hit rates, and thread pool size for your workload.

Horizontal and vertical scaling strategies to handle 10K–100K+ requests per second.

TLS, API key management, network policies, and security hardening best practices.

High Availability

Design for 99.99%+ uptime with active-active deployment, failover, and circuit breakers.

Troubleshooting

Common production issues, diagnostic commands, and resolution playbooks.

Quick Reference¶

Topic	Key Metric
P99 Latency	< 10ms routing, ~50ms with provider
Throughput	18K req/s single node (4 cores)
Cache Hit Rate	85%+ in typical production workloads
Uptime Target	99.99% with HA deployment
Memory (1K routes)	~64MB base footprint