Skip to content

Architecture Overview

Components

  1. Python API Layer
  2. Rust Core (PyO3)
  3. HNSW Index
  4. Hybrid Scorer
  5. Calibration Manager

Performance

  • P99 latency: <10ms
  • Memory: ~64MB
  • Throughput: 18K req/s