Sizing guidance for on-prem deployment profiles.
The sizing values below are planning baselines for integrator workshops. They are indicative only and must be validated in customer pre-production tests. The source of truth for runtime sizing is ../Noxa/docs/system-requirements.md. As of 2026-04-04, no public NOXA benchmark report with guaranteed throughput or latency numbers has been published.
Minimum, recommended, and optimal baselines
Each tier is shown for deployments without local AI and with optional local AI enabled.
Minimum: matches the runtime minimum planning baseline; intended for demo/lab and initial validation.
Recommended: the default production onboarding baseline for standard on-prem workloads.
Optimal: for sustained multi-team loads and greater growth headroom.
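As a planning aid, the tier and local-AI combinations described above can be enumerated so that no variant is missed in a workshop checklist. The following is a minimal sketch only: the names Tier, SizingVariant, and sizing_matrix are hypothetical and not part of NOXA, and no hardware values are included because those come from the system requirements document.

```python
from dataclasses import dataclass
from enum import Enum
from itertools import product


class Tier(Enum):
    # Tier names taken from the section heading; the enum itself is illustrative.
    MINIMUM = "minimum"
    RECOMMENDED = "recommended"
    OPTIMAL = "optimal"


@dataclass(frozen=True)
class SizingVariant:
    tier: Tier
    local_ai: bool  # True when optional local AI is enabled


def sizing_matrix() -> list[SizingVariant]:
    """Return every tier/local-AI combination that needs a sizing entry."""
    return [SizingVariant(tier, ai) for tier, ai in product(Tier, (False, True))]


if __name__ == "__main__":
    for variant in sizing_matrix():
        label = "with local AI" if variant.local_ai else "without local AI"
        print(f"{variant.tier.value}: {label}")
```

Each of the six variants would then be filled in with values from the runtime sizing source of truth and confirmed in pre-production tests.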
How profiles map to sizing tiers
Fast setup for validation and walkthroughs.
Starts from the small-customer baseline and scales up to the recommended production headroom.
Depends on concurrency, compliance controls, and retention policy.
Local model choice is the largest sizing driver.