This reference uses gpt-oss-120b-8-32-64-128k as the example model bundle. You can use any model bundle(s) you like in practice.

Configuration Parameters

gateway

| Parameter | Type | Description |
|---|---|---|
| gateway.replicas | integer | API gateway replica count for high availability |
| gateway.auth.enabled | boolean | Enable built-in OIDC integration |
| gateway.auth.secretName | string | Name of Kubernetes Secret containing OIDC credentials. Leave empty for default auth mode |
| gateway.ingress.hosts[].host | string | Your API FQDN (e.g., api.example.com) |
| gateway.ingress.hosts[].tlsSecretName | string | Kubernetes TLS secret name for the API host |

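
The gateway parameters above could be set as in the following sketch. It assumes the dotted parameter paths map onto nested YAML keys in a values file, as is conventional for Helm-style charts. The Secret names (`gateway-oidc`, `api-example-com-tls`) and the replica count are placeholders chosen for illustration, not defaults of the product.

```yaml
# Illustrative values fragment; all concrete values are placeholders.
gateway:
  replicas: 2                       # more than one replica for high availability
  auth:
    enabled: true                   # enable the built-in OIDC integration
    secretName: gateway-oidc        # hypothetical Secret holding OIDC credentials;
                                    # leave empty for default auth mode
  ingress:
    hosts:
      - host: api.example.com       # your API FQDN
        tlsSecretName: api-example-com-tls   # hypothetical TLS Secret for this host
```

Because `hosts` is a list, each additional API hostname would be another `- host: ...` entry with its own `tlsSecretName`.
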
cloud-ui (Web UI)

| Parameter | Type | Description |
|---|---|---|
| cloud-ui.replicas | integer | UI replica count for high availability |
| cloud-ui.ingress.hosts[].host | string | Your UI FQDN (e.g., ui.example.com) |
| cloud-ui.ingress.hosts[].tlsSecretName | string | Kubernetes TLS secret name for the UI host |

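
A matching sketch for the Web UI, under the same assumption that dotted paths become nested keys. The TLS Secret name `ui-example-com-tls` is hypothetical.

```yaml
# Illustrative values fragment; all concrete values are placeholders.
cloud-ui:
  replicas: 2                       # UI replica count
  ingress:
    hosts:
      - host: ui.example.com        # your UI FQDN
        tlsSecretName: ui-example-com-tls    # hypothetical TLS Secret for this host
```
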
db-admin

| Parameter | Type | Description |
|---|---|---|
| db-admin.admins | list | Email addresses of users who can access the Admin UI |

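
Since `db-admin.admins` is a list, it would be written as a YAML sequence. The addresses below are placeholders, and the nested-key layout is an assumption based on the dotted parameter name.

```yaml
# Illustrative values fragment; the email addresses are placeholders.
db-admin:
  admins:
    - alice@example.com
    - bob@example.com
```
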
auth-and-billing

| Parameter | Type | Description |
|---|---|---|
| auth-and-billing.replicas | integer | Core control-plane service scaling |
| auth-and-billing.pgSecretName | string | Name of Kubernetes Secret containing external PostgreSQL connection details (DB_HOST, DB_DATABASE, DB_USER, DB_PASSWD) as base64-encoded data fields. Required when using external PostgreSQL |

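
One way to create the Secret that `auth-and-billing.pgSecretName` refers to is a manifest like the sketch below. The Secret name `external-pg` and all connection values are hypothetical. The sketch uses the `stringData` field, which accepts plain text; Kubernetes stores those entries base64-encoded under `data` when the Secret is created. If you write a `data` block directly instead, each value must already be base64-encoded.

```yaml
# Illustrative Secret manifest; name and values are placeholders.
apiVersion: v1
kind: Secret
metadata:
  name: external-pg                 # hypothetical; set pgSecretName to this name
type: Opaque
stringData:                         # plain text here; stored base64-encoded in .data
  DB_HOST: postgres.example.com
  DB_DATABASE: appdb
  DB_USER: appuser
  DB_PASSWD: change-me
```

The four keys are the ones listed in the parameter description. Whether the chart expects any further keys is not stated here, so treat this as a starting point.
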
cloudnative-pg

| Parameter | Type | Description |
|---|---|---|
| cloudnative-pg.enabled | boolean | true = deploy in-cluster PostgreSQL; false = use external PostgreSQL via auth-and-billing.pgSecretName |

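
The two database modes described above could look like this in a values file. This is a sketch that assumes nested keys follow the dotted parameter names. `external-pg` is a hypothetical Secret name that you would replace with the name of your own Secret.

```yaml
# Illustrative values fragment: use an external PostgreSQL instance.
cloudnative-pg:
  enabled: false                    # do not deploy in-cluster PostgreSQL

auth-and-billing:
  replicas: 2                       # placeholder replica count
  pgSecretName: external-pg         # hypothetical Secret with DB_HOST, DB_DATABASE,
                                    # DB_USER, DB_PASSWD
```

For the in-cluster mode, the description implies setting `cloudnative-pg.enabled: true`; `pgSecretName` is only documented as required for external PostgreSQL.
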
bundles

| Parameter | Type | Description |
|---|---|---|
| bundles.bundleSpecs[] | list | Declares bundles (model assets) by name |
| bundles.bundleDeploymentSpecs[] | list | Deploys the declared bundles |
| bundleDeploymentSpecs[].name | string | Must match a declared bundleSpecs.name |
| bundleDeploymentSpecs[].groups[].name | string | Routing/capacity group name |
| bundleDeploymentSpecs[].groups[].minReplicas | integer | Minimum engines for the group |
| bundleDeploymentSpecs[].groups[].qosList[] | list | QoS tags (e.g., web, free, pro) |
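
The relationship between declaring and deploying a bundle can be sketched as follows, using the example bundle named at the top of this reference. The group name `default` and the `minReplicas` value are hypothetical, and the sketch shows only the fields listed in the table; real `bundleSpecs` entries may require fields that are not documented here.

```yaml
# Illustrative values fragment; group name and replica count are placeholders.
bundles:
  bundleSpecs:
    - name: gpt-oss-120b-8-32-64-128k        # declare the bundle by name
  bundleDeploymentSpecs:
    - name: gpt-oss-120b-8-32-64-128k        # must match a declared bundleSpecs name
      groups:
        - name: default                      # hypothetical routing/capacity group
          minReplicas: 1                     # minimum engines for this group
          qosList:                           # QoS tags served by this group
            - web
            - free
            - pro
```

Splitting traffic across several groups would mean adding more entries under `groups`, each with its own `minReplicas` and `qosList`.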
