public-docs). The same inference chart can deploy it in-cluster, so operators have local access to the documentation alongside the models. This is useful in air-gapped or restricted environments where the hosted documentation is not reachable.
This configuration is not enabled by default. You can enable it when you first install the chart, or add it later to a running release with helm upgrade.
This page applies to Amazon EKS, OpenShift, and upstream Kubernetes deployments. Follow the tab that matches your platform where the steps differ.
Prerequisites
- A cloud inference deployment, either already installed or in progress. See Install on Amazon EKS, Install on OpenShift, or Install on Kubernetes.
-
The
public-docsimage uploaded to your registry. The image ships in the bundle’s./containers/directory, andupload_images.shuploads it alongside theatlasimage during the install image-upload step. No separate upload is required.
Enable the site
Setdocs.enabled to true in your inference_values.yaml file:
inference_values.yaml
Deployment and Service named inference-public-docs. The docs.image registry falls back to the top-level image.registry, and the image name and tag come pre-set to match the shipped public-docs image, so you do not normally set anything under docs.image.
With no hostname configured, the site is reachable only inside the cluster, at:
Expose the site
To reach the site from outside the cluster, give it a hostname. The docs site reuses the same ingress or Route configuration as the models, so it must share the model exposure method.- Amazon EKS (ALB)
- Kubernetes (Ingress)
- OpenShift (Route)
Exposing the site requires The docs
ingress.enabled: true (the same setting the models use). Set docs.ingressHost:inference_values.yaml
Ingress reuses the shared ingress.className and ingress.annotations, so it joins the same Application Load Balancer through the alb.ingress.kubernetes.io/group.name annotation. TLS terminates at the load balancer with the ACM certificate from the alb.ingress.kubernetes.io/certificate-arn annotation, so that certificate must also be valid for docs.ingressHost. You do not create an in-cluster TLS secret.Apply the change
Set thedocs values in your inference_values.yaml file, then apply them with helm upgrade -i. The -i (--install) flag makes the command idempotent: it installs the release if it does not exist yet, or upgrades it in place if it does. The same command therefore works whether you are enabling the site during the initial install or adding it to a running release. Use the same chart path and flags you used to install. If your install command used --set-file s3.caBundle=... because your S3 backend uses a private CA, include that flag when you run helm upgrade -i on this page:
Verify
Confirm the docs pod is running:/healthz, which you can use for external monitoring.