Overview
For production workloads, Helicone provides a production-ready Helm chart that deploys all services to Kubernetes with:- Horizontal auto-scaling
- High availability
- Resource management
- Service discovery
- Rolling updates
- Health checks and probes
Prerequisites
- Kubernetes 1.24 or later
- Helm 3.8 or later
- kubectl configured to access your cluster
- 16GB+ memory across nodes
- 100GB+ storage (persistent volumes)
Getting the Helm Chart
The Helm chart is available for enterprise customers. Contact us to get access:Get Enterprise Access
Email enterprise@helicone.ai to request the Helm chart and production support
Quick Start
Once you have access to the Helm chart:Architecture on Kubernetes
Helicone deploys the following workloads:Configuration Reference
Global Settings
Authentication
PostgreSQL (Application Database)
ClickHouse (Analytics Database)
MinIO (Object Storage)
Jawn (Backend API)
Web (Frontend)
Ingress
Using External Managed Services
For production, we recommend using managed services:AWS Example
Monitoring and Observability
The Helm chart includes Prometheus metrics and health checks:Available Metrics
- Request latency (p50, p95, p99)
- Request volume
- Error rates
- Database connection pools
- Cache hit rates
Backup and Disaster Recovery
PostgreSQL Backups
ClickHouse Backups
Scaling
Manual Scaling
Auto-Scaling
HPA (Horizontal Pod Autoscaler) is configured in values.yaml:Upgrading
Troubleshooting
Pods not starting
Pods not starting
Check pod events and logs:
Database connection errors
Database connection errors
Verify database connectivity:
PVC mounting issues
PVC mounting issues
Check storage class and PVC status:
Production Checklist
Security
- Changed all default passwords
- Configured TLS/SSL certificates
- Set up network policies
- Enabled pod security policies
- Configured RBAC
High Availability
- Multiple replicas for stateless services
- Database replication configured
- Anti-affinity rules set
- PodDisruptionBudgets configured
Monitoring
- Prometheus metrics enabled
- Grafana dashboards imported
- Alerts configured
- Log aggregation set up
Next Steps
Architecture
Understand the system architecture
Enterprise Support
Get help with your production deployment
