Architecture Overview
Helicone is comprised of five core services that work together to provide AI Gateway and observability capabilities:Core Components
- Worker
- Jawn
- Web
- Data Layer
Worker (Cloudflare Workers)
The Worker is the edge proxy that intercepts and routes all LLM requests. It’s deployed globally on Cloudflare’s edge network for minimal latency.Key responsibilities:- Route requests to 100+ LLM providers
- Apply intelligent fallbacks when providers fail
- Enforce rate limits and caching rules
- Log request metadata to Jawn
- Add <50ms overhead on average
/workerHow it works:- Hosted on Cloudflare Workers (cloud)
- Runs locally with
wrangler dev(self-hosted)
Request Flow
Let’s trace a complete request through the system:Worker routes to provider
The Worker (Cloudflare edge):
- Authenticates the request
- Determines target provider (OpenAI)
- Applies any caching or rate limit rules
- Forwards request to OpenAI
- Receives response
Provider responds
OpenAI processes the request and returns the completion.The Worker:
- Calculates cost and token usage
- Streams response back to user (if streaming)
- Logs request metadata to Jawn
Jawn stores data
Jawn receives the log and:
- Stores metadata in Supabase (request ID, user, timestamp)
- Writes metrics to ClickHouse (cost, tokens, latency)
- Uploads request/response to MinIO (full bodies)
Key Features Enabled by Architecture
Edge Routing
Worker on Cloudflare edge means:
- <50ms latency overhead
- Global availability
- Automatic failover
- DDoS protection
Scalable Analytics
ClickHouse columnar storage:
- Query millions of requests
- Real-time aggregations
- Cost-effective at scale
- Sub-second dashboard loads
Flexible Storage
Tiered storage approach:
- Hot data in Supabase
- Analytics in ClickHouse
- Bodies in S3/MinIO
- Optimized costs
Multi-Provider
Worker intelligence:
- 100+ provider integrations
- Automatic fallbacks
- Smart load balancing
- Unified observability
Self-Hosting Options
Helicone is fully open source and can be self-hosted in multiple ways:- Docker (Easiest)
- Docker Compose
- Kubernetes/Helm
All-in-One Docker Container
Run everything in a single container for testing or small deployments:3000- Web dashboard8585- Jawn API + LLM proxy9080- MinIO S3 storage
- Local development
- Testing integrations
- Small teams
Development Setup
Want to contribute or customize Helicone? Here’s the local development setup:Requirements
Requirements
- Docker - For infrastructure (Postgres, ClickHouse, MinIO)
- Node.js 20+ - Use nvm to manage versions
- Yarn - Package manager
- Supabase CLI - For database management
- Wrangler - For Cloudflare Worker development
Quick Start
Quick Start
Project Structure
Project Structure
Security & Compliance
SOC 2 Compliant
Type II certified for cloud hosting. Enterprise-grade security controls and annual audits.
GDPR Compliant
Full GDPR compliance with data residency options and user data controls.
Data Encryption
- TLS 1.3 in transit
- AES-256 at rest
- Encrypted backups
Data Ownership
- You own your data
- Export anytime via API
- Self-host for complete control
Performance Characteristics
- Latency
- Scalability
- Availability
Edge routing keeps overhead minimal:
- p50: <25ms added latency
- p95: <50ms added latency
- p99: <100ms added latency
What’s Next?
Now that you understand how Helicone works, explore the features:Sessions & Agent Debugging
Track multi-step AI workflows with session trees
Gateway Fallbacks
Configure automatic failover when providers go down
Prompt Management
Version control and deploy prompts without code
Cost Tracking
Understand your LLM economics by user or feature
