## Quick Start

Get started with the Helicone AI Gateway in three simple steps:

### Update Your Code

Change your `baseURL` to point to Helicone's AI Gateway:

```typescript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});
```

### Make Requests

Use any model from 20+ providers:

```typescript
const response = await client.chat.completions.create({
  model: "gpt-4o-mini", // or claude-sonnet-4, gemini-2.0-flash, etc.
  messages: [{ role: "user", content: "Hello!" }],
});
```
## Examples by Language
### TypeScript / Node.js

**OpenAI SDK**

```typescript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});

async function main() {
  const response = await client.chat.completions.create({
    model: "gpt-4o-mini",
    messages: [
      { role: "user", content: "What is the capital of France?" },
    ],
  });
  console.log(response.choices[0].message.content);
}

main();
```
**Anthropic SDK**

You can also use the Anthropic SDK directly, with no format mapping:

```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic({
  apiKey: `Bearer ${process.env.HELICONE_API_KEY}`,
  baseURL: "https://ai-gateway.helicone.ai",
  defaultHeaders: {
    "Helicone-Gateway-Body-Mapping": "NO_MAPPING",
  },
});

const response = await client.messages.create({
  model: "claude-3-7-sonnet-20250219/anthropic",
  max_tokens: 100,
  messages: [{ role: "user", content: "Hello!" }],
});
```
### Python

**OpenAI SDK**

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://ai-gateway.helicone.ai",
    api_key=os.getenv("HELICONE_API_KEY"),
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "user", "content": "What is the capital of France?"},
    ],
)
print(response.choices[0].message.content)
```
**Anthropic SDK**

```python
import os

from anthropic import Anthropic

client = Anthropic(
    api_key=f"Bearer {os.getenv('HELICONE_API_KEY')}",
    base_url="https://ai-gateway.helicone.ai",
    default_headers={
        "Helicone-Gateway-Body-Mapping": "NO_MAPPING",
    },
)

response = client.messages.create(
    model="claude-3-7-sonnet-20250219/anthropic",
    max_tokens=100,
    messages=[{"role": "user", "content": "Hello!"}],
)
```
### cURL

**Basic Request**

```shell
curl https://ai-gateway.helicone.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HELICONE_API_KEY" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "What is the capital of France?"}
    ]
  }'
```
**With Fallback**

```shell
curl https://ai-gateway.helicone.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HELICONE_API_KEY" \
  -d '{
    "model": "gpt-4o/openai,gpt-4o/azure",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
```
## Authentication Methods

### Pass-Through Billing (PTB)

Use Helicone's API key and billing for all providers:

```typescript
const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});
```

Requirements:

- Helicone API key
- Credits added to your account (add credits)

Benefits:

- Single API key for all providers
- Consolidated billing
- No provider setup needed
### Bring Your Own Key (BYOK)

Use your own provider API keys with Helicone:

**Add Provider Keys**

Go to Settings → API Keys and add your provider keys:

- OpenAI API key
- Anthropic API key
- Google API key
- AWS credentials (for Bedrock)
- Other provider keys

**Use Helicone API Key**

Your requests automatically use BYOK when available:

```typescript
const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});
```

Benefits:

- Use existing provider credits
- Direct provider billing
- BYOK attempts prioritized over PTB
- Automatic fallback to PTB if BYOK fails

The AI Gateway automatically prioritizes BYOK over PTB attempts. Learn more in Routing.
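The BYOK-over-PTB ordering described above can be sketched as a small pure function. This is an illustrative model only — `buildAttemptOrder`, `Attempt`, and the provider names are hypothetical, not part of Helicone's gateway internals:

```typescript
// Hypothetical sketch of the attempt ordering described above:
// BYOK providers are tried first, then Helicone's pass-through
// billing (PTB) for each candidate provider.
type Attempt = { provider: string; auth: "byok" | "ptb" };

function buildAttemptOrder(
  candidateProviders: string[],
  byokConfigured: Set<string>
): Attempt[] {
  // Attempts using your own provider keys come first...
  const byok = candidateProviders
    .filter((p) => byokConfigured.has(p))
    .map((p): Attempt => ({ provider: p, auth: "byok" }));
  // ...then the same providers via Helicone's PTB credits.
  const ptb = candidateProviders.map(
    (p): Attempt => ({ provider: p, auth: "ptb" })
  );
  return [...byok, ...ptb];
}

// With a BYOK key configured only for Azure:
const order = buildAttemptOrder(["openai", "azure"], new Set(["azure"]));
// order[0] is azure via BYOK; PTB attempts follow as fallback
```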
## Model Naming

The AI Gateway supports multiple model naming formats: model only, model with provider, with fallbacks, and with provider exclusions.
**Model Only**

Specify just the model name for automatic provider routing:

```typescript
model: "gpt-4o-mini" // routes to the best available provider
```

The gateway automatically:

- Tries BYOK providers first
- Falls back to PTB if needed
- Selects based on cost and availability
**Model with Provider**

Specify both model and provider for explicit routing:

```typescript
model: "gpt-4o/openai"              // OpenAI directly
model: "gpt-4o/azure"               // Azure OpenAI
model: "claude-sonnet-4/anthropic"  // Anthropic direct
model: "claude-sonnet-4/bedrock"    // AWS Bedrock
```
**With Fallbacks**

Use comma-separated models for automatic fallback:

```typescript
// Try Bedrock first, then fall back to Anthropic
model: "claude-3-7-sonnet-20250219/bedrock,claude-3-7-sonnet-20250219/anthropic"

// Try OpenAI, then Azure, then DeepInfra
model: "gpt-4o/openai,gpt-4o/azure,gpt-4o/deepinfra"
```

Learn more in Fallbacks.

**With Provider Exclusions**

Exclude specific providers from automatic routing:

```typescript
// Use any provider except Anthropic
model: "!anthropic,gpt-4o-mini"

// Exclude multiple providers
model: "!anthropic,!deepinfra,gpt-4o-mini"
```

The `!` prefix excludes providers globally for all models in the request.
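To make the model-string grammar above concrete, here is a small hypothetical parser. `parseModelString` and `ModelSpec` are purely illustrative and not part of any Helicone SDK — the gateway does this parsing server-side:

```typescript
// Illustrative parser for the model-string grammar described above:
// comma-separated candidates, optional "/provider" suffixes, and
// "!" prefixes for global provider exclusions.
type ModelSpec = { model: string; provider?: string };

function parseModelString(input: string): {
  candidates: ModelSpec[];
  excludedProviders: string[];
} {
  const candidates: ModelSpec[] = [];
  const excludedProviders: string[] = [];
  for (const part of input.split(",")) {
    if (part.startsWith("!")) {
      excludedProviders.push(part.slice(1)); // "!anthropic" excludes globally
    } else {
      const [model, provider] = part.split("/"); // "gpt-4o/azure" pins a provider
      candidates.push(provider ? { model, provider } : { model });
    }
  }
  return { candidates, excludedProviders };
}

const spec = parseModelString("!anthropic,!deepinfra,gpt-4o-mini");
console.log(spec.excludedProviders); // ["anthropic", "deepinfra"]
console.log(spec.candidates);        // [{ model: "gpt-4o-mini" }]
```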
## Discovering Models

Use the /v1/models endpoint to discover available models:

```shell
curl https://ai-gateway.helicone.ai/v1/models \
  -H "Authorization: Bearer $HELICONE_API_KEY"
```

```json
{
  "object": "list",
  "data": [
    {
      "id": "gpt-4o-mini",
      "object": "model",
      "created": 1716537600,
      "owned_by": "openai"
    },
    {
      "id": "claude-sonnet-4",
      "object": "model",
      "created": 1708560000,
      "owned_by": "anthropic"
    }
  ]
}
```

Or browse all models at helicone.ai/models.
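Once fetched, the response can be filtered client-side. The sketch below assumes the response shape shown above; `modelsByOwner` is a hypothetical helper, not a gateway API:

```typescript
// Illustrative helper: filter a /v1/models response by provider.
// The Model type mirrors the JSON shape shown above.
type Model = { id: string; object: string; created: number; owned_by: string };

function modelsByOwner(data: Model[], owner: string): string[] {
  return data.filter((m) => m.owned_by === owner).map((m) => m.id);
}

// Using the example response body from above:
const data: Model[] = [
  { id: "gpt-4o-mini", object: "model", created: 1716537600, owned_by: "openai" },
  { id: "claude-sonnet-4", object: "model", created: 1708560000, owned_by: "anthropic" },
];
console.log(modelsByOwner(data, "openai")); // ["gpt-4o-mini"]
```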
## Headers

Customize gateway behavior with optional headers:

**Helicone-Gateway-Body-Mapping**

Control request/response format mapping:

- `OPENAI`: map to the OpenAI format (default)
- `RESPONSES`: use the OpenAI Responses API format
- `NO_MAPPING`: pass through the native provider format

A separate header associates requests with a Stripe customer for metering.

```typescript
const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
  defaultHeaders: {
    "Helicone-Gateway-Body-Mapping": "NO_MAPPING",
  },
});
```
## Streaming Responses

Streaming works the same as with native providers:

```typescript
const stream = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Tell me a story" }],
  stream: true,
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || "");
}
```
## Error Handling

The gateway returns standard OpenAI-compatible error responses:

```typescript
try {
  const response = await client.chat.completions.create({
    model: "gpt-4o-mini",
    messages: [{ role: "user", content: "Hello!" }],
  });
} catch (error) {
  if (error.status === 401) {
    console.error("Invalid API key");
  } else if (error.status === 429) {
    console.error("Rate limit exceeded or insufficient credits");
  } else if (error.status === 400) {
    console.error("Bad request:", error.message);
  }
}
```

Enable fallbacks to automatically retry with different providers on errors. See Fallbacks.
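Besides the gateway-side fallbacks mentioned above, a client can also retry with a comma-separated fallback model string when it sees a retryable error. This is a hedged sketch: `withFallback` and the stand-in request function are hypothetical, not Helicone or OpenAI SDK APIs:

```typescript
// Hypothetical client-side retry: on a retryable error, re-issue the
// request with a comma-separated fallback model string so the gateway
// can try other providers.
async function withFallback<T>(
  createCompletion: (model: string) => Promise<T>,
  primary: string,
  fallback: string
): Promise<T> {
  try {
    return await createCompletion(primary);
  } catch (error: any) {
    if (error.status === 429 || error.status >= 500) {
      return await createCompletion(fallback); // retryable: try fallback chain
    }
    throw error; // non-retryable (e.g. 400/401): surface to the caller
  }
}

// Stand-in for client.chat.completions.create that fails once with 429:
let calls = 0;
const fake = async (model: string) => {
  calls += 1;
  if (calls === 1) throw { status: 429 };
  return `ok via ${model}`;
};

withFallback(fake, "gpt-4o/openai", "gpt-4o/azure,gpt-4o/deepinfra")
  .then(console.log); // "ok via gpt-4o/azure,gpt-4o/deepinfra"
```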
## Next Steps

- **Routing**: learn how requests are routed across providers
- **Fallbacks**: configure automatic failover for reliability
- **Prompt Integration**: use prompts stored in Helicone
- **Integrations**: integrate with frameworks and tools