Deploy prompt versions to different environments (production, staging, custom) and access them through the @helicone/prompts SDK or directly via the Helicone AI Gateway.

Deployment Environments

Environments let you deploy different prompt versions for different use cases:

Production

The live version that serves real user traffic. Deploy stable, tested versions here.

Staging

Pre-production testing environment. Validate changes before going live.

Custom

Create any environment: dev, qa, canary, feature-branch, etc.

Environment Properties

  • One version can be deployed to multiple environments
  • Multiple versions can be deployed to different environments simultaneously
  • Default environment is production
  • Environment names are case-sensitive
  • No predefined environment limit: create as many as needed

Deploying in the UI

  1. Select a version: Open your prompt and navigate to the Versions tab, then select the version you want to deploy.
  2. Choose environment: Click Deploy, then select an existing environment (production, staging, etc.) or enter a name to create a new one.
  3. Confirm deployment: Click Deploy to [environment] to activate the version.

The version is immediately available in your code as soon as you access that environment.

Deploying via API

Deploy to Environment

curl -X POST https://api.helicone.ai/v1/prompt-2025/update/environment \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "promptId": "abc123",
    "promptVersionId": "5d4ec7d7-5725-46c2-ad26-41ddf6287527",
    "environment": "production"
  }'

Remove from Environment

curl -X POST https://api.helicone.ai/v1/prompt-2025/remove/environment \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "promptId": "abc123",
    "promptVersionId": "5d4ec7d7-5725-46c2-ad26-41ddf6287527",
    "environment": "staging"
  }'
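The same two endpoints can be scripted from application code. A minimal fetch-based sketch, assuming the endpoint shapes shown in the curl examples above (the helper names are ours, not part of any Helicone SDK):

```typescript
// Hypothetical wrapper around the deployment endpoints shown above.
// Endpoint paths and payload fields come from the curl examples;
// the function names are illustrative.

interface DeploymentRequest {
  promptId: string;
  promptVersionId: string;
  environment: string;
}

// Pure helper: build the URL and JSON body for a deploy or remove call.
function buildDeploymentRequest(
  action: "update" | "remove",
  req: DeploymentRequest
): { url: string; body: string } {
  return {
    url: `https://api.helicone.ai/v1/prompt-2025/${action}/environment`,
    body: JSON.stringify(req),
  };
}

async function setEnvironment(
  apiKey: string,
  action: "update" | "remove",
  req: DeploymentRequest
): Promise<void> {
  const { url, body } = buildDeploymentRequest(action, req);
  const res = await fetch(url, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body,
  });
  if (!res.ok) {
    throw new Error(`Environment ${action} failed: ${res.status}`);
  }
}
```

For example, `setEnvironment(apiKey, "update", { promptId: "abc123", promptVersionId: "...", environment: "production" })` mirrors the first curl call.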

Using Deployed Prompts

SDK Integration

The @helicone/prompts SDK is the recommended way to use prompts in your application.

Installation

npm install @helicone/prompts

Basic Usage

import { HeliconePromptManager } from "@helicone/prompts";
import { OpenAI } from "openai";

const promptManager = new HeliconePromptManager({
  apiKey: process.env.HELICONE_API_KEY!,
  baseUrl: "https://api.helicone.ai", // Optional, defaults to this
});

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY!,
});

// Fetch the production version
const { body, errors } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "production", // Optional, defaults to production
  inputs: {
    userName: "Alice",
    topic: "machine learning",
  },
});

if (errors.length > 0) {
  console.error("Variable validation errors:", errors);
}

// Use with OpenAI
const response = await openai.chat.completions.create(body);

Environment Selection

// Production (default)
const prod = await promptManager.getPromptBody({
  prompt_id: "abc123",
  inputs: { userName: "Alice" },
});

// Staging
const staging = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "staging",
  inputs: { userName: "Alice" },
});

// Custom environment
const custom = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "feature-branch-123",
  inputs: { userName: "Alice" },
});

Specific Version

Bypass environment deployment and use a specific version directly:

const { body } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  version_id: "5d4ec7d7-5725-46c2-ad26-41ddf6287527",
  inputs: { userName: "Alice" },
});

AI Gateway Integration

Use prompts directly through the Helicone AI Gateway without the SDK.

Setup

import { OpenAI } from "openai";

const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});

Using Prompts

const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  // Prompt reference
  prompt_id: "abc123",
  environment: "production", // Optional
  inputs: {
    userName: "Alice",
    topic: "AI safety",
  },
  // Optional: Add additional messages
  messages: [
    { role: "user", content: "Follow-up question here" },
  ],
});

The gateway:
  1. Fetches the prompt from the specified environment
  2. Substitutes variables with your inputs
  3. Merges with any additional messages you provide
  4. Routes to the appropriate LLM provider
  5. Returns the response
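The merge in step 3 follows the same rule as the SDK: prompt messages come first, then any messages supplied in the request are appended. A simplified sketch of that ordering (illustrative only, not the gateway's actual implementation):

```typescript
// Illustrative only: the gateway performs this merge server-side.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Prompt messages (variables already substituted) come first,
// followed by any messages supplied in the request body.
function mergeMessages(
  promptMessages: ChatMessage[],
  requestMessages: ChatMessage[] = []
): ChatMessage[] {
  return [...promptMessages, ...requestMessages];
}
```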

Variables and Inputs

Prompts can include typed variables using the syntax {{hc:variableName:type}}.

Supported Types

  • string: Text values
  • number: Numeric values
  • boolean: True/false values

Example Prompt

You are a helpful assistant for {{hc:companyName:string}}.

User {{hc:userName:string}} has asked about {{hc:topic:string}}.
Their premium status is {{hc:isPremium:boolean}}.

Providing Inputs

const { body, errors } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "production",
  inputs: {
    companyName: "Acme Corp",
    userName: "Alice",
    topic: "billing",
    isPremium: true,
  },
});
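For intuition, the substitution the SDK performs is essentially a typed template fill. A simplified sketch of the {{hc:variableName:type}} replacement (the real logic, including type validation, lives in @helicone/prompts; this omits validation entirely):

```typescript
// Simplified illustration of {{hc:variableName:type}} substitution.
// Not the SDK's implementation; type checking is omitted here.
function substituteVariables(
  template: string,
  inputs: Record<string, string | number | boolean>
): string {
  return template.replace(
    /\{\{hc:(\w+):(?:string|number|boolean)\}\}/g,
    // Replace known variables; leave unknown ones untouched so a
    // missing input is visible rather than silently blanked.
    (match, name) => (name in inputs ? String(inputs[name]) : match)
  );
}
```

With the example prompt above, `substituteVariables("You are a helpful assistant for {{hc:companyName:string}}.", { companyName: "Acme Corp" })` yields "You are a helpful assistant for Acme Corp."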

Type Validation

The SDK validates input types at runtime:

const { body, errors } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  inputs: {
    userName: "Alice",
    isPremium: "yes", // Error: expected boolean
  },
});

if (errors.length > 0) {
  console.error("Validation errors:", errors);
  // [
  //   {
  //     variable: "isPremium",
  //     expected: "boolean",
  //     value: "yes"
  //   }
  // ]
}

Advanced Usage

Merging Messages

Append additional messages to the prompt:

const { body } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "production",
  inputs: { userName: "Alice" },
  messages: [
    {
      role: "user",
      content: "What are the latest updates?",
    },
  ],
});

// Resulting body contains:
// - Prompt messages with variables substituted
// - Appended user message

Overriding Parameters

Override prompt parameters at runtime:

const { body } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "production",
  inputs: { userName: "Alice" },
  temperature: 0.9, // Override default
  max_tokens: 500,  // Override default
});

Streaming Responses

import { OpenAI } from "openai";

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY!,
});

const { body } = await promptManager.getPromptBody({
  prompt_id: "abc123",
  environment: "production",
  inputs: { userName: "Alice" },
});

const stream = await openai.chat.completions.create({
  ...body,
  stream: true,
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content || "");
}

Deployment Strategies

Blue-Green Deployment

  1. Deploy new version to staging
  2. Test thoroughly
  3. Deploy to production
  4. Monitor metrics
  5. Rollback if needed (redeploy previous version)

Canary Deployment

  1. Deploy new version to canary environment
  2. Route 5-10% of traffic to canary
  3. Monitor error rates and quality
  4. Gradually increase traffic
  5. Promote to production when confident
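Helicone does not split traffic for you: the split in step 2 happens in your application when it decides which environment to request. A minimal sketch, assuming a canary environment named "canary" and a configurable traffic fraction:

```typescript
// Application-layer traffic split: route a small fraction of requests
// to the canary environment. The "canary" name and the fraction are
// examples; any environment name works.
function pickEnvironment(
  canaryFraction: number,
  rand: () => number = Math.random
): "canary" | "production" {
  return rand() < canaryFraction ? "canary" : "production";
}

// Usage with the SDK (promptManager as in "Basic Usage"):
// const { body } = await promptManager.getPromptBody({
//   prompt_id: "abc123",
//   environment: pickEnvironment(0.05), // ~5% of traffic to canary
//   inputs: { userName: "Alice" },
// });
```

Injecting the random source makes the split deterministic in tests; increase the fraction gradually as canary metrics hold up.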

Feature Branch Testing

  1. Create environment named after feature branch: feature-new-tone
  2. Deploy experimental version
  3. Test in isolation
  4. Promote to staging when ready

Monitoring Deployments

Track prompt usage and performance in the Helicone dashboard:
  • Requests per version: See which versions are active
  • Environment usage: Monitor traffic by environment
  • Error rates: Detect issues with new deployments
  • Cost per version: Track spending changes
  • Latency: Measure performance impact

Best Practices

Test in staging first: Always validate in non-production before deploying to production
Use semantic versioning: Major versions for breaking changes, minor for improvements
Monitor after deployment: Watch error rates and user feedback for 24-48 hours
Keep environment names consistent: Use standardized names across projects
Avoid frequent production changes: Deploy to production intentionally, not constantly
Don’t delete deployed versions: Keep versions that are or were in production for rollback

Troubleshooting

Version Not Found

If the SDK returns “version not found”:
  • Verify the prompt ID is correct
  • Check that the version is deployed to the specified environment
  • Ensure your API key has access to the organization

Variable Validation Errors

If you receive validation errors:
  • Check that all required variables have inputs
  • Verify input types match variable declarations
  • Review the errors array for specific issues

Gateway Integration Issues

If prompts aren’t working through the gateway:
  • Confirm baseURL is set to https://ai-gateway.helicone.ai
  • Verify prompt_id is correct
  • Check that the environment exists and has a deployed version

Next Steps

Experiments

Compare prompt versions with A/B testing

Gateway

Learn more about the Helicone AI Gateway