OpenAI via Nolma

Setup

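import os

import openai

# Route OpenAI SDK traffic through the Nolma gateway.
client = openai.OpenAI(
    api_key=os.environ["OPENAI_API_KEY"],
    base_url="https://gateway.nolma.ai/openai",
    default_headers={
        "NM-Key": "nm_live_abc123",  # placeholder; use your own Nolma key
        "NM-Agent": "my-agent",
    },
)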
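
The setup above writes the Nolma key directly into the headers. A minimal sketch of reading it from the environment instead is shown here. The variable names `NOLMA_KEY` and `NOLMA_AGENT` and the helper `nolma_client_kwargs` are hypothetical names chosen for this example, not part of Nolma's documented configuration.

```python
import os


def nolma_client_kwargs(env=os.environ):
    """Build keyword arguments for openai.OpenAI from environment variables.

    NOLMA_KEY and NOLMA_AGENT are assumed names chosen for this sketch.
    """
    return {
        "api_key": env["OPENAI_API_KEY"],
        "base_url": "https://gateway.nolma.ai/openai",
        "default_headers": {
            # A missing key raises KeyError instead of sending an empty header.
            "NM-Key": env["NOLMA_KEY"],
            "NM-Agent": env.get("NOLMA_AGENT", "my-agent"),
        },
    }
```

With the `openai` package installed, the result can be passed straight through as `openai.OpenAI(**nolma_client_kwargs())`.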

Supported models

Model        Tier      Cost (input / output per 1M tokens)
gpt-4o       Premium   $2.50 / $10.00
gpt-4o-mini  Standard  $0.15 / $0.60
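
As a rough illustration of how the per-1M-token prices translate into a per-request cost, the sketch below multiplies token counts by the rates listed above. The `estimate_cost` helper and the `PRICES_PER_MILLION` table are hypothetical names for this example, and the figures are assumed to be in USD.

```python
# Prices per 1M tokens as (input, output), copied from the table above.
# Assumption: the amounts are in USD.
PRICES_PER_MILLION = {
    "gpt-4o": (2.50, 10.00),
    "gpt-4o-mini": (0.15, 0.60),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Return the estimated cost of one request for the given token counts."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# 10,000 input tokens and 2,000 output tokens on gpt-4o-mini.
print(estimate_cost("gpt-4o-mini", 10_000, 2_000))
```

For that request the estimate comes to about 0.0027, since 10,000 × 0.15 plus 2,000 × 0.60 is 2,700, divided by one million. Any gateway fees or discounts are not modeled here.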

Streaming

stream = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello"}],
    stream=True
)

for chunk in stream:
    # Some chunks carry no choices or an empty delta; skip them.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
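
When the full reply is needed as a single string rather than printed piece by piece, the deltas can be accumulated. The sketch below assumes only that each chunk exposes `choices[0].delta.content` as in the loop above. `collect_stream` and `fake_chunk` are hypothetical helpers, and the fake chunks stand in for SDK objects so the example runs without a network call.

```python
from types import SimpleNamespace


def collect_stream(stream):
    """Concatenate the text deltas from an iterable of chat completion chunks."""
    parts = []
    for chunk in stream:
        # Skip chunks with no choices or an empty delta.
        if chunk.choices and chunk.choices[0].delta.content:
            parts.append(chunk.choices[0].delta.content)
    return "".join(parts)


def fake_chunk(text):
    """Hypothetical stand-in with the same attribute shape as an SDK chunk."""
    delta = SimpleNamespace(content=text)
    return SimpleNamespace(choices=[SimpleNamespace(delta=delta)])


demo = [fake_chunk("Hel"), fake_chunk(None), fake_chunk("lo")]
print(collect_stream(demo))  # prints "Hello"
```

With a real client, the `stream` object returned by `client.chat.completions.create(..., stream=True)` would be passed in place of `demo`.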

Node.js

import OpenAI from 'openai'

const client = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: 'https://gateway.nolma.ai/openai',
  defaultHeaders: {
    'NM-Key': 'nm_live_abc123',
    'NM-Agent': 'my-agent',
  }
})