Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.nolma.ai/llms.txt

Use this file to discover all available pages before exploring further.

Quickstart

1

Get your API key

Sign up at nolma.ai and copy your API key from Settings → API Keys.Your key looks like: nm_live_abc123...
2

Change one URL

Find where you create your LLM client and change the base_url:
import anthropic

client = anthropic.Anthropic(
    api_key="your-anthropic-key",
    base_url="https://gateway.nolma.ai/anthropic",
    default_headers={
        "NM-Key": "nm_live_abc123",
        "NM-Agent": "my-agent",
    }
)

# All existing calls work unchanged
response = client.messages.create(
    model="claude-haiku-4-5-20251001",
    max_tokens=100,
    messages=[{"role": "user", "content": "Hello"}]
)
3

Open your dashboard

Go to nolma.ai/dashboard/guard.Within 30 seconds of making your first call you will see:
  • Session created
  • Token count
  • Cost in USD
  • Agent name
That’s it. You’re integrated.
4

Set a budget (optional)

Prevent surprise bills by setting a hard limit:Dashboard → Guard → Budgets → Add rule
FieldValue
ScopeAgent
Agentmy-agent
Limit$10.00
PeriodDaily
EnforcementHard block
When the limit is hit Nolma returns HTTP 429 before the LLM call fires. Zero cost incurred.

What’s next?

Install the Python SDK

Session tracking and signal collection

Set up alerts

Slack and email notifications

Collect signals

Track user acceptance rates

Budget templates

Pre-built budget configurations