Documentation Index
Fetch the complete documentation index at: https://docs.nolma.ai/llms.txt
Use this file to discover all available pages before exploring further.
What is Nolma?
Nolma is an AI gateway platform that sits between your agents and every LLM provider. One URL change gives you:
- Real-time cost tracking per agent, model, user, and session
- Hard budget enforcement that stops spend before the LLM call fires
- Prompt intelligence — track what users do with AI outputs and get recommendations to cut costs
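
To make the "one URL change" idea concrete, here is a minimal sketch of rewriting a provider URL so the request goes to a gateway instead. The gateway origin `https://gateway.example.com` and the helper `gateway_url` are hypothetical, and the path layout (provider prefix followed by the provider's own path) is an assumption based on the "URL prefix" column in the provider table, not confirmed Nolma behavior.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical gateway origin: this page does not state the real Nolma endpoint.
GATEWAY_BASE = "https://gateway.example.com"


def gateway_url(provider_prefix: str, provider_url: str) -> str:
    """Keep the provider's path and query, but point the request at the gateway.

    Assumes the gateway expects: <gateway origin>/<provider prefix><provider path>.
    """
    provider = urlsplit(provider_url)
    gateway = urlsplit(GATEWAY_BASE)
    # Normalize the prefix so "/openai", "openai", and "openai/" behave the same.
    path = "/" + provider_prefix.strip("/") + provider.path
    return urlunsplit((gateway.scheme, gateway.netloc, path, provider.query, ""))


print(gateway_url("/openai", "https://api.openai.com/v1/chat/completions"))
# -> https://gateway.example.com/openai/v1/chat/completions
```

In practice the same effect usually comes from setting the base URL option of whichever client library you already use, so the rest of the agent code stays unchanged.
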
Quickstart: Integrate in 5 minutes
Python SDK: pip install nolma
Node.js SDK: npm install @nolma/node
API Reference: Full API docs
How it works
Supported providers
| Provider | URL prefix | Models |
|---|---|---|
| OpenAI | /openai | gpt-4o, gpt-4o-mini |
| Anthropic | /anthropic | claude-sonnet-4-6, claude-haiku-4-5 |
| Google Gemini | /gemini | gemini-2.5-pro, gemini-2.0-flash |
| Groq | /groq | llama-3.3-70b |
| Mistral | /mistral | mistral-small-latest |
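
The table above can be carried into code as a simple lookup from model name to URL prefix. This is an illustrative sketch only: `PROVIDER_MODELS` and `prefix_for_model` are hypothetical names, not part of the Nolma SDK, and the data is transcribed from the table as listed here.

```python
# Prefixes and models transcribed from the table above.
PROVIDER_MODELS = {
    "/openai": ["gpt-4o", "gpt-4o-mini"],
    "/anthropic": ["claude-sonnet-4-6", "claude-haiku-4-5"],
    "/gemini": ["gemini-2.5-pro", "gemini-2.0-flash"],
    "/groq": ["llama-3.3-70b"],
    "/mistral": ["mistral-small-latest"],
}


def prefix_for_model(model: str) -> str:
    """Return the URL prefix whose table row lists the given model."""
    for prefix, models in PROVIDER_MODELS.items():
        if model in models:
            return prefix
    raise ValueError(f"model not listed in the provider table: {model}")


print(prefix_for_model("claude-haiku-4-5"))  # -> /anthropic
```

Raising on an unknown model, rather than guessing a provider from the name, keeps a typo from silently routing a request to the wrong prefix.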