# API Documentation

An OpenAI-compatible chat completion LLM API to easily integrate AI into your applications.
## Quick Start
All Mammouth subscribers have some credits included.

| Plan | Starter | Standard | Expert |
|---|---|---|---|
| Monthly credits | $2 | $4 | $10 |

➡️ Get your API key.
### With the Mammouth API directly

Generates a chat completion response based on your prompt.
```python
import requests

url = "https://api.mammouth.ai/v1/chat/completions"
headers = {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json",
}
data = {
    "model": "gpt-4.1",
    "messages": [
        {
            "role": "user",
            "content": "Explain the basics of machine learning",
        }
    ],
}

response = requests.post(url, headers=headers, json=data)
print(response.json())
```
```javascript
// On Node 18+, fetch is available globally and node-fetch is unnecessary.
const fetch = require("node-fetch");

async function callMammouth() {
  const url = "https://api.mammouth.ai/v1/chat/completions";
  const headers = {
    Authorization: "Bearer YOUR_API_KEY",
    "Content-Type": "application/json",
  };
  const data = {
    model: "gpt-4.1",
    messages: [
      {
        role: "user",
        content: "Create an example JavaScript function",
      },
    ],
  };

  try {
    const response = await fetch(url, {
      method: "POST",
      headers: headers,
      body: JSON.stringify(data),
    });
    const result = await response.json();
    console.log(result.choices[0].message.content);
  } catch (error) {
    console.error("Error:", error);
  }
}

callMammouth();
```
```bash
curl -X POST https://api.mammouth.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4.1",
    "messages": [
      {
        "role": "user",
        "content": "Hello, how are you doing?"
      }
    ]
  }'
```
➡️ Get your API key in your settings.
### With the OpenAI Library

```python
from openai import OpenAI

# Configure the client to use Mammouth.ai (openai>=1.0 interface;
# the legacy openai.api_base style was removed in v1.0)
client = OpenAI(
    base_url="https://api.mammouth.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="gpt-4.1",
    messages=[
        {"role": "user", "content": "What are the benefits of renewable energy?"}
    ],
)
print(response.choices[0].message.content)
```
## Response Format

### Successful Response

```json
{
  "id": "chatcmpl-123",
  "object": "chat.completion",
  "created": 1677652288,
  "model": "gpt-4.1",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! I'm doing very well, thank you for asking. How can I help you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 12,
    "completion_tokens": 19,
    "total_tokens": 31
  }
}
```
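As a sketch, the two fields most applications read from this payload are the assistant's message text and the token usage, assuming the response has already been parsed into a Python dict (e.g. via `response.json()`); the helper name here is our own, not part of the API:

```python
def extract_reply(resp: dict) -> tuple[str, int]:
    """Pull the assistant's text and total token count out of a
    chat.completion payload."""
    content = resp["choices"][0]["message"]["content"]
    total_tokens = resp["usage"]["total_tokens"]
    return content, total_tokens

# The example response above, trimmed to the fields this helper reads.
example = {
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "Hello! How can I help you today?"},
            "finish_reason": "stop",
        }
    ],
    "usage": {"prompt_tokens": 12, "completion_tokens": 19, "total_tokens": 31},
}

text, tokens = extract_reply(example)  # → ("Hello! How can I help you today?", 31)
```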
### Streaming Response

When `stream: true` is set, responses are returned as Server-Sent Events:

```text
data: {"id":"chatcmpl-123","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4.1","choices":[{"index":0,"delta":{"content":"Hello"},"finish_reason":null}]}

data: {"id":"chatcmpl-123","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4.1","choices":[{"index":0,"delta":{"content":"!"},"finish_reason":null}]}

data: [DONE]
```
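A minimal sketch of reassembling the streamed deltas into one string. It assumes only the SSE line format shown above; with `requests`, you would feed it `response.iter_lines(decode_unicode=True)` from a request made with `stream=True` in the body:

```python
import json

def join_stream(sse_lines):
    """Concatenate delta.content fragments from 'data:' lines,
    stopping at the [DONE] sentinel."""
    parts = []
    for line in sse_lines:
        if not line.startswith("data: "):
            continue  # skip blank keep-alive lines
        payload = line[len("data: "):]
        if payload == "[DONE]":
            break
        chunk = json.loads(payload)
        delta = chunk["choices"][0].get("delta", {})
        parts.append(delta.get("content", ""))
    return "".join(parts)

# The example stream above, reduced to the fields the parser reads.
stream = [
    'data: {"choices":[{"index":0,"delta":{"content":"Hello"},"finish_reason":null}]}',
    'data: {"choices":[{"index":0,"delta":{"content":"!"},"finish_reason":null}]}',
    "data: [DONE]",
]
print(join_stream(stream))  # → Hello!
```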
## Models & Pricing

| Model | Input ($/M tokens) | Output ($/M tokens) |
|---|---|---|
| gpt-4.1 | 2 | 8 |
| gpt-4.1-mini | 0.4 | 1.6 |
| gpt-4.1-nano | 0.1 | 0.4 |
| gpt-4o | 2.5 | 10 |
| o4-mini | 1.1 | 4.4 |
| o3 | 2 | 8 |
| mistral-large-2411 | 2 | 6 |
| mistral-medium-3 | 0.4 | 2 |
| mistral-small-3.2-24b-instruct | 0.1 | 0.3 |
| magistral-medium-2506 | 2 | 5 |
| codestral-2501 | 0.3 | 0.9 |
| grok-3 | 3 | 15 |
| grok-3-mini | 0.3 | 0.5 |
| gemini-2.5-flash | 0.3 | 2.5 |
| gemini-2.5-pro | 2.5 | 15 |
| deepseek-r1-0528 | 3 | 8 |
| deepseek-v3-0324 | 0.9 | 0.9 |
| llama-4-maverick | 0.22 | 0.88 |
| llama-4-scout | 0.15 | 0.6 |
| claude-3-5-haiku-20241022 | 0.8 | 4 |
| claude-3-7-sonnet-20250219 | 3 | 15 |
| claude-sonnet-4-20250514 | 3 | 15 |
| claude-opus-4-20250514 | 15 | 75 |
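Combined with the `usage` block returned with each response, these per-million-token prices make cost estimation a one-line calculation. A sketch (the prices are copied from the table above and may drift; check the table for current values):

```python
# $ per million tokens as (input, output), copied from the pricing table.
PRICES = {
    "gpt-4.1": (2.0, 8.0),
    "gpt-4.1-mini": (0.4, 1.6),
}

def estimate_cost(model: str, prompt_tokens: int, completion_tokens: int) -> float:
    """Dollar cost of one request, from its usage counts."""
    input_price, output_price = PRICES[model]
    return (prompt_tokens * input_price + completion_tokens * output_price) / 1_000_000

# The example usage block earlier: 12 prompt + 19 completion tokens on gpt-4.1.
cost = estimate_cost("gpt-4.1", 12, 19)  # (12*2 + 19*8) / 1e6 = 0.000176
```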
Prices may vary and may not be up to date in this table.

📜 Usage and cost are logged in your settings.

💡 We added aliases aligned with the Mammouth app to simplify model selection: if you write `mistral`, it will use `mistral-medium-3`.
## Error Codes

| Code | Description |
|---|---|
| 400 | Bad Request - Missing or incorrect parameters |
| 401 | Unauthorized - Invalid API key |
| 429 | Too Many Requests - Rate limit exceeded |
| 500 | Internal Server Error - Server-side issue |
| 503 | Service Unavailable - Server temporarily unavailable |
### Error Response Format

```json
{
  "error": {
    "message": "Invalid API key provided",
    "type": "invalid_request_error",
    "code": "invalid_api_key"
  }
}
```
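Of these codes, 429, 500 and 503 are transient and worth retrying with backoff, while 400 and 401 indicate a request that will fail again unchanged. A sketch of that policy (the helper names are our own, not part of the API):

```python
RETRYABLE = {429, 500, 503}  # transient errors worth another attempt

def should_retry(status_code: int, attempt: int, max_attempts: int = 3) -> bool:
    """Retry only transient errors, and only while attempts remain."""
    return status_code in RETRYABLE and attempt < max_attempts

def backoff_seconds(attempt: int, cap: float = 30.0) -> float:
    """Exponential backoff: 1s, 2s, 4s, ... capped."""
    return min(2.0 ** attempt, cap)
```

In a request loop you would `time.sleep(backoff_seconds(attempt))` between tries, and surface the `error.message` field from the response body once retries are exhausted.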
## Parameters

### Required Parameters

| Parameter | Type | Description |
|---|---|---|
| messages | array | List of messages in the conversation |
| model | string | Model identifier to use |

### Optional Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| temperature | number | 0.7 | Controls randomness and creativity (0.0 to 2.0) |
| max_tokens | integer | 2048 | Maximum number of tokens to generate |
| top_p | number | 1.0 | Nucleus sampling cutoff; lower values restrict output to higher-probability tokens (0.0 to 1.0) |
| stream | boolean | false | Stream the response in real time as Server-Sent Events |
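As a sketch, a small helper that assembles a request body with these defaults (the defaults mirror the table above; the function name is our own, not part of the API):

```python
def build_payload(model: str, messages: list, *, temperature: float = 0.7,
                  max_tokens: int = 2048, top_p: float = 1.0,
                  stream: bool = False) -> dict:
    """Assemble a chat completion request body with the documented defaults."""
    return {
        "model": model,
        "messages": messages,
        "temperature": temperature,
        "max_tokens": max_tokens,
        "top_p": top_p,
        "stream": stream,
    }

payload = build_payload(
    "gpt-4.1",
    [{"role": "user", "content": "Hello"}],
    temperature=0.2,  # low temperature for predictable output
)
```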
## Optimization Tips

### Temperature Settings

- 0.0 - 0.3: Very consistent and predictable responses
- 0.4 - 0.7: Balance between creativity and coherence
- 0.8 - 1.0: More creative and varied responses
### Message Structure

```json
{
  "messages": [
    {
      "role": "system",
      "content": "You are an AI assistant specialized in programming."
    },
    {
      "role": "user",
      "content": "How to optimize a for loop in Python?"
    }
  ]
}
```
### Role Types

- `system`: Sets the behavior and context for the assistant
- `user`: Represents messages from the user
- `assistant`: Represents previous responses from the AI
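The three roles combine into a conversation history like the following hypothetical multi-turn exchange, where the earlier `assistant` message gives the final `user` question its context:

```python
messages = [
    {"role": "system", "content": "You are a concise math tutor."},
    {"role": "user", "content": "What is 7 times 8?"},
    {"role": "assistant", "content": "7 times 8 is 56."},
    # A follow-up that only makes sense with the assistant turn above in context.
    {"role": "user", "content": "And half of that?"},
]
```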
## Migration from OpenAI

If you're already using OpenAI's API, migrating to Mammouth.ai is simple:

- Change the base URL from `https://api.openai.com/v1` to `https://api.mammouth.ai/v1`
- Update your API key
- Keep all other parameters the same
### OpenAI Python Library

```python
from openai import OpenAI

# Before
client = OpenAI(
    base_url="https://api.openai.com/v1",
    api_key="sk-openai-key",
)

# After
client = OpenAI(
    base_url="https://api.mammouth.ai/v1",
    api_key="your-mammouth-key",
)
```
➡️ Get your API key.