Skip to content

Quota System & Usage Limitations

How do quotas work?

Usage limitations on Mammouth are defined per session. Quotas are fully renewed after 3 hours.

Different subscription plans (Starter, Standard, and Expert) have different usage allocations.


What consumes quotas?

Your consumption within a session depends on several factors:

FactorDescription
Message lengthThe longer your messages, the more they consume. The length of responses also counts.
Document sizeFiles and images added to the conversation impact your consumption
Conversation lengthThe longer a conversation gets, the more costly each exchange becomes (accumulated context is sent with each message)
Tool usageTools like web search or document generation consume additional quota
Model chosenMore powerful models consume more quota
Mammouth instructionsMammouth instructions are included in each request and impact consumption

The model threshold principle

Mammouth is designed so that you are not interrupted in your work. When you reach a model's quota, Mammouth automatically switches to a lighter model for the rest of your exchanges.

Concrete example:

You use Claude Opus and reach the threshold → Claude Sonnet takes over
You continue and reach the Claude Sonnet threshold → Claude Haiku takes over

This way, you can always continue working, even during heavy use.


Choosing the right plan

Mammouth offers several plans adapted to different usage levels:

PlanQuotaFor whom?
🌱 StarterReferenceDaily consumer usage — 80% of individual users prefer this plan
Standard3× the StarterIntensive users who want more flexibility without worrying about session limits — 50% of companies prefer this plan
🚀 Expert10× the StarterProfessionals and power users with significant and regular needs

💡 The quantity/price ratio improves significantly with higher plans: the Expert plan offers 10 times more quota for only 6 times the Starter price.


Input size and volume

  • Maximum input length: 4,000,000 characters
  • This input length includes:
    • document content
    • user prompt
    • contextual instructions (custom mammouth)

Content extraction

  • For very long or multiple documents:
    • a smart extraction of the most relevant content is performed
    • maximum extract size:
      • 30,000 characters (standard)
      • 150,000 characters (large context mode)

Supported files

File limits per conversations

  • Maximum 20 files
  • Maximum total size: 32 MB (all files combined)
  • Maximum size per file: 20 MB

Special cases

  • PDFs containing only images (e.g., scanned PDFs):
    • limited to 20 pages

💡 For large files, compress or split them before uploading. For PDF above 20MB, upload the docx or txt file.


Tips to optimize your consumption

🎯 Be precise and concise

Formulate clear and detailed instructions from the first message. Avoid vague requests that require back-and-forth clarification — each additional message consumes quotas.

💾 Use custom Mammouths for persistent context

If you need a document or instruction to be used across multiple conversations or throughout a long conversation, save it in a custom Mammouth. This avoids sending the same information repeatedly.

👉 Learn more about custom Mammouths

✏️ Review before sending

Take a moment to check the clarity and completeness of your message before sending it. A well-formulated prompt from the start reduces the number of messages needed and preserves your quota.

💡 If you regularly reach your limits, consider exploring Mammouth's higher plans for extended quotas.