Limits & Quotas Overview

Understanding the platform limitations, quotas, and performance boundaries in mixus.

Types of Limits

mixus implements several types of limits to ensure optimal performance and fair usage across all users:

Rate Limits

Control the frequency of API requests and user actions:

API requests per minute
Messages per hour
File uploads per day
Concurrent agent executions

Storage Limits

Manage data storage and memory usage:

File storage quota
Memory system capacity
Knowledge base size
Conversation history retention

Model Limits

Specific to AI model capabilities:

Token context windows
Maximum response length
Concurrent model requests
Processing timeouts

Limit Categories by Plan

Free Plan Limits

Monthly tokens: 1,000
File uploads: 5 per month
Storage: 100MB total
API requests: 60 per hour
Concurrent chats: 1
Memory entries: 50

Pro Plan Limits

Monthly tokens: 100,000 included
File uploads: Unlimited
Storage: 10GB total
API requests: 1,000 per hour
Concurrent chats: 10
Memory entries: 1,000

Team Plan Limits

Monthly tokens: 500,000 included
File uploads: Unlimited
Storage: 100GB total
API requests: 10,000 per hour
Concurrent chats: 50
Memory entries: 10,000

Enterprise Limits

Monthly tokens: Custom allocation
File uploads: Unlimited
Storage: Custom allocation
API requests: Custom rate limits
Concurrent chats: Unlimited
Memory entries: Unlimited

Understanding Limit Enforcement

Soft Limits

Limits that provide warnings but don’t block usage:

Budget thresholds: Alerts at 75%, 90%, 95%
Storage warnings: Notifications at 80% capacity
Performance alerts: Slow response time warnings

Hard Limits

Limits that prevent further usage until reset:

Rate limits: Temporary blocking until time window resets
Storage caps: No new uploads until space is freed
Token exhaustion: No model usage until next billing cycle

Graceful Degradation

How the platform handles limit approaches:

Priority queuing: Pro/Team users get priority during high load
Model fallback: Automatic switching to faster models when needed
Context truncation: Intelligent conversation history management

Rate Limiting Details

API Rate Limits

Free Plan:

Requests: 60 per hour (1 per minute)
Burst: 10 requests in 1 minute
Reset: Rolling 60-minute window
```text

**Pro Plan:**
```text
Requests: 1,000 per hour (~17 per minute)
Burst: 100 requests in 1 minute
Reset: Rolling 60-minute window
```text

**Team Plan:**
```text
Requests: 10,000 per hour (~167 per minute)
Burst: 500 requests in 1 minute
Reset: Rolling 60-minute window
```text

### Interactive Rate Limits

**Chat Messages:**
- Free: 20 messages per hour
- Pro: 500 messages per hour
- Team: 2,000 messages per hour

**File Uploads:**
- Free: 5 files per month
- Pro: 100 files per day
- Team: 1,000 files per day

**Agent Executions:**
- Free: 5 per day
- Pro: 100 per day
- Team: 1,000 per day

## Storage and Memory Limits

### File Storage

**Storage Allocation by Plan:**
```text
Free: 100MB total
├── Documents: Up to 50MB
├── Images: Up to 30MB
└── Other files: Up to 20MB

Pro: 10GB total
├── Documents: Up to 5GB
├── Images: Up to 3GB
└── Other files: Up to 2GB

Team: 100GB total
├── Shared storage: 80GB
└── Individual storage: 20GB per member
```text

### File Size Limits

**Individual File Limits:**
- **Documents**: 25MB per file
- **Images**: 10MB per file
- **Code files**: 5MB per file
- **Spreadsheets**: 15MB per file

**Processing Limits:**
- **OCR documents**: 100 pages per file
- **Image analysis**: 4K resolution maximum
- **Code analysis**: 50,000 lines per file

### Memory System Limits

**Memory Entries:**
```text
Free Plan:
├── Total entries: 50
├── Entry size: 500 characters
└── Search results: 5 per query

Pro Plan:
├── Total entries: 1,000
├── Entry size: 2,000 characters
└── Search results: 20 per query

Team Plan:
├── Total entries: 10,000
├── Entry size: 5,000 characters
└── Search results: 50 per query
```text

## Model-Specific Limits

### Context Window Limits

**By Model:**
```text
GPT-4o: 128,000 tokens
├── Effective context: ~120,000 tokens
├── System prompts: ~2,000 tokens
└── Response buffer: ~6,000 tokens

Claude 4 Sonnet: 1,000,000 tokens
├── Effective context: ~990,000 tokens
├── System prompts: ~3,000 tokens
└── Response buffer: ~7,000 tokens

Claude 3.5 Sonnet: 200,000 tokens
├── Effective context: ~190,000 tokens
├── System prompts: ~3,000 tokens
└── Response buffer: ~7,000 tokens

o1-preview: 128,000 tokens
├── Effective context: ~100,000 tokens
├── Reasoning space: ~20,000 tokens
└── Response buffer: ~8,000 tokens
```text

### Processing Timeouts

**Request Timeouts:**
- **Simple queries**: 30 seconds
- **Complex analysis**: 5 minutes
- **Agent execution**: 15 minutes
- **File processing**: 10 minutes

**Background Processing:**
- **Large file analysis**: 30 minutes
- **Batch operations**: 1 hour
- **Report generation**: 2 hours

### Concurrent Request Limits

**Simultaneous Requests:**
```text
Free: 1 concurrent request
Pro: 5 concurrent requests
Team: 20 concurrent requests
Enterprise: 100+ concurrent requests
```text

## Performance Limits

### Response Time Targets

**Service Level Objectives (SLOs):**
```text
Chat responses: 95% under 3 seconds
File uploads: 95% under 10 seconds
Search queries: 95% under 2 seconds
Agent execution: 90% under 30 seconds
```text

### Throughput Limits

**Platform Capacity:**
- **Peak concurrent users**: 100,000+
- **Messages per second**: 10,000+
- **File uploads per hour**: 50,000+
- **API requests per second**: 5,000+

## Monitoring and Alerts

### Usage Dashboards

Track your current usage against limits:

```text
📊 Current Usage Status:
├── API Requests: 450/1,000 per hour (45%)
├── Storage: 2.3GB/10GB (23%)
├── Monthly Tokens: 45,230/100,000 (45%)
└── Concurrent Chats: 3/10 (30%)
```text

### Automatic Alerts

**Limit Approach Warnings:**
```text
🟡 Warning: 75% of limit reached
🟠 Caution: 90% of limit reached
🔴 Critical: 95% of limit reached
⛔ Blocked: 100% of limit reached
```text

### Custom Thresholds

Set personalized alert levels:
- **Budget alerts**: Custom percentage thresholds
- **Storage alerts**: Custom GB thresholds
- **Usage alerts**: Custom daily/weekly limits

## Limit Increases

### Automatic Scaling

**Plan Upgrades:**
- Instant limit increases upon plan upgrade
- Prorated billing for mid-cycle upgrades
- Immediate access to higher tier limits

### Custom Limits (Enterprise)

**Available Customizations:**
- **Rate limits**: Custom API request rates
- **Storage**: Dedicated storage allocations
- **Concurrency**: Higher parallel processing
- **Timeouts**: Extended processing windows

### Temporary Increases

**Special Circumstances:**
- **Event-based scaling**: Higher limits during launches
- **Seasonal adjustments**: Increased capacity for peak periods
- **Migration assistance**: Temporary higher limits during onboarding

## Best Practices

### Staying Within Limits

**Rate Limit Management:**
- Implement exponential backoff in API clients
- Use batch processing for bulk operations
- Cache responses to reduce redundant requests

**Storage Optimization:**
- Regularly clean up unused files
- Compress large documents before upload
- Use external storage for large datasets

**Token Efficiency:**
- Optimize prompts for clarity and brevity
- Use appropriate models for task complexity
- Implement context management strategies

### Planning for Growth

**Usage Forecasting:**
- Monitor growth trends in usage dashboards
- Plan plan upgrades before hitting limits
- Consider enterprise options for predictable scaling

**Architecture Design:**
- Design systems to handle rate limiting gracefully
- Implement proper error handling for limit exceptions
- Use asynchronous processing for large operations

## Troubleshooting Limits

### Common Limit Issues

**1. Rate Limit Exceeded**
```text
Error: "Rate limit exceeded. Try again in 42 seconds."
Solution: Implement retry logic with exponential backoff
```text

**2. Storage Quota Full**
```text
Error: "Storage quota exceeded. Free up space to continue."
Solution: Delete unused files or upgrade plan
```text

**3. Context Window Exceeded**
```text
Error: "Input too long for model context window."
Solution: Reduce conversation history or file size
```text

### Resolution Strategies

**Immediate Actions:**
- Wait for rate limit reset periods
- Delete unnecessary files to free storage
- Switch to models with larger context windows

**Long-term Solutions:**
- Upgrade to higher plan tier
- Implement usage optimization strategies
- Consider enterprise custom limits

## Next Steps

- [Learn about rate limits in detail](/limits/rate-limits)
- [Understand storage limitations](/limits/storage)
- [Explore model-specific limits](/limits/models)

## Related Resources

- [Token Usage](/tokens/overview)
- [Billing & Pricing](/tokens/billing)
- [Account Management](/account/overview) 

Getting Started

Chats

AI Models

AI Agents

AI Tools

Files & Memory

Integrations

Legal AI

Model Context Protocol

Collaboration

Evaluation System

Micro-Agent Patterns

Organization Management

Enterprise

Account Settings

Tokens & Billing

Limits & Quotas

Security & Privacy

Videos

Support

Legal

API Reference

Limits & Quotas Overview

Types of Limits

Rate Limits

Storage Limits

Model Limits

Limit Categories by Plan

Free Plan Limits

Pro Plan Limits

Team Plan Limits

Enterprise Limits

Understanding Limit Enforcement

Soft Limits

Hard Limits

Graceful Degradation

Rate Limiting Details

API Rate Limits

Getting Started

Chats

AI Models

AI Agents

AI Tools

Files & Memory

Integrations

Legal AI

Model Context Protocol

Collaboration

Evaluation System

Micro-Agent Patterns

Organization Management

Enterprise

Account Settings

Tokens & Billing

Limits & Quotas

Security & Privacy

Videos

Support

Legal

API Reference

​Types of Limits

​Rate Limits

​Storage Limits

​Model Limits

​Limit Categories by Plan

​Free Plan Limits

​Pro Plan Limits

​Team Plan Limits

​Enterprise Limits

​Understanding Limit Enforcement

​Soft Limits

​Hard Limits

​Graceful Degradation

​Rate Limiting Details

​API Rate Limits

Types of Limits

Rate Limits

Storage Limits

Model Limits

Limit Categories by Plan

Free Plan Limits

Pro Plan Limits

Team Plan Limits

Enterprise Limits

Understanding Limit Enforcement

Soft Limits

Hard Limits

Graceful Degradation

Rate Limiting Details

API Rate Limits