Simple, Transparent Pricing
Choose the plan that fits your needs. All plans include our core features with no hidden costs.
Available Models
| Model | Category |
|---|---|
| Llama 2 | General Purpose |
| Mistral | Efficient |
| CodeLlama | Code Generation |
| Vicuna | Conversational |
| Alpaca | Instruction Tuned |
| WizardLM | Advanced Reasoning |
| MPT | High Performance |
| Falcon | Open Source |
Infrastructure Options
Cloud Providers
Popular Regions
Choose Your Plan
Starter
Professional
Enterprise
Feature Comparison
| Feature | Starter | Professional | Enterprise |
|---|---|---|---|
| Hardware Specs | 4 vCPU, 16GB RAM | 8 vCPU, 32GB RAM | 16 vCPU, 64GB RAM |
| Storage | 100 GB | 500 GB | 2 TB |
| Performance | 100 req/s | 500 req/s | 2000+ req/s |
| Auto-scaling | Yes | Yes | Yes |
| Health Monitoring | Basic | Advanced | Enterprise |
| Support | | Priority | Dedicated |
| SLA | - | - | 99.9% |
Frequently Asked Questions
What's the difference between plans?
Plans differ in hardware specifications, performance limits, and support levels. Starter is great for development and testing, Professional for production workloads, and Enterprise for high-scale applications.
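If throughput is your main constraint, choosing a plan can be sketched as a simple lookup. The example below is only an illustration: the limits are the req/s figures from the Feature Comparison table, while `PLAN_LIMITS` and `smallest_plan` are hypothetical names and not part of any API we provide.

```python
# Throughput limits (requests per second) taken from the Feature Comparison table.
# Enterprise is listed as "2000+", so 2000 is treated as a floor rather than a cap.
PLAN_LIMITS = [("Starter", 100), ("Professional", 500), ("Enterprise", 2000)]


def smallest_plan(required_rps):
    """Return the smallest plan whose listed throughput covers required_rps."""
    for name, limit in PLAN_LIMITS:
        if required_rps <= limit:
            return name
    # Anything above 2000 req/s still falls under Enterprise ("2000+").
    return "Enterprise"


print(smallest_plan(350))
```

Hardware, storage, and support needs may still push you to a higher plan than throughput alone suggests.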
Is there a free tier?
We don't offer a free tier. All plans are paid and start immediately, ensuring you get the full performance and features from day one.
Can I change plans later?
Yes, you can upgrade or downgrade your plan at any time. Billing is hourly, so charges are prorated when you change plans.
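As a rough illustration of hourly proration, the sketch below adds up the cost of time spent on each plan within a billing period. The hourly rates are hypothetical placeholders, since this page does not list prices, and `prorated_charge` is an illustrative helper rather than part of our billing system.

```python
from decimal import Decimal

# Hypothetical hourly rates in USD, for illustration only (not actual prices).
HOURLY_RATE = {
    "starter": Decimal("0.10"),
    "professional": Decimal("0.40"),
    "enterprise": Decimal("1.50"),
}


def prorated_charge(segments):
    """Sum the cost of (plan, hours) segments, each billed at that plan's hourly rate."""
    return sum(
        (HOURLY_RATE[plan] * Decimal(hours) for plan, hours in segments),
        Decimal("0"),
    )


# 100 hours on Starter, then 50 hours on Professional after an upgrade.
total = prorated_charge([("starter", 100), ("professional", 50)])
print(total)
```

With these assumed rates, the period costs 100 × 0.10 + 50 × 0.40 = 30.00: you pay only for the hours spent on each plan.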
What cloud providers do you support?
We support AWS, Google Cloud, Microsoft Azure, and DigitalOcean. You can choose your preferred provider and region for deployment.
How does scaling work?
All plans include auto-scaling. You can scale your infrastructure up or down based on demand, and your billing adjusts automatically.