Llama 4 Maverick is Meta's natively multimodal open-weight flagship, with a 1M context window and strong general performance. At $0.15/$0.60 per 1M tokens on hosted providers it is the leading open-weight option for most production workloads.
Pricing Breakdown
| Volume | Input Cost | Output Cost | Combined (50/50) |
|---|---|---|---|
| 1,000 tokens | $0.0001 | $0.0006 | $0.0004 |
| 10,000 tokens | $0.0015 | $0.0060 | $0.0038 |
| 100,000 tokens | $0.0150 | $0.0600 | $0.0375 |
| 1,000,000 tokens | $0.1500 | $0.6000 | $0.3750 |
Strengths
- ✓Natively multimodal open-weight flagship
- ✓Competitive with GPT-5.4 mini and Claude Sonnet on many tasks
- ✓1M token context window
- ✓Self-hostable for full data control
- ✓Available cheap on DeepInfra, Fireworks, Together
Weaknesses
- ✗Behind top closed models on hardest reasoning benchmarks
- ✗8x H100 required for self-hosting at full precision
Compare Llama 4 Maverick With
Frequently Asked Questions
How much does Llama 4 Maverick cost?
Llama 4 Maverick costs $0.15/1M per 1M input tokens and $0.60/1M per 1M output tokens.
What is Llama 4 Maverick's context window?
Llama 4 Maverick has a context window of 1M tokens, which means it can process up to 1,000,000 tokens in a single request.
Is Llama 4 Maverick open source?
Yes, Llama 4 Maverick is open source. The model weights are publicly available and can be self-hosted.
What is Llama 4 Maverick best used for?
Llama 4 Maverick is best suited for: coding, research, long-context, low-cost, image-understanding. Natively multimodal open-weight flagship.
What is Llama 4 Maverick's ELO score?
Llama 4 Maverick has an ELO score of 1350 on the LMSYS Chatbot Arena leaderboard, placing it among top-tier models.