Home/Use Cases/Low Cost

Best LLM for Low Cost

12 models ranked for low cost tasks. Sorted by benchmark quality score, with price as a secondary factor.

Best Quality
Gemini 2.0 Flash
Google
ELO 1330
Cheapest Option
Mistral 7B
Mistral AI
$0.04/1M/1M input

All Models for Low Cost

#ModelProviderInput / 1MOutput / 1MELOFlags
🥇Gemini 2.0 FlashGoogle$0.10/1M$0.40/1M1330
🥈DeepSeek V3DeepSeek$0.27/1M$1.10/1M1320
OSS
🥉Qwen 2.5 72BAlibaba$0.35/1M$0.40/1M1280
OSS
4GPT-4o miniOpenAI$0.15/1M$0.60/1M1272
5Llama 3.1 70BMeta$0.35/1M$0.40/1M1247
OSS
6Gemma 2 27BGoogle DeepMind$0.27/1M$0.27/1M1220
OSS
7Gemini 1.5 FlashGoogle$0.07/1M$0.30/1M1211
8Command RCohere$0.15/1M$0.60/1M1200
9Claude 3 HaikuAnthropic$0.25/1M$1.25/1M1179
10Llama 3.1 8BMeta$0.06/1M$0.06/1M1176
FreeOSS
11Phi-3.5 MiniMicrosoft$0.10/1M$0.10/1M1112
FreeOSS
12Mistral 7BMistral AI$0.04/1M$0.04/1M1072
FreeOSS

Why We Picked These Models

Gemini 2.0 Flash
$0.10/1M/1MELO 1330

Gemini 2. Latest-gen quality with Flash-tier pricing.

DeepSeek V3
$0.27/1M/1MELO 1320

DeepSeek V3 shocked the AI industry by achieving GPT-4o-level performance at a fraction of the cost. Frontier-level quality at 10x lower cost than GPT-4o.

Qwen 2.5 72B
$0.35/1M/1MELO 1280

Qwen 2. Best-in-class for Chinese/Japanese/Korean languages.

Compare Top Models

Gemini 2.0 Flash vs DeepSeek V3Gemini 2.0 Flash vs Qwen 2.5 72BGemini 2.0 Flash vs GPT-4o miniGemini 2.0 Flash vs Llama 3.1 70B

Frequently Asked Questions

What is the best LLM for low cost?

Gemini 2.0 Flash by Google is rated as the best model for low cost with an ELO score of 1330. Latest-gen quality with Flash-tier pricing.

What is the cheapest LLM for low cost?

Mistral 7B is the most affordable option for low cost at $0.04/1M per 1M input tokens. It is also available for free.

Is there a free LLM for low cost?

Yes, Llama 3.1 8B and Phi-3.5 Mini and Mistral 7B are available for free and suitable for low cost.