Question 1

How much does Llama 3.1 405B cost?

Accepted Answer

Llama 3.1 405B costs $2.70/1M per 1M input tokens and $2.70/1M per 1M output tokens.

Question 2

What is Llama 3.1 405B's context window?

Accepted Answer

Llama 3.1 405B has a context window of 131K tokens, which means it can process up to 131,072 tokens in a single request.

Question 3

Is Llama 3.1 405B open source?

Accepted Answer

Yes, Llama 3.1 405B is open source. The model weights are publicly available and can be self-hosted.

Question 4

What is Llama 3.1 405B best used for?

Accepted Answer

Llama 3.1 405B is best suited for: coding, research, translation, reasoning. Open source — can be self-hosted for data privacy.

Question 5

What is Llama 3.1 405B's ELO score?

Accepted Answer

Llama 3.1 405B has an ELO score of 1267 on the LMSYS Chatbot Arena leaderboard, placing it in the mid-tier range.

Volume	Input Cost	Output Cost	Combined (50/50)
1,000 tokens	$0.0027	$0.0027	$0.0027
10,000 tokens	$0.0270	$0.0270	$0.0270
100,000 tokens	$0.2700	$0.2700	$0.2700
1,000,000 tokens	$2.7000	$2.7000	$2.7000

Llama 3.1 405B

Pricing Breakdown