
LLM Comparison Table

Side-by-side reference for current Claude, GPT, Gemini, and Llama models — context windows, knowledge cutoffs, pricing, capabilities. Filter and sort to pick the right model for a task.


Why a Static Reference, Updated Manually

Model specs change. Pricing changes. New tiers and deprecations land every few weeks. A live API-scraping comparison would still lag, because model providers do not all publish machine-readable change logs, and it would require API credentials for the queries. A manually curated table updated on a known cadence, with a footer date showing exactly when it was last refreshed, is more honest. This tool lists current public-facing specs across the four major model families, with a filter row to narrow by capability (vision, code, long context, fast inference) and sortable columns. The last update date is shown at the bottom. Pair with the AI Token Counter for the cost-per-paste version of the same question.

How to Read the Table

Every row is a current production model from one of four families: Anthropic Claude, OpenAI GPT, Google Gemini, and Meta Llama. Columns cover the specs that most often drive a model choice: input context window in tokens, output token cap, knowledge cutoff date, input price per million tokens, output price per million tokens, and capability tags (vision, code, agentic, fast). Click any column header to sort. The filter row at the top hides rows that lack a tagged capability — checking 'vision' removes text-only models, checking 'long context' removes anything under 100K. Pricing is in US dollars at list rates as of the last update date shown at the bottom of the table; enterprise contracts and batch discounts can be lower.
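The filter-and-sort behavior described above can be sketched in a few lines. This is an illustrative model only: the field names, model names, and prices below are hypothetical placeholders, not the tool's actual data or implementation.

```python
# Hypothetical rows mirroring the table's columns; values are illustrative only.
models = [
    {"name": "Model A", "context": 200_000, "input_price": 3.00, "tags": {"vision", "code"}},
    {"name": "Model B", "context": 32_000,  "input_price": 0.50, "tags": {"fast"}},
    {"name": "Model C", "context": 128_000, "input_price": 1.25, "tags": {"code", "fast"}},
]

def filter_models(rows, require_tags=(), long_context=False):
    """Hide rows missing any checked capability; 'long context' means >= 100K tokens."""
    out = [r for r in rows if set(require_tags) <= r["tags"]]
    if long_context:
        out = [r for r in out if r["context"] >= 100_000]
    return out

# Sorting by a column header is just a key function, e.g. cheapest input price first:
cheapest = sorted(filter_models(models, require_tags={"code"}),
                  key=lambda r: r["input_price"])
```

Checking a capability box intersects the row's tag set with the required tags; sorting never removes rows, it only reorders them.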

See also: in-browser alternatives that bypass per-token costs entirely — the AI Summarizer covers summarization, the AI Paraphraser covers rewriting, and the AI Translator covers translation.

Frequently Asked Questions

How often is the table updated?
The footer date shows the last refresh. Refreshes happen on a roughly monthly cadence and immediately after any major model launch. If you spot a stale entry, the feedback link at the bottom of the page goes straight to the maintainer.
Why is pricing listed at list rates only?
Enterprise and committed-use rates vary too widely to summarize. The list rate is the public floor; if you have negotiated rates, they will be lower. The list rate is the right number for first-pass cost modeling.
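First-pass cost modeling with list rates is simple arithmetic, since both input and output prices are quoted per million tokens. A minimal sketch, using hypothetical rates of $3/M input and $15/M output:

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimated list-rate cost in dollars for one request."""
    return (input_tokens / 1_000_000) * input_price_per_m \
         + (output_tokens / 1_000_000) * output_price_per_m

# 10K tokens in, 1K tokens out at the hypothetical rates above:
cost = request_cost(input_tokens=10_000, output_tokens=1_000,
                    input_price_per_m=3.00, output_price_per_m=15.00)
# → $0.045
```

Negotiated or batch rates only lower this number, so the list-rate estimate is a safe upper bound.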
What if a model has multiple context window options?
Each tier is its own row. A 32K and a 128K version of the same base model show up separately because pricing and behavior differ.
How is the knowledge cutoff defined?
It is the training-data freshness date publicly disclosed by the model provider. Some providers state a specific day; others state only a month. The table uses whatever the provider has published.
Are capability tags self-assessed?
Vision means the model accepts image inputs in its standard API. Code means the provider markets it for coding workflows. Agentic means it has been released with tool-use or computer-use features. Fast means the provider sells it as a low-latency or high-throughput variant. The tags are descriptive, not benchmarked.
Is there a benchmark column?
No — public benchmarks are inconsistent across providers and become stale within weeks. For task-specific choice, run your own evaluation. The table covers the structural facts that do not depend on benchmark methodology.
Why is Llama in here if it is not a managed API?
Llama models are widely used via Bedrock, Together, Fireworks, and self-hosting. Listed pricing reflects the cheapest mainstream managed-API rate; self-hosted costs vary by hardware. Llama is included because it is part of the practical model-choice landscape.
What about open models from Mistral, Qwen, Cohere, or others?
Future updates may expand the table. The current four families cover the bulk of production traffic, and adding more rows is weighed against keeping the table scannable. The table is open to expansion if specific gaps recur in feedback.

Built by Derek Giordano · Part of Ultimate Design Tools
