✨ Updated on August 3rd, 2025 - Latest LLM data and pricing
Compare up to 5 models across key performance dimensions. Each metric is normalized to show relative strengths and weaknesses.
Select pre-configured model groups for common comparison scenarios
Choose up to 5 models to visualize their efficiency profiles side-by-side
Artificial Analysis Intelligence Index score. Higher values indicate better reasoning and task performance.
Output tokens per second during generation. Uses logarithmic scaling due to extreme variations (20-2500+ t/s).
Inverted price score - lower cost models score higher. Represents cost-effectiveness.
Inverted latency score - lower time-to-first-token scores higher. Indicates how quickly models start responding.
Maximum context window size. Uses logarithmic scaling due to extreme variations (16k-10M tokens).
Select up to 5 models from the list above to see their efficiency radar chart comparison.
Use the quick comparison presets or search for specific models to get started.