ThermoQA: 293 Questions, 6 Models, 3 Tiers — The AI Thermodynamic Reasoning Report Card
293 questions, 3 tiers, 6 frontier models, 18 evaluations. Opus leads at 94.1% composite, MiniMax trails at 73.0%. A 21-point spread proves thermodynamics discriminates AI. Memorization ≠ reasoning.