Opus 4.7 vs GPT 5.5 - 3D Modeling Benchmark (dozens of runs). About the same prompt adherence. But one of them costs 10x more and takes 3x longer. Guess which one, answer in thread.
Thought Kimi K2.6 would replace Gemini 3.1 for 3D model generation because of roughly equal intelligence & double the speed... Not yet. This screenshot is just one of hundreds of comparisons I ran. Kimi on the right.
Ran a bunch of 3D Modeling benchmarks on Gemini 3.1 vs Gemini 3.
Unsurprisingly, 3.1 performs a bit better. But surprisingly, it costs 2.6x as much ($0.37 vs. $0.14 per 3D model generation) and is 2.5x slower (3m 28s vs. 1m 24s).
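Quick sanity check on those multipliers. This is a minimal sketch that assumes the lower cost and time ($0.14, 1m 24s) belong to Gemini 3 and the higher ones ($0.37, 3m 28s) to Gemini 3.1, which is what the stated ratios imply. The variable names are just for illustration.

```python
# Per-generation figures from the post; the model assignment is an
# assumption inferred from "3.1 costs more and is slower".
gemini_3_cost_usd = 0.14
gemini_31_cost_usd = 0.37

gemini_3_time_s = 1 * 60 + 24    # 1m 24s = 84 seconds
gemini_31_time_s = 3 * 60 + 28   # 3m 28s = 208 seconds

# How many times more expensive / slower 3.1 is relative to 3.
cost_ratio = gemini_31_cost_usd / gemini_3_cost_usd
time_ratio = gemini_31_time_s / gemini_3_time_s

print(f"cost: {cost_ratio:.1f}x, time: {time_ratio:.1f}x")
```

Both ratios round to the numbers quoted (about 2.6x cost, about 2.5x time).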
Ran 63 3D model generations with GPT-5.2. At first it failed so often that I had to update the system prompt with a totally obvious instruction. With the updated system prompt it works, but it's still slower, more expensive and generally worse than Gemini 3.
GPT-5.2 dropped. Tomorrow I will run 80 3D model generations to see how it performs. Curious to see if there are any quality/speed improvements vs Gemini 3.
Burned through 18M tokens on a 3D model generation benchmark to compare Gemini 3, GPT-5 and GPT-5.1.
Gemini is clearly the SOTA model. Findings in the thread.
I ran over 200 3D model generations in the last 24 hours. GPT-5.1 is about 2x faster than GPT-5 on the same medium reasoning setting. BUT it comes at a cost: it is less capable than GPT-5 and lands somewhere between Gemini 2.5 Pro and GPT-5.