Could we stop comparing raw models (Llama, Stable Diffusion, ...) with APIs (GPT-4, Claude, ...)? Most APIs probably include a lot of engineering tricks, and maybe even several models chained or MoE'd together under the hood, so these are not the same things at all and can't be compared.