Getting closer to frontier capability @ home. I am personally running ds4 on a Framework Desktop (AMD Strix Halo).
> Full SWE-Bench Verified score [for 2-bit quant DeepSeek-V4-Flash] is between 67.5–85%.
> The headline SWE-Bench Verified score for DeepSeek-V4-Flash is 80.8% for full-precision version.
> It is incredibly impressive that the version of the same model having some layers quantized down to 2 bits still performs comparatively well.
> To put it in a perspective, Claude 4.5 Opus scores 76.8% according to the official leaderboard.
That's why people using DS4F with DwarfStart, 2-bit quantized, are often surprised by the results. It's not a frontier model, but it is not a toy either: it is something you can actually use to get work done, and nobody can tell you what to do with it.