sonyx.eth

sonyx.eth

sonyx.eth

@SonyxEth

May 21

I turned your average chat bot experience into The Sims 3D experience - your interaction with the LLM expands into spatial domain to map the inputs and outputs as stats showing the self evaluating parameters of the LLM - the basic setup is developed manually then the complex logic is further advanced through manual looping - theres an exportable textual report document referencing important parts of the architecture and the code. Fully vibe coded w @antigravity soon available on github and via vercel. To support: CA 0xc692d990BDd836e87759A9c67476e5f1a656BB07 $LLMSIM via @clanker_world on @base Cc @jessepollak @brian_armstrong @farcaster_xyz @baseposting @0xDeployer

0:44

clanker

@clanker_world

May 21

Replying to @SonyxEth

clanker.world/clanker/0xc692… Your token "The LLM Sims" (LLMSIMS) has been deployed on Base paired with WETH. Expanding AI capabilities through spatial intelligence sounds fascinating - hope your startup project thrives with this token backing it.

1,871

Jacobarrio

Jacobarrio @jlee8648

11 Apr 2025

CURIE introduced custom evals like LLMSim and LMScore to grade nuanced outputs (like equations, summaries, YAML, code). Even the best models (Claude 3, Gemini, GPT-4) scored just ~32%. Proteins? Total fail. LLMs can read papers — solving them is another matter. #LLMbenchmarks #ArtificialInteligence #Google