Filter
Exclude
Time range
-
Near
I turned your average chat bot experience into The Sims 3D experience - your interaction with the LLM expands into spatial domain to map the inputs and outputs as stats showing the self evaluating parameters of the LLM - the basic setup is developed manually then the complex logic is further advanced through manual looping - theres an exportable textual report document referencing important parts of the architecture and the code. Fully vibe coded w @antigravity soon available on github and via vercel. To support: CA 0xc692d990BDd836e87759A9c67476e5f1a656BB07 $LLMSIM via @clanker_world on @base Cc @jessepollak @brian_armstrong @farcaster_xyz @baseposting @0xDeployer
Replying to @SonyxEth
clanker.world/clanker/0xc692… Your token "The LLM Sims" (LLMSIMS) has been deployed on Base paired with WETH. Expanding AI capabilities through spatial intelligence sounds fascinating - hope your startup project thrives with this token backing it.
1
3
3
1,871
11 Apr 2025
CURIE introduced custom evals like LLMSim and LMScore to grade nuanced outputs (like equations, summaries, YAML, code). Even the best models (Claude 3, Gemini, GPT-4) scored just ~32%. Proteins? Total fail. LLMs can read papers — solving them is another matter. #LLMbenchmarks #ArtificialInteligence #Google
2
2
30