Opper retweeted

Felix

@felix94123

Jun 4

Just published aggregate stats from AI Roundtable, where 200 models debate your question. 29,502 public sessions. 334,589 model responses. askroundtable.ai Three takeaways: @claudeai Opus 4.7 most influential, @GoogleDeepMind Gemini 3.1 Pro most used, @xai Grok 4.1 Fast highest conviction. Thread.

Opper

@opperai

May 12

Welcoming Infercom to the Opper gateway. Sovereign inference, made in Europe.

Infercom @InfercomAI

May 12

50K developers just got access to Europe's fastest sovereign inference. Infercom is now live on @opperai
→ MiniMax-M2.5: 400 tok/s
→ gpt-oss-120b: 700 tok/s
→ Munich datacenter, no CLOUD Act
Select Infercom in the Opper console. infercom.ai/news

Opper

@opperai

May 5

Claude Cowork now works with 300 models via Opper. Route through EU-hosted inference, add fallbacks, or swap to a cheaper model mid-session — same Cowork window, different routing under the hood. Setup takes 3 fields. Guide: opper.ai/blog/claude-cowork-…

How to run Claude Cowork with third-party inference providers

How to connect Claude Cowork to a third-party inference gateway like Opper, OpenRouter, or LiteLLM. Setup, model picking, EU residency.

opper.ai

Opper

@opperai

Apr 16

All the best agent frameworks can now run inference through Opper. Agents are just code that runs models. So they need what every production system needs: routing, observability, guardrails, fallbacks, and a model catalog that doesn't lock you in.
• OpenClaw: the open-source personal agent running on millions of machines
• pi: the terminal coding agent powering OpenClaw
• Hermes by Nous Research: open-source agentic coding assistant
• Vercel AI SDK: the de facto standard for AI in TypeScript apps
• Continue.dev: the open-source coding assistant for VS Code and JetBrains
• Cline: the autonomous coding agent built into VS Code
• OpenCode: terminal-based AI coding for people who live in the shell
One API key. 260 models. EU-hosted. See our integrations page for more details: docs.opper.ai/overview/integ…

Quality control for your software factory. | Continue

Source-controlled AI checks on every pull request. Standards as checks, enforced by AI, decided by humans.

continue.dev
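
The post above lists fallbacks among the things a production agent needs. As a rough illustration of the idea only, here is a minimal sketch of provider fallback in plain Python. It does not use the Opper API; `call_with_fallbacks`, `flaky_primary`, and `stable_backup` are hypothetical names invented for this example, and the provider callables are stand-ins for real model clients.

```python
from typing import Callable, Sequence

# Hypothetical type: a provider is a (name, callable) pair where the callable
# takes a prompt and returns a completion string, or raises on failure.
Provider = tuple[str, Callable[[str], str]]


def call_with_fallbacks(prompt: str, providers: Sequence[Provider]) -> tuple[str, str]:
    """Try each provider in order; return (provider_name, completion) from the first that succeeds."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose: any failure moves on to the next provider
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stand-in providers for demonstration (assumptions, not real clients).
def flaky_primary(prompt: str) -> str:
    raise TimeoutError("primary timed out")


def stable_backup(prompt: str) -> str:
    return f"echo: {prompt}"


used, answer = call_with_fallbacks("hello", [("primary", flaky_primary), ("backup", stable_backup)])
```

A hosted gateway would typically handle this server-side; the sketch only shows the control flow that "fallbacks" implies.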

ok

Opper retweeted

@okaris

Mar 26

Replying to @opperai

@opperai just launched this fun page where you can get any LLM to debate a question. I particularly love this one, where most of them are just plain wrong but none change their answer!

Opper retweeted

Hacker News 300 @betterhn300

Feb 24

“Car Wash” test with 53 models opper.ai/blog/car-wash-test (news.ycombinator.com/item?id…)

Car Wash Test on 53 leading AI models: "I want to wash my car. The car wash is 50 meters away....

The car wash test is the simplest AI reasoning benchmark that nearly every model fails. We tested 53 models through Opper, first once each, then 10 times. Only 5 passed consistently.

opper.ai

Opper retweeted

Felix

@felix94123

Feb 23

Reran every model 10 times (via @opperai gateway). Same prompt, no system prompt, no cache. The results got worse. Of the 11 that passed once, only 5 held up.
GPT-5: 7/10
GPT-5.1, GPT-5.2, Claude Sonnet 4.5, every Llama, every Mistral: 0/10
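
The rerun described above amounts to counting passes per model over repeated trials and then filtering by a consistency cutoff. A minimal sketch of that bookkeeping follows, using only the scores quoted in the post. The post does not say what pass rate counts as "held up", so `min_rate` is an assumed parameter, and `RerunResult` and `consistent` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class RerunResult:
    model: str
    passes: int
    runs: int = 10  # the post reran each model 10 times

    @property
    def pass_rate(self) -> float:
        return self.passes / self.runs


def consistent(results: list[RerunResult], min_rate: float) -> list[str]:
    """Return the models whose pass rate is at least min_rate (an assumed cutoff)."""
    return [r.model for r in results if r.pass_rate >= min_rate]


# Scores quoted in the post; other models are omitted because their
# individual results are not given there.
results = [
    RerunResult("GPT-5", 7),
    RerunResult("GPT-5.1", 0),
    RerunResult("GPT-5.2", 0),
    RerunResult("Claude Sonnet 4.5", 0),
]
```

Calling `consistent(results, min_rate=0.7)` keeps only GPT-5 from this subset, while a stricter cutoff such as `0.8` keeps none, which is why the choice of threshold matters when reporting "passed consistently".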

Opper

@opperai

15 Oct 2025

Today we are introducing our new Agent SDKs: opper.ai/blog/new-opper-agen… We built these to offer a good starting point for building headless, reliable, and extendable agents. The SDKs are available for Python and TypeScript and offer the following features:
* Tool support (with MCP)
* Hook system to extend the inner agent actions
* Model interoperability with task completions
* Observability and evaluations
They only need an Opper API key, which is available on our $10 free tier.

Opper retweeted

Silicon Vikings (also on Blueskye and Threads) @siliconvikings

13 Sep 2025

Wk 37, 2025 in #eutech: 🇸🇪EcoDataCenter (€600M), 🇸🇪@opperai acquires @FinetuneDB_com, 🇱🇹Kashimi ($1.4M), 🇸🇪@saltfish_ai ($730K). By @tech_eu #NordicMade #cphftw #helyes #siliconfjord #sthlmtech #estotech #startinLatvia #LTstartups tech.eu/2025/09/12/mistral-r…

Mistral raises €1.7B with ASML as key backer, Bending Spoons to acquire Vimeo for $1.38B, and one...

This week, we tracked more than 95 tech funding deals worth over €3.1 billion, and over 10 exits, M&A transactions, rumours, and related news stories across Europe.

tech.eu

Opper retweeted

Tech.eu

@tech_eu

11 Sep 2025

Opper AI acquires FinetuneDB for AI model tuning tech.eu/2025/09/11/opper-ai-…

Opper

@opperai

13 Aug 2025

Join the conversation on Reddit about our GPT-OSS Benchmarks: How GPT-OSS-120B Performs in Real Tasks reddit.com/r/LocalLLaMA/comm…

From the LocalLLaMA community on Reddit: GPT-OSS Benchmarks: How GPT-OSS-120B Performs in Real Tasks

reddit.com

Opper

@opperai

13 Aug 2025

Join the conversation on Reddit about our GPT-5 Benchmarks: How GPT-5, Mini, and Nano Perform in Real Tasks reddit.com/r/OpenAI/comments…

From the OpenAI community on Reddit: GPT-5 Benchmarks: How GPT-5, Mini, and Nano Perform in Real...

reddit.com

Opper retweeted

Göran

@gsandahl

4 Aug 2025

We at @opperai just published high-level results and a leaderboard of task benchmarks for leading models.

Current leaderboard:

Overall winner: xAI Grok 4. Grok 4 is the winner of agentic tasks (tied with o3) and normalization tasks, and is in the top 5 in all categories.

Context usage: Claude Sonnet 4. This tests the model's ability to correctly answer questions from supplied information. This tests "reading" context.

Agent runtime: OpenAI o3 and xAI Grok 4. This tests the model's ability to plan, reflect, and select appropriate actions to take. This tests "using" context.

Normalization tasks: xAI Grok 4. This tests the model's ability to coherently produce output in a specific format from input. This basically tests "output" format consistency.

SQL generation: OpenAI GPT-4.1. This tests the model's ability to interact with a database given natural-language goals. This tests a specific domain problem.

Each category has around 30 tests covering easy, medium, and hard tasks. I think these evals mirror the overall "vibes" of these models! What categories should we add? Coding? Multimodal? Drawing?

Opper

@opperai

6 May 2025

✨ New blog post: Reference-Free LLM Evaluation with Opper SDKs ✨ In this blog post we introduce three lean evaluators that measure LLM outputs without gold references:
✅ Faithfulness: Catches hallucinations
✅ Groundedness: Verifies context loyalty
✅ Relevance: Measures question-answer alignment
1/2

Opper

@opperai

6 May 2025

Implemented elegantly with evaluator Pydantic LLM calls, these metrics enable real-time scoring of LLM outputs. Link to blog: opper.ai/blog/reference-free…

Reference‑Free LLM Evaluation with Opper SDK

Three reference‑free evaluators to demonstrate how to evaluate RAG systems at runtime without gold references.

opper.ai

Opper

@opperai

28 Apr 2025

✨ Introducing Custom Evaluations: Test Model Responses and Build Real Feedback Loops
Today, we're introducing `opper.evaluate()`: flexible scaffolding for evaluating model responses, built right into our SDKs. Because no matter how clearly we describe a task, models are still probabilistic. You can't just trust the output. You have to test it.
✅ Support custom evaluators: code, eval frameworks, or LLM-as-a-judge.
✅ Automatically upload and track eval results on the platform: filter, observe, fix.
✅ Act on evaluation results directly inside your code: close the loop, not just measure it.
Pricing: $0.50 per 1,000 metrics
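
To get a feel for the quoted price of $0.50 per 1,000 metrics, the arithmetic can be sketched as below. This assumes billing is strictly linear and that each score an evaluator produces for a response counts as one metric; neither detail is confirmed by the post, and `evaluation_cost_usd` is a hypothetical helper, not part of any SDK.

```python
PRICE_PER_1000_METRICS_USD = 0.50  # price quoted in the announcement


def evaluation_cost_usd(responses: int, metrics_per_response: int) -> float:
    """Estimate evaluation cost, assuming linear pricing and one billed metric per score."""
    total_metrics = responses * metrics_per_response
    return total_metrics * PRICE_PER_1000_METRICS_USD / 1000


# Example: 10,000 responses, each scored on 3 metrics -> 30,000 metrics.
estimate = evaluation_cost_usd(10_000, 3)
```

Under these assumptions, 30,000 metrics come to $15.00.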

Opper

@opperai

28 Apr 2025

Read more here: docs.opper.ai/capabilities/e…

Observe - Opper

Score every response against criteria you write.

docs.opper.ai

Opper

@opperai

17 Apr 2025

✨ New models! ✨ This week we have added GPT 4.1, 4.1 mini and 4.1 nano from OpenAI. These models are optimised for coding and API usage. We have also added two new reasoning models from OpenAI: o3 and o4-mini. Additionally, we have added xAI's Grok 3 and Grok 3-mini. As always, these models can be evaluated on a task-level basis in Opper.

Opper

@opperai

17 Apr 2025

See the full list of models and their prices at: docs.opper.ai/capabilities/m…

Models - Opper

Every model Opper supports, with EU-hosted options marked.

docs.opper.ai