Tweets

PareaAI retweeted

Tom Dörr

@tom_doerr

9 Sep 2024

.@PareaAI also looks like a good LLM monitoring tool and is open source

2,240

PareaAI retweeted

Joschka Braun

@JoschkaBraun

12 Aug 2024

How do you detect unreliable behavior of your LLM app? Recently, we talked to the team at @sixfoldai and they shared with us a simple, yet powerful way to assess the reliability of their LLM app using @PareaAI. More about how they test their risk assessment AI solution for insurance underwriters in the article in the thread

857

PareaAI retweeted

Joschka Braun

@JoschkaBraun

3 Aug 2024

Saturdays are for doc upgrades

546

PareaAI retweeted

Joel Alexander @joel_a_wilde

1 Aug 2024

🚀 New deep dive notebook on @PareaAI experiments and LLM evals 📝🔬. I cover some of the key functionalities illustrating the power and flexibility of our API. 🔽 Link in comments 🔽

483

PareaAI retweeted

Joel Alexander @joel_a_wilde

30 Jul 2024

Replying to @cohere

@cohere's actually pretty awesome. More folks should be exploring their models. @PareaAI now has auto-instrumentation for the Cohere Python SDK 🚀

382

PareaAI retweeted

Joel Alexander @joel_a_wilde

29 Jul 2024

Replying to @PareaAI

Also, learn more about the research behind each here: docs.parea.ai/blog/eval-metr…

Evaluation Metrics for LLM Applications In Production - Parea AI

How to measure the performance of LLM applications without ground truth data.

docs.parea.ai

153

PareaAI retweeted

Joel Alexander @joel_a_wilde

29 Jul 2024

There are so many “black box” evals that force users to instantiate eval classes. Never fully understood this. At @PareaAI we see evals as just functions. You can copy the source code and modify as you see fit, all OSS and based on latest research. Check these out👇🏾
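To illustrate the "evals as just functions" idea, here is a minimal sketch in plain Python. The function names, signatures, and scoring rules are assumptions made up for this example and are not Parea's actual eval API; the point is only that an eval can be an ordinary function from an output (and optionally a target) to a score.

```python
from typing import Optional


def exact_match(output: str, target: Optional[str]) -> float:
    """Hypothetical eval: 1.0 if output equals target after normalization, else 0.0."""
    if target is None:
        return 0.0
    return float(output.strip().lower() == target.strip().lower())


def answer_length_ok(output: str, max_words: int = 50) -> float:
    """Hypothetical eval that needs no ground truth: 1.0 if the answer is short enough."""
    return float(len(output.split()) <= max_words)


# Because evals are plain functions, they can be copied, edited, and called directly.
scores = {
    "exact_match": exact_match("Paris ", "paris"),
    "answer_length_ok": answer_length_ok("Paris"),
}
print(scores)  # {'exact_match': 1.0, 'answer_length_ok': 1.0}
```

Since nothing here is wrapped in a class, changing the scoring rule means editing a few lines of the function body.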

150

PareaAI retweeted

Joschka Braun

@JoschkaBraun

24 Jul 2024

📝 Updated integration docs ⭐️ Check out @PareaAI's updated docs to automatically trace apps powered by @LangChain, instructor by @jxnlco, @LiteLLM, DSPy by @lateinteraction, SGLang by @lmsysorg, and @triggerdotdev. Docs: docs.parea.ai/integrations/o…

532

PareaAI retweeted

Joschka Braun

@JoschkaBraun

23 Jul 2024

Day 1 support for llama 3.1 via @FireworksAI_HQ in @PareaAI's playground! 🧨🦙

278

PareaAI retweeted

Cyrus

@cyrusnewday

23 Jul 2024

And to help you understand what's going on, we integrate with observability platforms like @ArizePhoenix, @langchain's LangSmith, @langfuse, @PareaAI, and @lunary_hq so you can explore the experiments that zenbase/core automates. Cookbooks here: github.com/zenbase-ai/core/t…

798

PareaAI retweeted

Joel Alexander @joel_a_wilde

23 Jul 2024

Def agree this could be great. Probably best if you can train the router yourself. @anyscalecompute's RouteLLM tracing support with @PareaAI

Matthew Berman

@MatthewBerman

22 Jul 2024

RouteLLM is one of the most impactful algorithmic innovations in AI that I've ever seen. I don't think people realize how important it truly will become. Here's a full tutorial for how to use it:

290

PareaAI retweeted

Joel Alexander @joel_a_wilde

23 Jul 2024

With the latest @GroqInc models for tool calling, we figured it was time to make Groq available across @PareaAI's playground and SDKs. Be on the lookout for an updated tool-calling benchmark, OpenAI v Claude v Groq!

165

PareaAI retweeted

Joschka Braun

@JoschkaBraun

22 Jul 2024

📝 Updated self-deployment docs ⭐️ Deploy @PareaAI on-prem via @Docker in 4 steps:
1. Clone the repo
2. Specify organization slug
3. Pull docker images & run them
4. Point SDK backend URL to self-deployed backend URL
🔗 -> 🧵

234

PareaAI retweeted

Joel Alexander @joel_a_wilde

18 Jul 2024

There have been so many new models lately. Most recently, @MistralAI's codestral-mamba. I figured it'd be great to highlight how to use @PareaAI for Regression Testing. Check out the Notebook below, where I test codestral-latest vs mamba on LeetCode questions. 👇
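As a rough illustration of what regression testing two models on a shared question set can look like, here is a small self-contained sketch. It does not use Parea or Mistral APIs: the two "models" are hypothetical stand-in functions with canned answers, and the helper names are assumptions made for this example.

```python
from typing import Callable, Dict, List, Tuple

TestCase = Tuple[str, str]  # (prompt, expected answer)
Model = Callable[[str], str]


def pass_rate(model: Model, cases: List[TestCase]) -> float:
    """Fraction of test cases the model answers exactly right."""
    passed = sum(1 for prompt, expected in cases if model(prompt).strip() == expected)
    return passed / len(cases)


def find_regressions(baseline: Model, candidate: Model, cases: List[TestCase]) -> List[str]:
    """Prompts the baseline solves but the candidate gets wrong."""
    return [
        prompt
        for prompt, expected in cases
        if baseline(prompt).strip() == expected and candidate(prompt).strip() != expected
    ]


# Stand-in models with canned answers (hypothetical; a real run would call an LLM).
baseline_answers: Dict[str, str] = {"2+2": "4", "3*3": "9", "10-7": "2"}
candidate_answers: Dict[str, str] = {"2+2": "4", "3*3": "6", "10-7": "3"}


def baseline(prompt: str) -> str:
    return baseline_answers[prompt]


def candidate(prompt: str) -> str:
    return candidate_answers[prompt]


cases: List[TestCase] = [("2+2", "4"), ("3*3", "9"), ("10-7", "3")]
regressions = find_regressions(baseline, candidate, cases)
print(regressions)  # ['3*3']
```

Both stand-ins have the same overall pass rate here, which is why the per-case regression list is worth looking at in addition to the aggregate score.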

230

PareaAI retweeted

Joel Alexander @joel_a_wilde

18 Jul 2024

At this point I could probably have an llm monitor the top foundation model providers and then produce a PR for me that adds any new models to @PareaAI the moment they launch.

PareaAI retweeted

Joschka Braun

@JoschkaBraun

18 Jul 2024

Blog: python.useinstructor.com/blo…

173

PareaAI retweeted

Joschka Braun

@JoschkaBraun

18 Jul 2024

If you use structured outputs with Instructor, track validation errors instantly with @PareaAI. Concretely, the integration automatically:
- groups any LLM call due to retries together under a single trace
- tracks any field which failed validation with the respective error message
- visualizes validation error count over time

Instrument calls made via the Instructor client by adding two lines:

p = Parea(api_key="PAREA_API_KEY")
p.wrap_openai_client(client, "instructor")

Read the full blog post on the instructor docs in the 🧵

ALT Using p.wrap_openai_client one can automatically instrument instructor calls.

ALT Track instructor validation error count over time

ALT See all retries grouped under one trace
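The behaviour described in this tweet (retries grouped under one trace, failed fields recorded with their error messages, error counts tallied) can be sketched conceptually with the standard library alone. This is not Parea's or Instructor's implementation: the `Trace` class, `validate_user`, `call_with_retries`, and the fake LLM below are all hypothetical names invented for the illustration.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Trace:
    """All attempts for one logical request, grouped together."""
    attempts: int = 0
    field_errors: List[Tuple[str, str]] = field(default_factory=list)
    result: Optional[Dict[str, object]] = None


def validate_user(data: Dict[str, object]) -> Dict[str, str]:
    """Return {field: error message} for every field that fails validation."""
    errors: Dict[str, str] = {}
    if not isinstance(data.get("name"), str) or not data.get("name"):
        errors["name"] = "must be a non-empty string"
    if not isinstance(data.get("age"), int) or data["age"] < 0:
        errors["age"] = "must be a non-negative integer"
    return errors


def call_with_retries(
    call_llm: Callable[[int], Dict[str, object]],
    validate: Callable[[Dict[str, object]], Dict[str, str]],
    max_retries: int = 3,
) -> Trace:
    """Retry until the output validates, recording every attempt in one trace."""
    trace = Trace()
    for attempt in range(max_retries):
        trace.attempts += 1
        candidate = call_llm(attempt)
        errors = validate(candidate)
        if not errors:
            trace.result = candidate
            break
        trace.field_errors.extend(errors.items())
    return trace


# Stand-in for an LLM: the first answer has a bad "age", the retry fixes it.
responses = [{"name": "Ada", "age": "36"}, {"name": "Ada", "age": 36}]


def fake_llm(attempt: int) -> Dict[str, object]:
    return responses[min(attempt, len(responses) - 1)]


trace = call_with_retries(fake_llm, validate_user)
error_counts = Counter(name for name, _ in trace.field_errors)
print(trace.attempts, dict(error_counts), trace.result)
# 2 {'age': 1} {'name': 'Ada', 'age': 36}
```

Keeping all attempts on one `Trace` object is what makes it possible to see the whole retry history of a request in one place and to aggregate per-field error counts over time.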

2,160

PareaAI retweeted

Joel Alexander @joel_a_wilde

17 Jul 2024

Moving from demos to production-ready LLM apps can be challenging. In this post, I outline a practical workflow to help teams make this transition, focusing on:
- Hypothesis testing
- Dataset creation
- Effective evals
- Experimentation

Full post here: zurl.co/27Ad