Symflower

Pinned Tweet

Symflower @symflower

20 Jul 2022

Ever wished you could just generate your #unittests instead of painfully writing them? In this video 👇 Evelyn demonstrates how to use get.symflower.com to speed up your daily development workflow 🚀✨ youtu.be/hYWpwJ6O7hE #golang #java

Symflower for IntelliJ IDEA - Smart Unit Test Generator for Java

Use Symflower to automatically create unit tests for Java and Go source code in IntelliJ IDEA, Visual Studio Code, GoLand, Android Studio, your terminal and CI.

symflower.com

Symflower @symflower

26 May 2025

Benchmarking LLM agents 💡 Our latest post covers useful benchmarks for evaluating LLM code generation agents and agentic software development workflows. symflower.com/en/company/blo… #LLMAgents #SoftwareDevelopment #Benchmarking #AI #Coding

Benchmarks evaluating LLM agents for software development

Benchmarks for evaluating LLM code generation agents and agentic software development workflows.

symflower.com

Symflower @symflower

23 Apr 2025

New to LLM coding agents? 🤖 Our introduction covers the capabilities, limitations, and use cases of LLM agents for software development 👇 symflower.com/en/company/blo…

An introduction to LLM agents for software development

An introduction to LLM agents for software development: their capabilities, limitations, and use cases.

symflower.com

Symflower @symflower

17 Mar 2025

Updated DevQualityEval v1.0 results are in 👀 Check out how our new king of cost-effectiveness (Google’s Gemini 2.0 Flash Lite) performed, and find out if Claude 3.7 Sonnet (Thinking) is worth the additional costs 👇

Markus Zimmermann @zimmskal

13 Mar 2025

Insights of analyzing >100 LLMs for the DevQualityEval v1.0 (generating quality code) in latest deep dive
- 👑 Google’s Gemini 2.0 Flash Lite is the king of cost-effectiveness (our previous king OpenAI’s o1-preview is 1124x more expensive, and worse in score)
- 🥇 Anthropic’s Claude 3.7 Sonnet is the functional best model (with help) … by far
- 🏡 Qwen’s Qwen 2.5 Coder is the best model for local use
- Models are on average getting better at code generation, especially in Go
- Only one model is on-par with static tooling for migrating JUnit 4 to 5 code
- Surprise! providers are unreliable for days for new popular models
- Let’s STOP the model naming MADNESS together: we proposed a convention for naming models
- We counted all the votes, v1.1 will bring: JS, Python, Rust, …
- Our hunch with using static analytics to improve scoring continues to be true
All the other models, details and how we continue to solve the "ceiling problem" in the deep dive: 👇🧵 (now with interactive graphs 🌈) Looking forward to your feedback :-)

Symflower @symflower

1 Oct 2024

We analyzed 80 #LLMs for generating quality code 👀 Here’s the deep dive blog post for the DevQualityEval v0.6: symflower.com/en/company/blo…

OpenAI's o1-preview is the king 👑 of code generation but is super slow and expensive (Deep dives...

Deep dive on evaluating 80 LLMs on how well they can write code for Go, Java and Ruby with DevQualityEval v0.6.

symflower.com

Symflower @symflower

30 Sep 2024

We analyzed >80 LLMs in the deep dive blog post from DevQualityEval v0.6 for generating quality code. Check out the insights and results 👇

Markus Zimmermann @zimmskal

30 Sep 2024

OpenAI's o1-preview is the king 👑 of code generation but is super slow and expensive 😱 This and other insights of analyzing >80 LLMs in the deep dive blog post from the DevQualityEval v0.6 for generating quality code 👇
- OpenAI’s o1-preview and o1-mini are slightly ahead of Anthropic’s Claude 3.5 Sonnet in functional score, but are MUCH slower and chattier.
- DeepSeek’s v2 is still the king of cost-effectiveness, but GPT-4o-mini and Meta’s Llama 3.1 405B are catching up.
- o1-preview and o1-mini are worse than GPT-4o-mini in transpiling code
- Best in Go is o1-mini, best in Java GPT4-turbo, best in Ruby o1-preview
Please support our work for the community by liking and sharing this post! 🙏 All the details and how we will solve the "ceiling problem" in the deep dive symflower.com/en/company/blo… (2x the content as the previous one!)

Symflower @symflower

18 Sep 2024

#Java 23 is out! 🥳 Learn about all the updates & new features in #JDK23: symflower.com/en/company/blo…

Symflower @symflower

12 Sep 2024

Execute only the tests you need 💡 We see a 29% reduction in test execution times with just a basic approach. Details of the benchmark, example & guide: symflower.com/en/company/blo…

Test impact analysis: Automatically run affected tests only

Showcasing how even a basic test impact analysis can already save 29% of test execution time on average.

symflower.com

Symflower @symflower

4 Sep 2024

Need to cut #LLM costs? 🤑 Read up on the key practices you can use to optimize your LLM spending 👌 symflower.com/en/company/blo…

LLM cost management: how to reduce LLM spending?

An overview of best practices for controlling costs when using Large Language Models to generate text or software code.

symflower.com

Symflower @symflower

30 Aug 2024

#LLM #observability 👀 Monitoring can help improve the performance of your LLM applications. Here’s what you need to know & the most useful tools for LLM observability 🔍 symflower.com/en/company/blo…

LLM observability: tools for monitoring Large Language Models

An overview of LLM observability and the top tools you can use to monitor the behavior of Large Language Models (LLMs).

symflower.com

Symflower @symflower

30 Aug 2024

We used #LLMs to #transpile #Java and #Golang code to #Ruby 🦾 Here’s what we experienced: symflower.com/en/company/blo…

Transpiling Go & Java to Ruby using GPT-4o & Claude 3.5 Sonnet

We used LLMs to transpile Java and Go source code to Ruby, to extend the DevQualityEval with Ruby.

symflower.com

Symflower @symflower

13 Aug 2024

Are you using #AI-powered tools in your #softwaredevelopment workflow❓ Aider is a good example that works well and even offers voice coding 🦾 Here’s our guide to using Aider: symflower.com/en/company/blo…

Using Aider AI for code generation

Learn more about Aider, an LLM-powered coding assistant, and find out how to use it in your workflow.

symflower.com

Symflower @symflower

12 Aug 2024

Lost in the sea of #LLM #codegeneration tools? 🌊 We’ve got you! Here’s our list of the top #AI tools for #softwaredevelopment: symflower.com/en/company/blo…

The best LLM tools for software development

This post covers the various AI tools that support software development, with a description of key features and supported technologies.

symflower.com

Symflower @symflower

22 Jul 2024

How well do #LLMs generate code ❓ There’s only one way to find out: #benchmarking models for #softwaredevelopment tasks. Here’s a roundup of popular LLM benchmarks & insights into our take on the topic 🤓 symflower.com/en/company/blo…

Comparing LLM benchmarks for software development

A comparison of the most widely used LLM benchmarks for coding, and insights into a new code generation benchmark.

symflower.com

Symflower @symflower

12 Jul 2024

Looking to evaluate LLMs? 👀 This post helps you navigate the #LLM #benchmark landscape 🧭 symflower.com/en/company/blo…

What are the most popular LLM benchmarks?

A comprehensive description of the most widely used LLM benchmarks.

symflower.com

Symflower @symflower

11 Jul 2024

What metrics do you track when evaluating #LLMs? 👀 Here’s an overview of complex statistical and model-based scorers 💡 Bonus: we also cover the #evaluation #frameworks that help you get started assessing #LargeLanguageModels. symflower.com/en/company/blo…

Evaluating LLMs: complex scorers and evaluation frameworks

An overview of unsupervised LLM evaluation metrics and the most popular frameworks to help you evaluate models for your use case.

symflower.com

Symflower @symflower

8 Jul 2024

Have you ever tried to fix performance issues in your #GoLang application but could not find out why it sometimes took longer? 🚀 Instrumenting your application for #Go #tracing 💡 might be what you need: symflower.com/en/company/blo…

Analyzing application performance: A guide to manual instrumentation with Go tracing

Learn how to use Go's built-in tracing to analyze your application's execution and visualize the data with the Go trace tool.

symflower.com

Symflower @symflower

5 Jul 2024

#Java 23 is coming in September 🥳 Here’s what you can get excited about in #JDK23! Check out all the updates in this release: symflower.com/en/company/blo…

Symflower @symflower

5 Jul 2024

Do you #reuse code? ♻️ Optimizing code for #reusability helps drive down development effort and cost while improving quality. Here’s a list of the most important reusability best practices for #Java #coding: symflower.com/en/company/blo…

How to write reusable code? Guide & best practices for reusability in Java

These best practices will help optimize your Java code for reuse, saving you development effort and costs.

symflower.com

Symflower @symflower

4 Jul 2024

Confused by LLM evaluation? 😵‍💫 We can’t blame you. Our new series on LLM #benchmarking guides you through all you need to know about measuring #LLM performance: symflower.com/en/company/blo…

How does LLM benchmarking work? An introduction to evaluating models

Learn the basics of LLM evaluation including general metrics and the most important benchmarks.

symflower.com

Symflower @symflower

3 Jul 2024

Struggling with performance bottlenecks in your #GoLang app? 🤔 #Go #tracing to the rescue! Explore our comprehensive guide and conquer even the toughest optimization challenges 💪 symflower.com/en/company/blo…

Analyzing application performance: A guide to manual instrumentation with Go tracing

Learn how to use Go's built-in tracing to analyze your application's execution and visualize the data with the Go trace tool.

symflower.com