We hope you had a wonderful time at PyCon China 2024.🐍🎉
Huge thanks to all our sponsors, volunteers, partners, and everyone who made this event unforgettable!
See you next year for PyCon China 2025! #PyCon #PyConChina #PyConChina2024
o1 is not an upgrade of gpt-4o; it’s more like a sibling that performs better in “complicated” situations. Now the problem is: how do you identify these “complicated” scenarios automatically?
Gru.ai ranked first with a score of 45.2% in the latest results released for SWE-bench Verified, an authoritative benchmark for AI model evaluation built as a collaboration between the SWE-bench team and OpenAI. #GruAI #OpenAI #SWEBench
Behind our winning score is Bug Fix Gru, an agent designed to automatically fix bugs based on user issues. Here is a video showing how Bug Fix Gru works. youtu.be/Dv5SQxziE_A
Gru.ai provides three more agents: Assistant Gru helps users solve standalone technical issues and is now in public use; Test Gru generates unit test code automatically; and Babel Gru assists in building end-to-end projects.
Our score was just officially accepted by SWE-bench, and Gru.ai received a score of 35.67%! This ranks us first among the teams that provided a trajectory. Well done, Gru!
swebench.com #SWEBench
As a developer, you often face a lot of tedious tasks. That's where Gru.ai comes in to help. Let's check out two examples.
youtube.com/watch?v=To8xxlXs…
Babel is already using Babel Agent for long-term, complex jobs, such as writing backend management systems and integrating with the Stripe payment system.
youtube.com/watch?v=jP9bbrK0…
Babel’s CEO, Hailong Zhang, wakes up at 5 a.m. for a business trip and hands over the task description to Babel Agent. By the time his plane lands, the task is completed without any intervention from him.
medium.com/@connect_33559/ho…