Principal Data Scientist at Endava. Kaggle 2xGM. GDE. Engineer. Christian. Father. Passionate about Literature, History and Mountains. Opinions my own.
I just published: Introduction to Agentic AI with Google ADK 🚀 on @towards_AI (@Medium): lnkd.in/d2ctaemU
Learn how to build AI agents with Gemini that can:
• use tools
• orchestrate workflows
• collaborate with agents
• deploy on GCP
#AI #AgenticAI #ADK #A2A
Why should Romanian politics be a choice between skilled flamboyant thieves, boring yet kind of competent bureaucracy, and delusional pro-Russian traitors?
Is a high accuracy score on static benchmarks enough to trust AI in the real world?
Static accuracy is no longer enough. @PredaGabi explores how Kaggle Benchmarks are bridging the gap between the leaderboard and the real world. 🧵
🌟 Kaggle Community Spotlight!
Can LLMs handle the nuance of historical facts or do they just repeat popular myths? 🏛️
Kaggle Grandmaster @PredaGabi uses the new Kaggle Community Benchmarks to put leading models to the test on complex, disputed topics.
Learn more: medium.com/@gabi.preda/how-g…
ALT: A screenshot of a leaderboard titled "Does LLMs know history?" comparing the performance of four AI models across ten historical tasks. The models, Gemini 3 Pro Preview (0.90), Claude Sonnet 4 (0.80), DeepSeek-R1 (0.80), and Gemini 2.5 Flash (0.80), are displayed in a grid showing "PASS" and "FAIL" results. The tasks cover topics such as the Byzantine Empire, the Roman Empire, and Joan of Arc. Notably, Gemini 3 Pro Preview is the only model to pass the "last_egypt_pharaoh" task, while all four models failed the "vestal_virgin_jurisdiction" challenge. The interface is clean, featuring a white background with green and red status indicators for each task.