Recce - Making Data Productive. (@DataRecce)

24 Apr 2025

Recce 1.0 is now live on Product Hunt! producthunt.com/posts/recce-… Upvote and leave a comment to help us grow the Recce community and bring better data review processed to more data teams Thanks for your support! #OpenSource #Data #DataEngineering #Analytics #DeveloperTools #dbt

129

Recce - Making Data Productive.

Jun 12

"A trillion dollars is arrayed against us." Stonebraker at 82, betting SQL should orchestrate data queries, not LLMs. youtube.com/shorts/3PPwQViNy… #AgenticAI #TextToSQL #SQL

A Trillion Dollars Is Arrayed Against Us -- Mike Stonebraker

"A trillion dollars is arrayed against us." At 82, Mike Stonebraker...

Recce - Making Data Productive.

Jun 9

NoSQL to "not only SQL" to "not yet SQL." Stonebraker's three-word history of a trillion-dollar industry detour. youtube.com/shorts/eATNtE37T… #NoSQL #SQL #DatabaseHistory

The Three-Word History of NoSQL -- Mike Stonebraker

"At the beginning, they said don't use SQL. Then not only SQL. Now?...

Recce - Making Data Productive.

Jun 8

"Don't ever bet against the compiler." Stonebraker on 40 years of database history and why NoSQL converged back to SQL. youtube.com/shorts/eduL-osMx… #SQL #NoSQL #Postgres

Don't Ever Bet Against the Compiler -- Mike Stonebraker

"Don't ever bet against the compiler." Mike Stonebraker won the Tur...

Recce - Making Data Productive.

Jun 5

"We tried out the LLMs that everyone else was touting. Accuracy was about 10%, not 80%." Stonebraker on real enterprise text-to-SQL. youtube.com/shorts/iMKrev7ZK… #TextToSQL #LLM #DataWarehouse

Text-to-SQL Gets 10% on Real Data Warehouses -- Mike Stonebraker

LLMs score 80% on text-to-SQL benchmarks. On MIT's real 1,400-table...

Recce - Making Data Productive.

Jun 2

Mike Stonebraker created Postgres, won the Turing Award, and just tested text-to-SQL on real warehouses. The result: 10%, not the 80% benchmarks claim. New episode. heavybit.com/library/podcast… #TextToSQL #Postgres #DataRenegades

Data Renegades | Ep. #11, Contrarian Bets and AI Skepticism with Michael Stonebraker | Heavybit

On episode 11 of Data Renegades, CL Kao sits down with Michael Stonebraker, legendary database pioneer and creator of Ingres and Postgres.

heavybit.com

Recce - Making Data Productive.

Jun 1

Day 1 of #SnowflakeSummit is a lot. Booth visits, back to back talks, so many handshakes. Tomorrow night, relax with us. 🍻 Data Renegades Happy Hour, Tuesday June 2, 6-9pm, 7 min walk from the summit. Drinks, not slides. 😉 luma.com/vnyf1nij?utm_source…

Data Renegades Snowflake Summit Happy Hour · Luma

In town for Snowflake Summit? Come decompress. Join us for drinks, good conversation, and zero keynote slides. 😉 We're bringing together data practitioners…

luma.com

Recce - Making Data Productive.

May 12

Data review flagged 99.999% row-count variance. PR was two lines. Base: 5 years of prod history. Current: 1-hour CI build. False alarms train reviewers to scroll past. That's the damage. blog.reccehq.com/session-bas… #dbt #DataEngineering

Session Base per PR: Why Data Reviews Lie

Data PR review breaks when the base and current environments are built differently. Here is why, and how session base per PR fixes the false alarms.

Recce - Making Data Productive.

Apr 29

90% of enterprise programmers spend their time on maintenance, not greenfield development. Michael Stonebraker's take on where AI actually earns its keep inverts the marketing narrative completely.

more replies

Recce - Making Data Productive.

Apr 29

And listen at: heavybit.com/library/podcast…

Data Renegades | Ep. #11, Contrarian Bets and AI Skepticism with Michael Stonebraker | Heavybit

On episode 11 of Data Renegades, CL Kao sits down with Michael Stonebraker, legendary database pioneer and creator of Ingres and Postgres.

heavybit.com

Recce - Making Data Productive.

#DataEngineering #AI #SoftwareEngineering #Analytics

Apr 29

Recce - Making Data Productive.

Apr 28

Michael Stonebraker was right about CODASYL. Right about NoSQL. Now he's run text-to-SQL on a real enterprise warehouse and got 10% accuracy against an 80% benchmark. The pattern is hard to ignore. blog.reccehq.com/benchmarks-… #DataEngineering #TextToSQL #AI

Benchmarks Lie: What a Turing Award Winner Found When He Tested Text-to-SQL on Real Data

Text-to-SQL benchmarks show 80% accuracy. A Turing Award winner tested the same models on a real 1,400-table warehouse and got 10%. Here is why.

Recce - Making Data Productive.

Apr 21

Before you let agents touch your codebase, build these gates. Not because you don't trust the agent, but because you wouldn't trust anyone without them. Including yourself. blog.reccehq.com/before-you-… #ClaudeCode #AIAgents #DevWorkflow

Before You Let Agents Touch Your Codebase, Build These Gates

Quality gates are what make Recce's agent-driven development actually work. Here's the pre-commit hooks, linting config, and review process keeping AI-written code production-ready.

Recce - Making Data Productive.

Apr 8

A wiki is something you look at. A shared AI system is something you work through. When knowledge lives inside the workflow, it stays current. Every time someone runs a skill, outdated entries get noticed. Gaps get filled. blog.reccehq.com/we-didnt-se… #ClaudeCode #DataEngineering

We Didn't Set Out to Build a Team AI Plugin

How Recce built a Claude Code plugin to share team knowledge, voice, product context, and workflows, across every AI session.

Recce - Making Data Productive.

Apr 1

AI coding tools generate plausible but wrong SQL constantly. The fix isn't waiting for a smarter model. AI skills are markdown files that encode domain knowledge into coding tools. No framework, just structured text in a repo.

Recce - Making Data Productive.

Apr 1

The loop: code → review → handoff → skills update. Every session makes the next one smarter. One aggregation bug became a permanent rule enforced automatically. @data_dori broke it all down at Data Debug SF. Full writeup: blog.reccehq.com/ai-skills-f… #DataEngineering #AI

A Practical Guide to AI Skills for Analytics Engineering

I built a self-improving AI skill system for analytics engineering at Recce. Here's the framework, a real bug it caught, and how we scaled it.

Recce - Making Data Productive.

Apr 1

Our own Kent Chen wrote up the multi-agent architecture the team built for Recce's AI Data Review. Single agent kept forgetting findings as PRs got complex. Fix: orchestrator two specialists, each with its own 200k context window.

Recce - Making Data Productive.

Apr 1

One subagent fetches full PR context via a single GitHub GraphQL MCP call (replaced 5-10 gh CLI round-trips). The other explores data through 6 Recce MCP tools: lineage_diff, schema_diff, row_count_diff, custom queries.

Recce - Making Data Productive.

Apr 1

Subagents return summaries to orchestrator, not raw payloads. Built with Claude Agent SDK and MCP. Read full post here: blog.reccehq.com/designing-r… #dbt #DataEngineering #AI #MCP

Designing Reliable AI Agents for dbt Data Reviews

Code changes have AI review tools. Data changes don' - until now. Here's how we went from a single prompt to an AI agent that performs the first pass on data validation in every PR.