Day 1 of #SnowflakeSummit is a lot. Booth visits, back to back talks, so many handshakes. Tomorrow night, relax with us. 🍻
Data Renegades Happy Hour, Tuesday June 2, 6-9pm, 7 min walk from the summit.
Drinks, not slides. 😉
luma.com/vnyf1nij?utm_source…
Data review flagged 99.999% row-count variance. PR was two lines.
Base: 5 years of prod history. Current: 1-hour CI build.
False alarms train reviewers to scroll past. That's the damage.
blog.reccehq.com/session-bas…#dbt#DataEngineering
90% of enterprise programmers spend their time on maintenance, not greenfield development.
Michael Stonebraker's take on where AI actually earns its keep inverts the marketing narrative completely.
Michael Stonebraker was right about CODASYL. Right about NoSQL. Now he's run text-to-SQL on a real enterprise warehouse and got 10% accuracy against an 80% benchmark.
The pattern is hard to ignore.
blog.reccehq.com/benchmarks-…#DataEngineering#TextToSQL#AI
A wiki is something you look at. A shared AI system is something you work through.
When knowledge lives inside the workflow, it stays current. Every time someone runs a skill, outdated entries get noticed. Gaps get filled.
blog.reccehq.com/we-didnt-se…#ClaudeCode#DataEngineering
AI coding tools generate plausible but wrong SQL constantly. The fix isn't waiting for a smarter model.
AI skills are markdown files that encode domain knowledge into coding tools. No framework, just structured text in a repo.
The loop: code → review → handoff → skills update. Every session makes the next one smarter. One aggregation bug became a permanent rule enforced automatically.
@data_dori broke it all down at Data Debug SF. Full writeup:
blog.reccehq.com/ai-skills-f…#DataEngineering#AI
Our own Kent Chen wrote up the multi-agent architecture the team built for Recce's AI Data Review.
Single agent kept forgetting findings as PRs got complex. Fix: orchestrator two specialists, each with its own 200k context window.
One subagent fetches full PR context via a single GitHub GraphQL MCP call (replaced 5-10 gh CLI round-trips). The other explores data through 6 Recce MCP tools: lineage_diff, schema_diff, row_count_diff, custom queries.