Datalake developer advocate, software designer, and prolific musician. Check out my music ffm.bio/eyqbn6

Joined March 2010
669 Photos and videos
Shawn Gordon retweeted
Most AI for Data works in demo but falls apart in production. For real-life environments, LLMs identify the right data for a given prompt about 10% of the time. That has made AI for Data a non-starter for serious data work. Yesterday at Collate Summit we announced Collate 2.0. Collate translates the prompt into reliable action, grounded in a semantic context graph spanning every data source in the enterprise. This trust is grounded in three primitives, with benchmarks showing that data agent performance was boosted by up to 7X. • Context. 130 native connectors map every source to a single open metadata graph. • Semantics. A rich ontology lets agents reason over business meaning, not just column names. • Memory. Corrections and definitions become reusable "memory nuggets" shared across all agents and users who follow. Collate 2.0 brings vibe-working to data analysts, data engineers, and governance professionals. Now, you can ship trusted AI for Data into production across all your data teams. Read the announcement: buff.ly/X6hFwdF #ContextLayer #AIforData #AIAgents #OpenMetadata #DataGovernance
1
2
57
Shawn Gordon retweeted
Last chance to register! Collate Summit '26 is tomorrow. Join data and AI leaders for practical sessions on AI in production, governed data, and semantic context. Register: buff.ly/CC7BH2T #CollateSummit #AIReadiness #ContextLayer #AIforData
1
4
42
Shawn Gordon retweeted
Collate Summit '26 is one week out. Data & AI in Production. This year: the open context layer for AI. Turning metadata into semantic context that AI agents and people can both trust. Production stories from @OpenAI, @Airbus, @RakutenGroup, @Scout24, and more. Less theory. More evidence. Register: buff.ly/CC7BH2T #CollateSummit #AIAgents #SemanticContext
1
2
4
83
We've been furiously working on putting together @CollateData Summit, an amazing virtual event with some incredible speakers and talks. I've been able to review some of what's going to be presented, and you're not going to want to miss it. In addition to the keynotes from our co-founders, we've got talks from OpenAI, Airbus, Yelp, and more. Check out the lineup and register for this free event. getcollate.io/summit2026
1
55
Shawn Gordon retweeted
Your AI is missing context that lives in Confluence pages, shared drives, documents, and the institutional knowledge your teammates carry but never wrote down. Live demo June 18, 9 am PDT: Collate Context Center. 30 minutes. Register Now: buff.ly/v10g7KZ #Collate #OpenContextLayer #DataGovernance #OpenMetadata #DataEngineering
3
4
63
Shawn Gordon retweeted
@cassandra is the quiet powerhouse behind some of the internet's most demanding applications. Netflix streams video to millions; Apple processes transactions globally; major retailers run their entire e-commerce platforms on it. Of course, Collate, powered by @open_metadata , supports it, and Jason Haugland , along with @progrockrec , show you what that looks like in this latest Collate Solutions video. 🎥👉Watch here: buff.ly/3vyQpCh #apachecassandra #dataengineering #datagovernance #datalineage #dataquality
1
2
198
Shawn Gordon retweeted
Replying to @Rakuten
@Rakuten runs data at global e-commerce scale. Muqtafi Akhmad will share how their @open_metadata deployment became an AI context layer — not just a catalog. Collate Summit | June 10 | Free buff.ly/yHOH3jG #CollateSummit #DataEngineering #AIAgents #ContextLayer
2
4
123
Shawn Gordon retweeted
@unionbankph: 38K assets, lineage across Snowflake SageMaker QuickSight, quality gates for compliance. How they scaled data governance with @open_metadata. Cirene Simbahan covers the full story @CollateData Summit '26. June 10. Free. buff.ly/EdSMe62 #OpenMetadata #DataEngineering #DataGovernance
1
3
123
Shawn Gordon retweeted
Apache Iceberg is the dominant force in the table format space, and as such, it's an important part of your data ecosystem, and it is, of course, supported by #OpenMetadata. The support is less obvious and more convenient than you might have thought. In this new episode of Ask the Experts, Teddy Crepineau and @progrockrec will explain it all to you and give a practical demonstration. Not to be missed! 🎥👉youtu.be/aMCpmmywEGs #apacheiceberg #dataengineering #datagovernance #datalineage #dataquality
1
5
339
Shawn Gordon retweeted
The @CollateData Summit '26 speaker lineup is something else. Bonnie Xu — @OpenAI Amy Forest — @Yelp Lukas Patzke — @Airbus Angelita Frozza Sanches — @Scout24 Muqtafi Akhmad — @Rakuten Andrew Ford — @AO Dan Kostecki — @AmbryGenetics Jeppe Johansen — @Unity Cirene Simbahan — @unionbankph Suresh Srinivas Sriharsha Chintalapani — @CollateData Practitioners. Not analysts. They're sharing what production actually looks like. June 10. Free. Virtual. 🔗 buff.ly/HcxGymJ #CollateSummit26 #OpenMetadata #DataGovernance #DataEngineering #ModernDataStack #DataCommunity #AIReadiness #VirtualEvent
2
3
173
Shawn Gordon retweeted
Jo Perez and Aydin Geering teamed up for this latest episode of Data 30, "From Metadata to Meaning: RDF & Ontologies". This session shows how Collate uses semantic technologies to transform raw metadata into a unified knowledge graph, capturing the meaning, relationships, and business context behind your data. This semantic foundation enables AI agents to move beyond surface-level answers and instead reason over your data with accuracy and context. You'll learn how ontologies define the structure and meaning of your data, while the semantic layer bridges technical systems and business concepts, creating a consistent, governed view that AI and users can rely on. Whether you're enabling AI copilots, improving data discovery, or strengthening governance, this session will highlight how Collate becomes the intelligence layer that connects your data to real-world understanding, powering more reliable insights and smarter decision-making. 👉🎥Watch here: buff.ly/UaSDtXU #Data30 #OpenMetadata #SemanticIntelligence #Ontology #KnowledgeGraph #AIAgents #datagovernance
2
3
43
Shawn Gordon retweeted
Before you deploy an AI agent, ask yourself one question: Does your data have an agreed-upon meaning that a machine can actually read? If the answer is no, the agent will guess. Every time. Semantic intelligence is the trust layer that fixes this. It turns metadata into machine-readable context that both humans and AI can rely on. At @CollateData, we're going deep on how to build it. June 10. Free. 🔗 getcollate.io/summit2026 #SemanticIntelligence #AI #DataGovernance
1
3
46
Our Iceberg solution is robust and Incredibly simple and not what you expect
Apache Iceberg is the dominant force in the table format space, and as such, it's an important part of your data ecosystem, and it is, of course, supported by #OpenMetadata. The support is less obvious and more convenient than you might have thought. In this new episode of Ask the Experts, Teddy Crepineau and @progrockrec will explain it all to you and give a practical demonstration. Not to be missed! 🎥👉youtu.be/aMCpmmywEGs #apacheiceberg #dataengineering #datagovernance #datalineage #dataquality
2
233
Shawn Gordon retweeted
This Wednesday at 11 AM PDT, our CEO & Co-Founder @suresh_m_s is on stage with @DataSciConnect. The question on the table: as enterprises push AI into production, how do you ensure outputs reflect the right data, tone, and constraints, every time? It's a context architecture question. And the answer is becoming foundational to trustworthy AI. Free. One hour. Worth it. Register here 👉️ buff.ly/7qGVSx3 #ContextLayerAI #DataScienceConnect #SemanticIntelligence #RAG #GenerativeAI #AIAgents #DataGovernance #OpenMetadata
1
2
46
Shawn Gordon retweeted
Tomorrow at 3:15 PM ET, @suresh_m_s takes the stage at Data Summit. Spoiler alert: AI doesn't fail because of bad models. It fails because the data underneath has no shared meaning. Come hear the fix live. Then join us tomorrow evening for the @Collatedata @open_metadata Boston meetup 🍷 RSVP Here 👉️ luma.com/ewm14xmh #DBTA #DataSummit2026 #OpenMetadata #SemanticIntelligence #DataGovernance #AI #Boston #DataEngineering #GenAI
1
3
57
Shawn Gordon retweeted
Boston week is here 👋 We're en route to Data Summit 2026! Two full days of modern data architecture, GenAI, semantic layers, and the people actually building this stuff. Don't miss our Co-Founder & CEO @suresh_m_s on Wednesday at 3:15 PM ET for his live discussion, "The Missing Layer: How Semantics & Metadata Connect People, Data & AI." Then that evening, join us at the @CollateData @open_metadata community meetup. Come for the conference, stay for the conversation. Two events. One awesome city. One very good week. 🎟 Meetup RSVP: luma.com/ewm14xmh 🔗 DBTA Summit: buff.ly/72RVnXv #DBTA #DataSummit2026 #OpenMetadata #SemanticIntelligence #DataGovernance #AI #Boston #DataEngineering #GenAI #DataArchitecture
1
3
47
Shawn Gordon retweeted
@CollateData engineer, Teddy Crepineau, recently became a maintainer for sqlalchemy-redshift and has been driving new developments on the popular Python package, a critical dependency for my open-source projects, including @OpenMetadata! You can read more about Teddy's work and sqlalchemy-redshift v1.0.0 in our blog here: buff.ly/nN4CN0d #OSS #OpenMetadata #redshift #sqlalchemy #python
2
5
183
.@realDailyWire I don't understand why you keep having @RealMattFradd do hit pieces on the LDS church. We are your natural supporters, but he is virtually wrong about everything he says and refuses to actually have a member of the faith on the program to discuss it. My entire Ward has dropped their support of you. We don't finance bigotry.
1
38
Our new release has some amazing new features , notably our new AI Analytics which can replace your BI dashboards in many cases, and that's just one of the amazing new things in the release
📰 News: Today, we announced Collate AI Analytics: an AI data analyst that actually understands your data. For the first time, analysts can come in cold (no prior knowledge of where data lives, which metrics are trusted, or how business concepts are defined), ask a question in plain language, and move from discovery to visualization to dashboard creation in one chat. No data engineers. No separate BI tools. What makes the answers trustworthy: Collate's Semantic Context Graph connects AI agents to the right data sources and business understanding for every prompt, ensuring accuracy and compliance with internal data policies. "Other AI-driven analytics tools could generate a chart, but they couldn't tell you if the chart was right. Every answer needed a second opinion. With Collate AI Analytics, our analysts don't have to worry about a dashboard being grounded in incorrect or ungoverned data. It encodes how our business works and gives the AI that foundation before the question is even asked. There's no second-guessing." — Peeyush Nahar, Chief Product & Technology Officer, 📖 Read the Collate AI Analytics announcement: blog.getcollate.io/collate-g… #AIAnalytics #AIDataAnalyst #SemanticContext #DataGovernance #DataAnalytics #Collate
1
53
Shawn Gordon retweeted
Apache Kafka is the standard in the streaming space, and in this Collate Solutions video, Aydin Geeringh and @progrockrec will demonstrate what is involved with bringing it into your Collate environment. You'll see how Kafka integrates with your lineage, and how you can document your data streams with robust metadata. 👉🎥Watch here: youtu.be/bRwQtuCeC_s #dataengineering #dataquality #datagovernance #collate #apachekafka
2
4
64