Generate any data, in any format, of any structure, in any database. All on demand.
You can now get complex, realistic synthetic data just by asking for it.
We built an AI Agent that turns natural language into comprehensive datasets for AI and software development.
1. Create structured SQL databases and unstructured files (PDF, DOCX, JSON) in the same workflow.
2. Describe the data shape, edge cases, and relationships in plain English.
3. Download immediately as SQL, JSON, CSV, and raw files, or send data to any destination database of your choosing, like @Oracle, @databricks, @Snowflake, @Supabase, Neon, @postgres, @MySQL, @SQLServer, and many more.
I'd love your support on Product Hunt: producthunt.com/products/ton…
You can also try the Agent out for yourself. tonic.ai/fabricate
Shout out to @vercel for their outstanding streamdown library: streamdown.ai/. No more layout/style shift as markdown chunks stream in from Fabricate's data agent!
Today we released Fabricate 3.2.0, which allows users to export to:
- Parquet
- Databricks
- MongoDB
- Oracle
Generate synthetic data using Fabricate's data agent: tonic.ai/products/fabricate
The @tonicfakedata team is back and energized from our annual offsite, glamping at Zion National Park with @theautocamp.
We spent the week working, hiking, and bonding in nature. We're returning with fresh ideas and clear focus for 2026.
It’s going to be a great year.
Generating a synthetic @HL7 FHIR resource is a foundational capability for developing healthcare software and AI systems, but generating a realistic patient journey is the true challenge for building advanced agents.
A real patient journey has temporal and causal relationships. Symptom → Diagnosis → Procedure → Medication.
The events and the timing between them are critical for testing clinical AI. Most synthetic data tools miss this completely.
Today, a Fortune 500 healthcare company asked us at @tonicfakedata if our Data Agent could model this entire care continuum, including the complex inter-resource relationships. The answer is yes.
It's not just a data generator; it's a system simulator. It understands the logic of a clinical pathway and can generate the chain of #FHIR resources to match.
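To make the Symptom → Diagnosis → Procedure → Medication idea concrete, here is a minimal Python sketch of a temporally ordered chain of FHIR-style resources. It is an illustration only, not how Fabricate's Data Agent works: the `PATHWAY` list, the `generate_journey` helper, and the `follows` link are hypothetical names invented for this example, and the dictionaries are simplified rather than validated FHIR (real FHIR links resources through type-specific fields).

```python
import random
from datetime import datetime, timedelta

# Hypothetical pathway: (FHIR resource type, name of its timestamp field).
PATHWAY = [
    ("Observation", "effectiveDateTime"),   # symptom
    ("Condition", "recordedDate"),          # diagnosis
    ("Procedure", "performedDateTime"),     # procedure
    ("MedicationRequest", "authoredOn"),    # medication
]


def generate_journey(patient_id, start, seed=0):
    """Return one simplified patient journey as a list of dicts.

    Each event happens 1 to 14 days after the previous one and points
    back to it, so the chain is both temporally and causally ordered.
    """
    rng = random.Random(seed)
    when = start
    previous = None
    journey = []
    for index, (resource_type, time_field) in enumerate(PATHWAY):
        resource = {
            "resourceType": resource_type,
            "id": f"{patient_id}-{index}",
            "subject": {"reference": f"Patient/{patient_id}"},
            time_field: when.isoformat(),
        }
        if previous is not None:
            # "follows" is an illustrative field, not part of the FHIR spec.
            resource["follows"] = {
                "reference": f"{previous['resourceType']}/{previous['id']}"
            }
        journey.append(resource)
        previous = resource
        when += timedelta(days=rng.randint(1, 14))
    return journey


if __name__ == "__main__":
    for event in generate_journey("example-patient", datetime(2025, 1, 1)):
        print(event)
```

A test for clinical AI could then assert that no diagnosis predates its symptom, which is the kind of check that independently generated resources would fail.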
The new workflow: Describe the data you need, in plain English (or any other language). Generate it on demand. In any format or structure. Export it to a CSV or a destination database of your liking. This is the shift from finding data to fabricating it synthetically. It moves control from a central team directly to the developer.
Agentic AI now makes this possible. Instead of waiting for access, you can have a conversation to generate millions of rows (or PDFs, Word, PPT, etc.) of realistic, safe, and statistically representative data in minutes.
The AI race just shifted again.
Qualcomm’s new chips are gunning for Nvidia & AMD. Meta’s spending $27B on data centers. OpenAI lined up deals for $1.5 million in chips over the next several years.
It means the battle is now infrastructure over models. Whoever owns the silicon, supply chain & rack-scale footprint is setting the pace.
Whoever controls the racks controls the race.
LLMs have already consumed all human knowledge.
Now the frontier labs are paying 7 figures for human-generated data.
Why not just use high-quality synthetic data that is indistinguishable from the real thing?
The AI revolution is on a collision course with reality. We are running out of the one resource that fuels all progress: high-quality data. Without a new source, the AI revolution stops here.
Synthetic data generation represents a paradigm shift, enabling organizations to create unlimited, privacy-compliant datasets that fuel AI innovation without compromising sensitive information.
Enter the Fabricate Synthetic Data Agent. An AI that you chat with to generate any type of data you need for your AI use case.
Lack training data to develop a clinical pathways agent and need 100,000 clinical SOAP notes?
Need PDFs or Word documents to test a data pipeline for your sensitive data detection platform?
We got you. Sign up for our waitlist here:
tonic.ai/fabricate/waitlist?…
Yesterday, a customer asked if we could generate synthetic PDFs, DOCX, and EML files. The feature didn't exist.
So we just built it. Overnight.
This is the speed you get when your foundation is a flexible AI agent. No multi-week sprint planning. Just a clear customer need and focused execution. We're moving fast.
It used to take hours to manually create synthetic PDFs and DOCX files to test a data pipeline or an agentic workflow. The data was never realistic enough to find real issues.
With this new feature, you just upload a sample invoice to the Tonic Fabricate Data Agent and task it to "generate more invoices like this. Can you give me 100 of them as word docs?"
Of course, the Data Agent had questions.
It took 90 seconds to generate perfectly crafted invoices in both PDF and Word. This was with absolutely zero configuration, sophisticated prompting, or instruction. The capabilities are infinite with the synthetic data agent.
This is a massive accelerator for anyone building with LLMs. Check out the video and if you want to try it out for yourself, sign up for the waitlist here:
tonic.ai/fabricate/waitlist?…
We launch November 12th.
Earlier this year, @tonicfakedata acquired Fabricate, a pioneer in generating the complex, unstructured data needed to test modern applications.
I’ve been fortunate to lead GTM efforts for Fabricate, working with its creator @realmarkbrocato on the hardest synthetic data challenges. We are now the clear leader in synthetic data for software and AI development.
But the fundamental bottleneck remains: getting the exact data you need, right when you need it, is still too slow, or in some cases impossible. The friction of data access and generation kills development velocity and leaves AI projects in endless POCs.
We're officially launching the private preview of the Tonic Fabricate Data Agent. It's an AI agent that generates both structured and unstructured synthetic data from a simple conversation.
For the next 3 weeks, I'll be sharing our progress publicly. This is day 1. You can join the waitlist here:
tonic.ai/lp/fabricate-data-a…