Dave Zilberman

Dave Zilberman

79 Photos and videos

Tweets

Dave Zilberman @Zilberman

Jun 4

We’re proud to welcome @GeneralistAI to the @NorwestVP portfolio! The company is on a mission to make general-purpose robots a reality and is led by a world-class team in @peteflorence and @andyzengineer and Andrew Barry bloomberg.com/news/articles/…

Nvidia-Backed Robotics Startup Generalist AI Valued at $2 Billion

The company raised $400 million in a funding round led by Radical Ventures

bloomberg.com

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

Jun 2

We Parse PDFs We spent 7 figures to put this on billboards throughout SF. I thought long and hard about putting something more creative and whimsical. But then you wouldn’t know what we do. AI agents (and humans) are consuming exponentially more documents as they do real work. They need the best quality document parser to not output garbage on downstream tasks. This is what we do today as a company. If you have any PDFs (or other documents), we parse them :) If you’re around SF in June for one of the following events, come stop by our booths: ✅ Snowflake Summit (this week, Booth 1123) ✅ Databricks Data AI Summit (June 15-18, Booth 137) ✅ AI Engineer World Fair(June 29-July 2, Booth L-G47) You can find us by the same sign we put on our billboards! We Parse PDFs @llama_index

2:26

160

14,889

Dave Zilberman

Dave Zilberman @Zilberman

May 28

Invest and sign a definitive agreement in the same quarter? Well, this is a new one! Congrats to @pratyus and the @natomalabs team on entering into a definitive agreement with @Snowflake. The demand for agentic AI infrastructure is moving fast. natoma.ai/blog/natoma-snowfl…

AI Agent Enablement for the Enterprise | Natoma

Natoma sits between your AI and your enterprise systems — connecting any AI client or agent to every application, database, and tool through one secure, governed platform.

natoma.ai

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

May 23

If you're stopping by the SF Caltrain station over Memorial Day weekend, you might catch a glimpse of our digital ads 📺 We parse (PDFs) (50 other document types)

11,531

Glauber Costa

Dave Zilberman retweeted

Glauber Costa

@glcst

May 4

Turso now includes unlimited active databases in every plan. We already had unlimited databases, but we would charge you based on how many of them were active. That is now gone. You want a database, you get a database.

0:39

387

113,722

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

Apr 14

This is why we released liteparse :) Free, open-source, designed for agents. Natively supports OCR / screenshotting for deeper visual understanding in a document when needed.

Andrej Karpathy

@karpathy

Apr 9

Replying to @kepano

I just tried it this morning on the 245-page Mythos pdf and it failed badly and the outputs were all mangled. Converting pdfs is really hard, I think it has to probably be a Skill not a program, for a SOTA LLM for it to work properly.

547

89,051

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

Apr 13

We’re open sourcing the first document OCR benchmark for the agentic era, ParseBench. Document parsing is the foundation of every AI agent that works with real-world files. ParseBench is a benchmark that measures parsing quality specifically for agent knowledge work: ✅ It optimizes for semantic correctness (instead of exact similarity) ✅ It has the most comprehensive distribution of real-world enterprise documents It contains ~2,000 human-verified enterprise document pages with 167,000 test rules across five dimensions that matter most: tables, charts, content faithfulness, semantic formatting, and visual grounding. We benchmarked 14 known document parsers on ParseBench, from frontier/OSS VLMs to specialized parsers to LlamaParse. Here are some of our findings: 💡 Increasing compute budget yields diminishing returns - Gemini/gpt-5-mini/haiku gain 3-5 points from minimal to high thinking, at 4x the cost. 💡 Charts are the most polarizing dimension for evaluation. Most specialized parsers score below 6%, while some VLM-based parsers do a bit better. 💡 VLMs are great at visual understanding but terrible at layout extraction. GPT-5-mini/haiku score below 10% on our visual grounding task, all specialized parsers do much better. 💡 No method crushes all 5 dimensions at once, but LlamaParse achieves the highest overall score at 84.9%, and is the leader in 4 out of the 5 dimensions. This is by far the deepest technical work that we’ve published as a company. I would encourage you to start with our blog and explore our links to Hugging Face to GitHub. All the details are in our full 35-page (!!) ArXiv whitepaper. 🌐: Blog: llamaindex.ai/blog/parsebenc… 📄 Paper: arxiv.org/abs/2604.08538?utm… 💻 Code: github.com/run-llama/ParseBe… 📊 Dataset: huggingface.co/datasets/llam… 🎥 YouTube: youtube.com/watch?v=g5p7G-Nw…

1:09

524

107,659

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

Mar 23

We're excited to collaborate with @googledevs on building an agentic workflow over complex financial documents - using LlamaParse and Gemini 3.1 Pro Brokerage statements have complex layouts, dense tables, and oftentimes visual elements like charts. Our multi-step agentic workflow does the following: 1. Ingest PDF into LlamaParse 2. Extract text and tables 3. Generate human-readable summary using Gemini Shoutout to @Vish_ow and @itsclelia 🙌 Check it out: developers.googleblog.com/bu…

Google for Developers Blog - News about Web, Mobile, AI and Cloud

developers.googleblog.com

Google for Developers

@googledevs

Mar 23

Improve document parsing accuracy by 15% for financial PDFs. Use LlamaParse and Gemini 3.1 Pro to extract high-quality data from unstructured brokerage statements and complex tables. 📈 Precise reasoning 📂 Structured PDF data ⚡️ Event-driven scaling Dive into the code on GitHub → goo.gle/4dCPjjd

0:08

258

30,793

Norwest

Dave Zilberman retweeted

Norwest

@NorwestVP

Mar 23

We’re proud to share that @OuroMeds , a Norwest portfolio company, has signed a definitive agreement to be acquired by @GileadSciences. When we co-led Ouro Medicine’s Series A in 2024, we deeply believed in its mission to fundamentally change how chronic immune-mediated diseases are treated. Congratulations on this significant milestone, and we look forward to supporting the company in its next chapter with Gilead. Read more: gilead.com/news/news-details…

215

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

23 Dec 2025

The DOJ messed up some redactions on the latest Epstein files 🗄️🔏 - they didn’t flatten the PDF layers and you can highlight/copy the underlying text. If you want to extract this text at scale, you *can’t* just feed everything to a VLM (gpt-5.2, sonnet-4.5, gemini 3). VLMs only look at the top-level visual layer of the page, and will output the redacted blocks. You need to also reconstruct the text from the PDF binary itself, which is more in line with “traditional” techniques. LlamaParse uses a combination of both VLMs along with reading the underlying binary. * If you try out our agentic mode by default, it will output the redacted blocks in the markdown `md` field, but extract out the full text in the `text` field * With a simple prompt change you can also extract out the full text in `md`. Prompt: "Do not output redactions if the underlying extracted text already exists - output the full extracted text instead" Whether you want to comb through any set of released government documents or any other file, come check out LlamaParse! Source reddit thread: reddit.com/r/Epstein/s/ax3we… File: justice.gov/multimedia/Court… To use LlamaParse, sign up to LlamaCloud: cloud.llamaindex.ai/

388

86,580

Dave Zilberman

Dave Zilberman @Zilberman

11 Dec 2025

Congratulations to @NorwestVP portfolio company @tv_scientific as they are joining @Pinterest

Jeff Crowe @jeffmcrowe

11 Dec 2025

Today, @Pinterest announced that it has reached a definitive agreement to acquire @NorwestVP portfolio company @tv_Scientific. Proud to have led the Series A because we believed CTV would become a true performance channel. Jason and David proved that. norwest.co/tvscientific-acqu…

Dave Zilberman

Dave Zilberman @Zilberman

2 Dec 2025

Huge Congratulations to @thakurtarun and @vezainc as @ServiceNow announces intent to acquire. Deeply thankful for allowing @NorwestVP to be a part of the journey from very early on. businesswire.com/news/home/2…

ServiceNow to Expand Security Portfolio With Acquisition of Veza’s Leading AI-native Identity...

ServiceNow (NYSE: NOW), the AI control tower for business reinvention, today announced its intent to acquire Veza, a leader in identity security. The acquisi...

businesswire.com

110

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

1 Dec 2025

Claude Code over Excel 🤖📊 Claude already 'works' over Excel, but in a naive manner - it writes raw python/openpyxl to analyze an Excel sheet cell-by-cell and generally lacks a semantic understanding of the content. Basically the coding abstractions used are too low-level to have the coding agent accurately do more sophisticated analysis. Our new LlamaSheets API lets you automatically segment structure complex Excel sheets into well-formatted 2D tables. This both gives Claude Code immediate semantic awareness of the sheet, and allows it to run Pandas/SQL over well-structured dataframes. We've written a guide showing you how specifically to use LlamaSheets with coding agents! Guide: developers.llamaindex.ai/pyt… Sign up to LlamaCloud: cloud.llamaindex.ai/

0:47

LlamaIndex 🦙

@llama_index

1 Dec 2025

Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. 🤖 Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata 📊 Use formatting cues like bold headers and background colors to automatically parse complex spreadsheet structures ⚙️ Create end-to-end automation pipelines that extract, validate, analyze, and generate reports from weekly spreadsheets 🔍 Leverage cell-level metadata to understand data types, merged cells, and visual formatting that conveys meaning The video below is an example of metadata analysis of spreadsheets via LlamaSheets, with Claude creating a script to parse budget spreadsheets by reading formatting patterns and generating structured datasets automatically. Complete setup guide with sample data and workflows: developers.llamaindex.ai/pyt…

0:47

317

75,904

Jerry Liu

Dave Zilberman retweeted

Jerry Liu

@jerryjliu0

15 Oct 2025

You might’ve known us as a “RAG framework” company - but we’ve been a best-in-class, agentic document OCR/workflow company for the past 1.5 years! 📑🤖 We’re building the future of knowledge work over documents. Our website is awesome - check it out if you haven’t already 👇 llamaindex.ai/

0:19

480

56,683

Dave Zilberman

Dave Zilberman @Zilberman

1 Oct 2025

Drop the mic @tursodatabase

Glauber Costa

@glcst

1 Oct 2025

Next week. The next evolution of SQLite is here.

239

Dave Zilberman

Dave Zilberman @Zilberman

21 Sep 2025

Let’s go @tursodatabase

Guillermo Rauch

@rauchg

20 Sep 2025

Turso is an incredible technical feat. A Rust rewrite of sqlite, with an async-first architecture, incoming support for concurrent writes, vector search, and browser / wasm support out of the box. I think this has a very good chance of being a foundational piece of infrastructure of the vibe-coding age. On-demand, sqlite-compatible global databases that can also run in-browser and on-device. The pace at which the project is evolving is most definitely *not normal*. @penberg and @glcst are built different. Demo: shell.turso.tech

0:19

116

Turso

Dave Zilberman retweeted

Turso

@tursodatabase

18 Sep 2025

TURSO LAUNCH PARTY IS OCTOBER 8 🎉 We're hosting a Launch Party in San Francisco on October 8 to celebrate the Turso Beta Launch. Join us in-person! RSVP at tur.so/soma More details to follow.

11,890

Dave Zilberman

Dave Zilberman @Zilberman

17 Sep 2025

Well said @jerryjliu0 and snazzy new website too!

Jerry Liu

@jerryjliu0

16 Sep 2025

I’ve excited to announce a brand-new website and documentation hub 💫 that solidifies our evolution towards automating knowledge work over your documents. You might’ve followed us since the “RAG framework” days. Even then, the biggest challenge users faced was figuring out how to actually ingest an entire collection of unstructured docs (.pdf, .pptx, .docx, and more) for chatbot/agentic workflow use cases. Over the past year we’ve progressively built up incredibly deep tech around document parsing, extraction, and indexing - while teaching developers how to build various workflows on top. We’re now going all in on documents, and we’re the only company that has both 1) SOTA document processing and file management 📈, and 2) agentic orchestration on top to solve use cases like deep research, report generation, and document workflows end-to-end. Our llamas will continue to love all sorts of data (we have 600 integrations on the open-source framework!), but they now especially love automating paperwork 🦙📄. If you would also love to automate paperwork, come check out our new website and come talk to us! Site: llamaindex.ai Developer Hub: developers.llamaindex.ai/

3:08

Pekka Enberg

Dave Zilberman retweeted

Pekka Enberg

@penberg

20 Aug 2025

Turso 0.1.4 alpha is now out! Highlights: 🔄 Initial bidirectional sync support 📦 TypeScript/JavaScript SDK improvements 🗄️ MCP server for accessing databases ⚡ Performance improvements and more Full release notes 👇

9,819

Glauber Costa

Dave Zilberman retweeted

Glauber Costa

@glcst

1 Jul 2025

Turso, the next evolution of SQLite (previously codenamed "Limbo") reaches its first alpha milestone and first official release. We are so confident in our foundations, that we are offering you $1,000 if you find any data corruption bugs through our partnership with @algoraio (more details below) We understand that you love SQLite because of its legendary reliability. Which is why we wrote Turso with two layers of Deterministic Simulation Testing to generate the most improbable combinations of corner cases and surface every bug: our own, built-in to the project, and our partnership with @AntithesisHQ that catches every bug that survives the simulator. Including bugs in the simulator itself. We are rewriting SQLite because we believe that despite its incredible track record, SQLite could be more. And the secret to unlocking this is an open community of contributors that can unleash the true power of Open Source. In the months leading up to the alpha, Turso has already counted more than 120 contributors. And we are hoping you are next. For the upcoming beta, work has already started on features like concurrent writes, better schema changes, and change data capture. All features we constantly hear from people are sorely needed in SQLite. Performance was not the focus of our alpha. Yet, some workloads already perform a lot better on Turso. Which is why companies like @spice_ai are reaching for Turso as a replacement for SQLite to accelerate AI workloads. Lastly, we are testing Turso, A LOT. It would be a lot harder to do this without the high performance CI infrastructure that @useblacksmith provides. Learn more about Turso 👇

445

87,505