Extract web data at scale with multimodal codegen || YC S23 AI Grant 3

Joined May 2023
Reworkd retweeted
An intern at a YC startup needed to extract SEC filings earnings data for all Fortune 100 companies. He shipped it in 2 days using GPT-4.1, and got a return offer.
The agent:
- Scrapes each company’s investor relations page
- Pulls 10-Ks, 10-Qs, earnings slides, and transcripts
- Syncs updates daily via API
Teams of analysts. Custom pipelines. Replaced by one intern in 48 hours. The bar has changed.
1 May 2025
April Product Update 🎉
- Browser Traces: One viewer with Reworkd/Playwright events, console logs, network activity, and CDP events for easier debugging.
- Inline Code Diffs: View side-by-side additions and deletions between the new code and the previous version.
- Weekly Summary Email: A report with key metrics: total output counts and the sources with the largest data swings.
To learn more about the product update, visit our blog (link in the 🧵).
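As an illustration of what a "largest data swings" metric in a weekly summary could look like, here is a minimal Python sketch. The post does not say how Reworkd computes it, so everything here is an assumption: the `largest_swings` helper, the per-source count dictionaries, the percent-change definition, and the `shop-*.example` source names are all hypothetical.

```python
from typing import Dict, List, Tuple


def largest_swings(
    previous: Dict[str, int], current: Dict[str, int], top_n: int = 3
) -> List[Tuple[str, float]]:
    """Rank sources by absolute week-over-week percent change in output count."""
    swings = []
    for source in set(previous) | set(current):
        before = previous.get(source, 0)
        after = current.get(source, 0)
        if before == 0:
            # Assumption: with no baseline, a new source counts as +100%.
            change = 100.0 if after > 0 else 0.0
        else:
            change = (after - before) / before * 100.0
        swings.append((source, round(change, 1)))
    # Largest absolute change first; ties broken by source name.
    swings.sort(key=lambda item: (-abs(item[1]), item[0]))
    return swings[:top_n]


# Hypothetical output counts per source for two consecutive weeks.
last_week = {"shop-a.example": 1200, "shop-b.example": 800, "shop-c.example": 500}
this_week = {"shop-a.example": 1180, "shop-b.example": 200, "shop-c.example": 750}

summary = {
    "total_output": sum(this_week.values()),
    "largest_swings": largest_swings(last_week, this_week, top_n=2),
}
print(summary)
```

A sharp drop like the one for `shop-b.example` is the kind of signal such a report would surface, since it often means a scraper has silently broken.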
Reworkd retweeted
If you want to join an early-stage YC startup, I’ve handpicked 15 Series A companies that are currently hiring right now. Using GPT-4.1, I built scrapers that have already pulled 125 open roles and I’ll keep adding more over the next few weeks.
1) @usepylon - The only customer support platform built for B2B.
2) @gumloop - A no-code platform for automating workflows with AI
3) @mintlify - The modern standard for documentation
4) @infisical - Open-source secrets manager for developers
5) @resend - Email for developers
6) Recall.ai - The universal API for meeting video, audio and metadata
7) @happyrobot - AI Communication. Built for Logistics.
8) @join_arc - Banking & Funding for Startups
9) @gethockeystack - The operating system for inbound B2B revenue
10) @NumeralTax - SaaS & Ecommerce sales tax on autopilot.
11) @AlloyAutomation - Alloy is a platform for building and managing SaaS integrations
12) @try_glimpse - Glimpse offers AI services for retail brands starting with deductions
13) @buildwithfern - Instantly offer SDKs and API Docs
14) @reductoai - The most accurate API to parse documents
15) @DetectDeepfakes - Enterprise Deepfake Detection
16) @AstroMechanica - Supersonics for the new Jet Age
Want the full list and scraper access? Comment “Scraper” below, and I’ll send it over. I built this using @ReworkdAI
23 Apr 2025
Build your own custom scraper at app.reworkd.ai/

An ecommerce tech company almost spent $450K on 3 engineers to build scrapers for 200 sites. GPT-4.1 now does it with just a single SWE.
- Agents auto-generate site-specific Playwright code
- Auto-heal when selectors break & redeploy on the fly
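The post does not describe how the auto-healing works. As a rough illustration of the underlying problem, the Python sketch below shows a much simpler technique, selector fallback: try an ordered list of candidate selectors and use the first one that still matches. The `first_working_selector` helper, the `query` callable, and the fake page are all hypothetical; in a real scraper `query` would wrap a browser lookup, and a system like the one described would presumably regenerate the code rather than rely on a fixed list.

```python
from typing import Callable, Optional, Sequence, Tuple


def first_working_selector(
    query: Callable[[str], Optional[str]], candidates: Sequence[str]
) -> Tuple[str, str]:
    """Return (selector, value) for the first candidate selector that matches."""
    for selector in candidates:
        value = query(selector)
        if value is not None:
            return selector, value
    # Nothing matched: this is the point where regeneration would be needed.
    raise LookupError(f"no selector matched: {list(candidates)}")


# Hypothetical page after a redesign: the old ".price" class no longer exists.
fake_page = {"[data-testid='price']": "$19.99", "h1.title": "Blue jacket"}

selector, value = first_working_selector(
    fake_page.get,  # stand-in for a real DOM query
    [".price", "span.product-price", "[data-testid='price']"],
)
print(selector, value)
```

Logging which candidate finally matched makes it possible to promote that selector to first position on the next deploy.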
16 Apr 2025
March Product Update 🚀
We’ve officially launched our self-serve scraping platform, and now anyone can try it out.
👉 Try the interactive demo (in the 🧵)
What’s new:
- Chat-based agent for easier debugging
- Real-time streaming: no more page refreshes
To learn more about the product update, visit our blog.
11 Mar 2025
We just launched on Product Hunt! You can support us by liking our Product Hunt post and leaving a comment. Our launch is live at producthunt.com/ 🎉
Reworkd is your scraping co-pilot. It understands website structures and auto-generates Playwright code to take actions, visit subpages, scrape, and save data based on your custom schema. With @ReworkdAI you can simplify web data extraction at scale. ycombinator.com/launches/N0m…
12 Feb 2025
How Lookbk Extracted 350,000 E-Commerce Products with Reworkd Lookbk came to us because they were spending 40 hours every month fixing their existing web scrapers when websites changed. With plans to scale their data pipeline 10×, they needed a solution that could keep up. "Before Reworkd, we spent countless frustrating hours fixing scrapers every time a site changed—now it’s automated. Their advanced captcha solving also unlocks data from sites we couldn’t access before. Scaling our data pipeline is suddenly no longer an issue." - @caelin_sutch, Co-founder of Lookbk 👇 Check the link in the 🧵 to read the full case study.
7 Feb 2025
January Product Update 🚀
We've rolled out several major updates over the past month:
📋 Review Flow: A built-in review system for the QA team. Verify scraped data directly on the platform with full history.
🛡️ Anti-Bot Solution: Improved browser stealth capabilities to bypass even the most heavily protected sites.
⚡ Performance Boost: Increased the platform’s ability to handle an order of magnitude more load per day.
We are launching our self-serve tool on March 11th. DM us or email us at srijan@reworkd.ai for early access.
To learn more about the product update, visit our blog (link in the 🧵).
24 Jan 2025
How @Axis_HQ Automated Regulatory Data Scraping from 2,500 Sites with Reworkd Axis came to us after their previous vendor couldn't keep up with their data extraction needs. Scaling to 2,500 sites seemed daunting - but in just a few weeks, we automated the entire process, helping them extract over 5M data points. "The Reworkd team is extremely responsive, and their fully managed solution means we don’t have to worry about the quality assurance of our data. Combine that with their competitive pricing, and it was a no-brainer for us." - Mishaal Al Gergawi, CEO of Axis 👇 Check out the full case study here: reworkd.ai/blog/how-axis-aut…
Reworkd retweeted
Our founding engineer built the platform's dashboard within his first two weeks. One month in, he's crushing Linear tickets faster than we can create them🫡 Certified 🐐
12 Dec 2024
We've rolled out many major updates over the past few months, including:
- a brand-new dashboard page
- code templating functionality
- much-improved exporting options
Check out the thread 🧵 below for more info:
12 Dec 2024
3. Exports
Easily track and manage all of your exported data from our new exports page.
- Select the exact site(s) you want data from.
- Export the data in your preferred format: JSON or CSV.
- Pick a date to export data scraped after that day.
- Access export history and more.
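The export options above (site selection, JSON or CSV, a scraped-after date) can be sketched in Python using only the standard library. This is not Reworkd's API: the `export_records` function, the record fields (`site`, `scraped_on`, `title`), and the sample data are assumptions made for illustration.

```python
import csv
import io
import json
from datetime import date
from typing import Dict, List, Set

FIELDS = ["site", "scraped_on", "title"]  # hypothetical record schema


def export_records(
    records: List[Dict[str, str]],
    sites: Set[str],
    scraped_after: date,
    fmt: str = "json",
) -> str:
    """Keep records from the chosen sites scraped strictly after the given day."""
    selected = [
        r
        for r in records
        if r["site"] in sites and date.fromisoformat(r["scraped_on"]) > scraped_after
    ]
    if fmt == "json":
        return json.dumps(selected, indent=2)
    if fmt == "csv":
        buffer = io.StringIO()
        writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
        writer.writeheader()
        writer.writerows(selected)
        return buffer.getvalue()
    raise ValueError(f"unsupported format: {fmt}")


# Hypothetical scraped records.
records = [
    {"site": "shop-a.example", "scraped_on": "2024-12-01", "title": "Red scarf"},
    {"site": "shop-a.example", "scraped_on": "2024-12-10", "title": "Blue jacket"},
    {"site": "shop-b.example", "scraped_on": "2024-12-11", "title": "Green hat"},
]

csv_export = export_records(records, {"shop-a.example"}, date(2024, 12, 5), fmt="csv")
print(csv_export)
```

Treating the date as an exclusive lower bound follows the wording "scraped after that day"; an inclusive bound would be an equally reasonable reading.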
12 Dec 2024
If you're interested in trying any of these features out, book a meeting here (reworkd.typeform.com/to/qscf…) or send us an email at srijan@reworkd.ai. We’d love to hear your feedback and help you get started!
Reworkd retweeted
24 Jul 2024
Excited to announce our $2.75 million seed round from investors like @paulg himself, @natfriedman, @danielgross, @ycombinator, and many more ⚡️