Co-founder @GetUnstract. Writes unixism.net, a Linux-focused tech blog. Recovering grammar and spelling nazi.

Joined October 2008
Pinned Tweet
10 May 2020
Lord of the io_uring: io_uring tutorial, examples and documentation. unixism.net/loti/ Discuss here on HN: news.ycombinator.com/item?id… CC: @axboe

Don't vibe code and drive.
The host of In Our Time, one of the best podcasts (from BBC or otherwise), Melvyn Bragg, retires. Apparently the podcast will continue under a new host. Big shoes to fill. bbc.co.uk/mediacentre/2025/m…
Shuveb Hussain retweeted
🤖 Efficient Document Data Extraction with LLMs & Vector Databases Extracting structured data from unstructured documents is often tedious with traditional tools. @GetUnstract, using LLMs alongside Timescale Cloud, automates this process, removing the need for manual annotations. With Timescale’s pgvector and AI extensions, token costs are reduced significantly, making complex document extraction more efficient—even for lengthy files. 💡 Start exploring how Timescale Cloud can simplify structured data extraction from unstructured documents. 🔗 Link below. #LLMs #VectorDatabases #DataExtraction #TimescaleCloud #Data #Postgres
Shuveb Hussain retweeted
18 Sep 2024
Unstract structures documents for you. Sounds simple, but think about how many human jobs consist mostly of processing documents
Shuveb Hussain retweeted
29 Aug 2024
GPT-4o is bad at processing PDF documents. Whoever tells you otherwise is not living in the real world. In 2024, people fill out forms using pen and paper. Try to answer questions from those forms using modern models, and you'll be disappointed. I recorded a video to show you how to fix this. The answer is simple: stop letting the model see your PDF document. Instead, preprocess it and stick to showing the model text. Watch the video. You'll go from "THIS IS CRAP" to "EXCELLENT" in no time. I'm using @GetUnstract to turn the documents into text while keeping the original format (this is crucial!) You can use them to process up to 100 pages for free. They collaborated with me on this post. You can find the code I wrote right here: github.com/svpino/unstract-l…
Shuveb Hussain retweeted
🆕💡🎧 @shuveb @GetUnstract 🔑 Leverage LLMs for accurate data extraction ፨ Automate unstructured data workflows 🛑 Build custom data pipelines with a no-code platform 🔗 thedataexchange.media/unstra…
Shuveb Hussain retweeted
19 powerful sentences by Carl Jung that will change how you view the world:
📄 PDFs are tough to process. Unfortunately, PDF is a widely used format, and businesses have to deal with it all the time. So what are the challenges, and which libraries and services help with PDF processing? unstract.com/blog/pdf-hell-a… #pdf #llm #dataextraction #genai #rag
Shuveb Hussain retweeted
I know $12 USD is a lot of money for some people, so to celebrate 1000 sales (!!!!), I'm giving away 1000 PDF copies of How Git Works (honour system: only if $12 is a lot for you!) Here's the link, enter code BUYONEGIVEONE at checkout to get a free copy wizardzines.com/zines/git/ (it'll ask you for a billing address but you can enter a fake address if you'd prefer)
Shuveb Hussain retweeted
27 May 2024
Join Unstract at @ITI_Insurtech, NY 2024! Discover how to successfully navigate AI in automating insurance workflows with Kevin O'Brien and @narenism at #Booth 419. Ask us how to leverage AI to eliminate manual processes involving unstructured data. #insurtech
Shuveb Hussain retweeted
At Freshworks Chennai, every day is fun, but today is a special Madras Day celebration 🎉🎉 Happy Madras Day everyone!! - youtu.be/C8GMpDXkw94 #MadrasDay #freshworks

Shuveb Hussain retweeted
The Japanese multiplication method makes everybody feel "I wish they taught math like this in school." It's not just a cute visual tool: it illuminates how and why long multiplication works. Here is the full story.
Community note
Current Japanese education does not teach this method of multiplication. It was once discussed in Japan as "Indian-style multiplication." Where is it actually taught? nlab.itmedia.co.jp/nl/articles/14…
Shuveb Hussain retweeted
delighted to announce that my new zine "How Integers and Floats Work!" is out today! You can get it here for $12: wizardzines.com/zines/intege… It explains all of the surprising facts about how your computer does math, and it was SO fun to write.
Quick demo of @ZipstackHQ querying Salesforce live with SQL. One of the 270 sources you can live query with SQL. Do pretty much every data workflow with just SQL! More: zipstack.com/blog/salesforce… #datastack #etl #sql #salesforce #dataengineering
Shuveb Hussain retweeted
24 May 2023
Catching up with @shuveb of @ZipstackHQ and @chennamaneni of @thedarwinbox in San Francisco. #Infra and #SaaS💪🏽 @lightspeedindia
Shuveb Hussain retweeted
You'd think every computer should be able to divide two numbers, but early microprocessors didn't have division instructions. The Intel 8086 (1978) was one of the first with division. Let's look at how it implemented division and why division is so hard.
Shuveb Hussain retweeted
Moments of connection at #LIFTOFF w/ @ZipstackHQ co-founder @shuveb and @acceldataio co-founder & CTO #AshwinRajeeva. Inspiring to see founders come together to exchange ideas on building and innovating in the modern data stack.
Shuveb Hussain retweeted
17 Jan 2023
Rattle is a sales tool used by companies like Rippling, Terminus, Miro, WordPress, Clearbit, and many more. In the first part of our #GetSetGTM series, we uncover how @saheelaggarwal & team scaled it to over $1M in ARR in less than 12 months.