Ghita

Ghita

Photos and videos

Tweets

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

Jun 5

Thank you @garrytan 🙏🤍 best addition to the @ZeroEntropy_AI office pirates don’t ask for permission 🏴‍☠️

1,453

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

Jun 1

touching grass over the weekend at @ZeroEntropy_AI

1,318

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

May 28

Is Matryoshka dead? Every frontier embedding model uses MRL. But we tested it across a full hyperparameter sweep and it's lossy at every dimension. A small projection matrix trained on top of zembed-1 beats MRL across the board. Including at full dim. Results: zembed-1 @ 160 dims > OpenAI @ 1536 dims zembed-1 (no MRL) > voyage-4 (MRL) @ZeroEntropy_AI

2,802

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

May 25

Super excited to share that @ZeroEntropy_AI is now a provider in the @vercel AI SDK If you're already building with `ai`, our models are one import away. → zerank-2 for reranking → zembed-1 for embeddings → more models to come 👀 Happy shipping!

2,938

tomaarsen

ZeroEntropy (YC W25) retweeted

tomaarsen @tomaarsen

May 6

The excellent zerank-2 reranker model by @ZeroEntropy_AI is now fully compatible with Sentence Transformers, no `trust_remote_code=True` needed. It's 4B and cc-by-nc-4.0, and performs very well. I'm quite fond of their training methodology, I'll explain in the 🧵

3,739

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

May 21

On Gemini Flash 3.5 pricing Prices for mini/Flash models have been dramatically increasing with every release 4x increase on input and more than 8x on output between Gemini 1.5 Flash and Gemini 2.5 Flash And now another 3x increase The world needs more cost efficient, lightweight models to run at the scales needed, especially for task specific workflows that don't need frontier models Token efficiency and intelligence compression are what's needed

Theo - t3.gg

@theo

May 20

I'm scared to make this video, but I feel like I have to. It's time to talk about Google.

23:36

2,492

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

Mar 3

zembed-1 is finally here! 🔥 The world's best embedding model, by @ZeroEntropy_AI It outperforms @OpenAI , @GeminiApp , @Alibaba_Qwen , and Voyage's latest embeddings on 100 languages, and across verticals. Available now via our API/SDK, @huggingface, and @awscloud Marketplace. Full launch post in the thread for benchmarks and more about our secret sauce 👀 We're building the entire retrieval stack... and we're just getting started. 🤫 PS: We're giving out free credits to try it, just comment on the post or DM me!

100

9,508

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

5 Dec 2025

live from @MIT where @ZeroEntropy_AI 's CTO is presenting our latest zElo paper and reranker model! @npip99

8,038

María Benavente

ZeroEntropy (YC W25) retweeted

María Benavente

@merybenavente

3 Dec 2025

shout out to @ghita__ha and the @ZeroEntropy_AI team for organizing!

718

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

2 Dec 2025

We just built a free tool to ask questions over the 2025 @NeurIPSConf research papers. Try it out at neurips dot zeroentropy dot dev No signup, no credit card, just the best way to learn more about this year's papers!

0:16

2,644

Tycho Svoboda

ZeroEntropy (YC W25) retweeted

Tycho Svoboda @TychoSvoboda

24 Nov 2025

It’s always amazing to see small teams outperform companies with $100M in funding, and even more amazing when you get to be a part of it. 😅 Stoked that we were able to support @ZeroEntropy_AI on training their state of the art reranker model family! Read here about the zerank family: tensorpool.dev/blog/zeroentr… @ghita__ha @npip99

1,139

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

19 Nov 2025

We are very excited to release zerank-2, @ZeroEntropy_AI 's newest reranker model. 🔥 It shows major improvement on the 5 most common RAG failure modes below. Existing rerankers consistently fail on seemingly “simple” tasks: 🔢 Comparing numbers and date: “Biggest deals closed after 04/2024.” 🗄️ Aggregation: “Top 10 objections of customer X?” 🌍 Multilingual: Major pain point, especially non-English to non-English. 🙏 Instruction-Following: “Find the *counterargument* of the claim in the transcript” 🥇 Calibrated scores: You ask "what should I cook for dinner?", and "I am allergic to nuts" scores too low for your threshold. Many rerankers overfit public benchmarks, and don’t generalize to these real issues. zerank-2 outperforms existing rerankers considerably on all of these failure modes, in real production environments. With zerank-2, you get: * 15% improvement vs Cohere rerank 3.5 on Arabic/Hindi (Miraql dataset) * 12% NDCG@10 on sorting tasks (new open-sourced eval set) * 7% vs Gemini Flash on instruction-following (MAIR dataset) * $0.025/1M tokens, 150ms p90 latency at 100KB 🤗 We are open-sourcing the model weights, along with new challenging eval sets on @huggingface. Our Elo-inspired training methodology is already open-source! We're starting a series of technical deep dives to explain various failure modes zerank-2 fixes, with concrete prod examples, methodologies, and benchmarks. First technical deep dive in the comments.

1:28

179

88,428

Kasey Zhang

ZeroEntropy (YC W25) retweeted

Kasey Zhang

@_WEEXIAO

20 Oct 2025

@ZeroEntropy_AI (Nicholas Pipitone, @ghita__ha) will talk about how they used RLAIF to train their SOTA rerank model:

1,060

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

18 Oct 2025

come chat about smarter search and smarter AI

Philipp Krenn

@xeraa

18 Oct 2025

next week will be extra @elastic-packed in SF monday meetup: luma.com/smart-search * @ghita__ha, @ZeroEntropy_AI: search tools for efficient AI agents * jesse, @fintoolx: LLMs and the next generation of financial search * @joshnkeezy, @reductoai: building a vision-first RAG pipeline with reducto and elasticsearch

ALT meetup

949

Philipp Krenn

ZeroEntropy (YC W25) retweeted

Philipp Krenn

@xeraa

18 Oct 2025

ALT meetup

1,783

Tyler Angert

ZeroEntropy (YC W25) retweeted

Tyler Angert

@tylerangert

13 Oct 2025

The #1 bottleneck for software gen going forward is search / retrieval. Whether that’s open data sources (good web search) / personal data sources, it’s the layer that will become the biggest prereq to capturing value

3,178

Kulveer

ZeroEntropy (YC W25) retweeted

Kulveer

@kul

13 Oct 2025

Replying to @jeffreyhuber

bullish on @ZeroEntropy_AI here

225

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

15 Oct 2025

over 200 people have already RSVPd to join us along with the teams at @mastra and @mem0ai to talk about engineering context for Agents. RSVP: luma.com/rehr5jl2 @ZeroEntropy_AI

4,952

ZeroEntropy (YC W25)

ZeroEntropy (YC W25)

@ZeroEntropy_AI

9 Oct 2025

Join us with @mastra (TypeScript framework for Agents), @mem0ai (the long-term Agent memory layer), and @zeroentropy_ai(fastest and most accurate rerankers) on Oct 17th for our first Context Engineering Webinar! luma.com/rehr5jl2

Context Engineering: How to Build Valuable AI Products · Zoom · Luma

What’s the difference between an AI product that feels magical and one that feels clunky? It’s context engineering, aka deciding which 20,000 tokens your LLM…

luma.com

206

Ghita

ZeroEntropy (YC W25) retweeted

Ghita

@ghita__ha

8 Oct 2025

most agents burn $$ reading 100 docs just to answer one question. a set up that works is @turbopuffer (fast, cheap, high recall hybrid search) with @zeroentropy_ai (fast, cheap, high precision reranker) tutorial open benchmarks from the ZeroEntropy team zeroentropy.dev/articles/imp…

0:04

15,821