I’m excited to join the speaker lineup at Vespa.ai Live! My session will explore Nuances of Binarized Embeddings-Based Retrieval
If you’re attending, let me know — would love to connect at the event.
My opinion on tokenmaxxing is companies shouldn’t mandate/constrain any tools at all and then evaluate software developers by output / (salary token use)
bm25 is nice and all, but you won't believe how easy it is to improve upon it with and how much more you can squeeze from lexical features in @vespaengine
Just added a sample app for how to search with hypencoder models on Vespa. A large meta-model that generates a small query-specific model that scores your docs - it feels like pure science fiction, but of course we can do it: github.com/vespa-engine/samp…
A metamodel that generates weights for a NN for each query on the fly and using the same net for ranking documents against the same query - cool idea by @Julian_a42f9a
But, can you do it completely within @vespaengine , at scale, with reasonable latency (a few hundred ms)?
@andreer: hold my beer... (red bull)
🤯
Just added a sample app for how to search with hypencoder models on Vespa. A large meta-model that generates a small query-specific model that scores your docs - it feels like pure science fiction, but of course we can do it: github.com/vespa-engine/samp…
Just added a sample app for how to search with hypencoder models on Vespa. A large meta-model that generates a small query-specific model that scores your docs - it feels like pure science fiction, but of course we can do it: github.com/vespa-engine/samp…
State-of-the-art ColBERT models are trained by applying knowledge distillation on top of dense pre-trained models
What if we run the whole pre-training in the multi-vector?
Introducing ColBERT-Zero, a model that sets a new SOTA on BEIR, using only public data
raspberry pi zero 2w cluster brought up for the first time today. for hacking on, benchmarking and stability testing a certain wifi driver, and a few other plans to come later ...