Joined December 2009
Bioinfhotep retweeted
Late-interaction retrieval is incredibly powerful, but scaling it is computationally challenging. k-means is a huge bottleneck. Our new architecture, TACHIOM, is fully open-source and tackles this problem: up to 247x faster clustering and 9.8x faster retrieval. ⚡ 🧵👇
3
23
120
8,338
Beware, #RStats users: according to Misanthropic, an R-to-C transpiler is a security risk 😄 Fable turned out, as expected, to be La Fable. I think some French speaker at Misanthropic is making some kind of joke with these model names.
45
Bioinfhotep retweeted
Arguably the most boring step in genomics is the first one: normalization. Settled science. Scale log. Move on. Except that there's been a huge blind spot in the field. And it matters for AIxBio. A 🧵 about what I think may be one of the most important papers I've written. 1/
18
147
670
113,914
Find it incredible that REVEL remains undefeated
GLM-Missense: A fine-tuned genomic language model captures nucleotide-level information overlooked by missense variant impact predictors biorxiv.org/content/10.64898…
1
104
Bioinfhotep retweeted
One billion 3D protein structures: Should you care about ESMfold2? youtube.com/shorts/ucRZwlQZc…
2
6
1,973
Bioinfhotep retweeted
"What if I took a physics test in French and a French test in physics?" Apparently the French language is not compatible with physics. aifails.substack.com/p/physi…
2
2
4
919
Bioinfhotep retweeted
Replying to @MilesCranmer
I think the basic issue in all such discussions is that these kinds of competencies aren't some kind of essential "human attributes". Measuring cognitive properties by humans is myopic and causes all kinds of pseudoproblems. The emerging field of diverse intelligence has better frameworks. A spectrum of highly variable intelligences, and lots of research on which kinds of architectures enable which kinds of patterns (of behavior, of computation, of physiology, etc.), is more useful for discussions of natural, artificial, and hybrid agents of varying provenance and composition. frontiersin.org/articles/10.… journal.frontiersin.org/arti… and more at drmichaellevin.org/publicati…, plus lots of other good labs.

10
18
209
9,773
Quite a nice crate! So I made un bebe vibeslop R binding with some generic hacks I use for multiversion SIMD support in external C bindings (where cargo-multivers does not apply: github.com/ronnychevalier/ca…) github.com/sounkou-bioinfo/R…
I put together a minimalistic pure-Rust, CPU-only implementation of the recent LFM2.5-8B-A1B language model from @liquidai that you can directly link into your Rust projects😎 github.com/maximecb/bebelm
1
7
2,134
Bioinfhotep retweeted
notes from my cascadiajs talk cascadia.wzrrd.sh/
2
7
110
17,022
Bioinfhotep retweeted
To all folks dealing with biological data: do you ever need to check if your reads contain barcodes/adapters/primers/...? Or off-target matches? Sassy is the tool to use! A super fast implementation of "approximate string matching" with a grep-like CLI. curiouscoding.nl/posts/sassy…
1
9
23
3,122
Bioinfhotep retweeted
Good software is invisible. That's why all software is bad: because the managers demand visibility of programmers' work.
76
230
3,456
94,997
Bioinfhotep retweeted
been working through lots of quadratic equations lately. not in school or anything, just exploring free will :) made a video about the quadratic formula, perfect squares, and DIY implementation in R. #rstats you might find it interesting, idk. youtu.be/CRyBiJfh9sU?si=_TdN…
1
5
210
Bioinfhotep retweeted
Our study led by @Yunfeng_Ruan describes a method of interpolated polygenic risk scoring (DiscoDivas) for more accurate scoring across ancestries, particularly in settings of admixture sciencedirect.com/science/ar… @AJHGNews @AniruddhPatelMD @skoyamamd @buutrg @somijemmacho @HornsbyWhitney @AndrewHaoyu @nilanjan10c
22
77
7,573
Bioinfhotep retweeted
Can CPUs outperform top-tier GPUs in protein folding? 🧬 Thrilled to share our new bioRxiv preprint: "Revisiting CPUs for Protein Folding: Xeon-Based Acceleration of AlphaFold2." We achieve a 1.76x speedup for the end-to-end pipeline over an H100 GPU baseline (FastFold)! 🧵👇 (1/n)
2
7
22
1,889
Bioinfhotep retweeted
I'm super excited to share a tool I've been working on for a while now: a full-stack population genetics library that is purpose-built for GPUs, pg_gpu biorxiv.org/content/10.64898…

1
24
80
11,563
Bioinfhotep retweeted
Analyzing phenotypes on the wrong scale reduces power and creates spurious genetic interactions. SIQReg, a new method from my postdoc Zhenhong Huang, learns an optimal phenotype scale to fix this: biorxiv.org/content/10.64898…

1
14
40
3,054
Bioinfhotep retweeted
DistPCA: Tera-Scale Genomic PCA via Out-of-Core Distributed Parallelism biorxiv.org/content/10.64898…
1
3
7
1,038
More fun :D
The standard GPT-5.5 reproduced the proof ~ 👇 chatgpt.com/share/6a0e9e04-8… You don't need to wait for oai's internal model!
1
169