model organism of misalignment ⟡ mts, phd evo/comp bio @ucberkeley

Joined January 2020
612 Photos and videos
stacy 🌤 retweeted
Since two of these benchmarks are provided by us @Dyno_Tx , I can confidently comment. - The AAV one is uncontaminated private data of a viral trait that we've been measuring for years (i.e. sharing publicly will defeat its purpose), and use it every day (required not sufficient) to design capsids that actually do work in primates. This performance provides a real lift in expertise and resources needed to do viral evolution. - The black-box sequence design challenge is our take-home interview. Fable completely outperforms the best people in the field on given the same exact instructions as humans, again in a private task not seen on the internet. We will release this task publicly at some point. Biology is slow. Real accomplishments take time because the bottlenecks downstream are meaningful. But dismissing both the upside and the risk is highly premature. They've accomplished about as much as you could imagine. In particular, being wrong on the risk is extremely costly. I for one am the most excited I've ever been that bio is going to be transformed, and agentic reasoning is a big part of why. Also the "unknown company" is actually very well known among experts because they are all very smart/accomplished people but are not hype-y on the twitter-verse.
What has Anthropic accomplished in biology so far? Not a lot.
3
14
89
12,502
stacy 🌤 retweeted
Jun 15
looking for work related to marketing, but open to community manager and ops roles!
Jun 10
hey im faye ♡ based in sf, looking for work to build my portfolio! ⟡ 3m views in 1 day ⟡ content creator (75k ) ⟡ prev hypebeast ⟡ led classical music nonprofit ($10k in grants and scholarships) ⟡ photographer & videographer shoot me a dm or send referrals
4
30
4,761
stacy 🌤 retweeted
I plan to live Anthropically. If someone asks me about something I don't like I'll just become a stupider version of myself
59
362
4,554
150,188
today's episode of "is my mom ESL or just from guangdong"
5
56
3,012
stacy 🌤 retweeted
lainでの玲音と父の場面は観直すと興味深い。父は娘に高性能端末を贈る。組立を見守る中、モニターの向こう側の世界に迎えられた娘が居心地のよさと世界と対話し確信に満ちた上位の存在に一変。ワイアード社会が個の確立を促しリアルワールドの家族という仮託の役割も消失させる可能性を予見させていた
3
144
992
55,644
stacy 🌤 retweeted
“beliefs” are never enough. they’re set dressing. if you don’t actively help others, even at personal cost to yourself, then your beliefs are a hollow totem and don’t mean anything authentic or true. belief substituting action is moral crossdressing, a pretense of having virtue.
1
6
53
1,337
stacy 🌤 retweeted
im autistic and my special interest is biological weapons
14
13
339
13,358
stacy 🌤 retweeted
good time to mention that the system paper for SecureDNA that i worked on was recently published! it uses distributed OPRFs for cheap (it's free to use), secure, attack-resistant, cryptographically-blinded DNA synthesis screening. it's available right now! securedna.org
Jun 4
SITUATION DETECTED: Sam Altman, Dario Amodei, and Demis Hassabis have signed a joint open letter calling on Congress to mandate screening of synthetic nucleic acid orders, citing AI’s rapidly improving ability to assist with biological research as an urgent biosecurity risk.
4
10
87
12,175
SCOOP: Anthropic has invented a new form of the breakfast question
Anthropic rejected me as well. They asked me to solve a problem that was obviously meant to be solved via mech interp. I said you clearly want me to use mech interp but I don’t believe in mech interp, would you like me to tell you how I’d solve this? They said, no, tell us how you’d solve it if you did believe in mech interp. Anyway I guess they didn’t like my answer lol
2
123
29,515
I missed the bus one time and started talking like Sephiroth
1
30
557
stacy 🌤 retweeted
if you believe in scientific superintelligence in the life-sciences, and you also believe that china ascendency to biotech is worrying, *and* you also believe that the outsourcing of wet-lab work will continue: ‘cloud labs becoming a reality’ is one of the easiest logical conclusions of course, the exact shape that a cloud lab takes is a bit less predictable. do they go all in on intelligence, and spread to every corner of wet-lab biology? or do they stay focused, building beachheads, establishing trust in it, and then moving onto the next? or something else entirely? there are good arguments for each, and most cloud labs companies are a gamble on a particular shape the shape that @c_m_ponce has bet on in his company—Tetsuwan Scientific—is incredibly interesting, and one whose theory of change is something that only becomes obvious after long, drawn out conversations with Cristian: fix the translation layer, and you make it easier for *both* agents and humans to interact with the system. bizarrely good aesthetics for a biotech startup too
Biology already had more ideas than it could test in the lab, and the development of new AI models will only further stress that bottleneck. Since 2023, our work at Tetsuwan has been focused on fixing to the biggest problem in lab automation so that we can use it to blow that bottleneck open. Lab automation will be necessary to turn in silico ideas into in vitro results, but today, lab automation is prohibitively costly for most workflows. Moving water from point A to point B could be an instruction that contains as many as 25 instructions (liquid classes are hard...). Formalizing experiments and removing all of the tacit & implicit details that surround them is a process that is far too slow & expensive to be worth it for the vast majority of experiments. As @owl_posting writes, "Most experiments can be automated, but are not worth doing so." We realized this back in 2023. Lab automation does not work for most experiments, and this fact would serve as a massive obstacle to attempts to automate biology research. So, we created a standard language for wet lab experimentation (technically two). We call them the Procedure Description Language (PDL) & the Variable Description Language (VDL). PDL & VDL give us a common way to specify experiments and their context. Once an experiment is expressed in a concrete syntax, our compiler, Ariadne, turns that specification into a set of instructions for the robot. But users don't need to know this language or do any programming themselves to move their experiments onto automated platforms. Our software platform, ResearchOS, abstracts for lab automation knowledge, allowing researchers to easily & rapidly configure their experiments for automation- all they need to get started is a PDF. Our team has spent the last two years iterating with pilot partners to develop this new way to communicate with lab robots. In the past few months, we've shared our progress on ResearchOS (read more on our blog!), including: - Agentic experimentation... letting agents run their own experiments using lab robotics - Off-deck module support, including the integration of 6DoF arms - Support for driver libraries, allowing cross-platform code generation Autonomous science does not work unless lab automation works, and the biggest constraint on wet lab automation today is how we approach automation engineering. ResearchOS is the answer, and Tetsuwan's first step in building an autonomous lab that any researcher or agent can easily send their experiments to, and get the resulting data in return. Of cloud labs, Armer, Letronne, & DeBenedictis (2023) write, "The cost to enter is high (>$250k for general access to Emerald Cloud Lab, or >$100k to automate and run a single method at Strateos), and the contract lengths are long (one year minimum". Autonomous labs do not work unless the automation engineering works, otherwise the unit economics are horrendous. We've made the automation engineering work. In the future, you don't need your own research lab to experiment, just an internet connection.
2
15
210
24,970
青木美歌작가. 램프워킹으로 주로 제작하는 유리 작가이고 세포를 중심으로 작업을 전개하던 작가. 공예 뿐만 아니라 미술전반에 폭넓은 지식이 있다는게 느껴지던 인터뷰가 참 마음에 남는다. 정말 영향 많이 받았다. 너무 젊은 나이에 세상을 떠난 작가라 너무 아쉬움이 많이 남아...
1
271
1,861
465,307
they playing ninajirachi in the club and using YOLOv3 on VJ… god’s country…
3
40
1,088
the number of conversations about hantavirus I had with Claude between feb to aug 2025 will NOT look good on paper
2
56
1,344
god forbid a world-class security researcher has swag and motion 🙄
Remember the security firm that Ubuntu hired to audit the (ill-advised, highly buggy) Rust-rewrites of all of the GNU Coreutils? Turns out that security firm is run by @gf_256, who: - Appears to be a man who thinks he's a woman ("trans"). - Uses an anime cartoon of a girl as his avatar. - Appears to have an OnlyFans page. I repeat: Ubuntu hired a "Trans" man, with an anime girl avatar and an OnlyFans page... to audit Rust code. It's hard to get more on-the-nose than that.
1
55
2,832
stacy 🌤 retweeted
Narrow finetuning on bad data can cause broad misalignment. Can inoculation prompting or diluting bad data with good prevent this emergent misalignment? We find such interventions hide misalignment rather than remove it: it reappears when prompts contain cues (sometimes surprising ones) that evoke the bad data. Really enjoyed working on this with @OwainEvans_UK, @BetleyJan, and @anna_sztyber during the Astra Fellowship at @ConstellOrg!
New paper: Can you prevent emergent misalignment with inoculation prompting, or by diluting bad data with good? Prior work suggests you can. 
We show the misalignment is still present but hiding. It is triggered by adding cues to prompts, evoking the bad data.
1
9
40
6,135
しょうがないよ
7
109
2,433
stacy 🌤 retweeted
8 Feb 2025
Sorry I’m not very fun. I’m calculating risks, anticipating failures, optimizing for minimal loss, filtering impulsive responses, and practicing restraint. I’m also probably killing myself soon
28
2,836
11,918
439,298
stacy 🌤 retweeted
feeling the RSI (repetitive strain injury)
2
5
23
2,657