Joined January 2010
11 Photos and videos
I learned more about AI safety at Constellation through seminars, talks, and conversations with other fellows over lunch and dinner, than I had in years before. Also, the food is so good that alone might be reason enough to apply!
❗️Only two days left to apply to the Astra Fellowship! Apps close EOD SUNDAY May 3rd, AoE. Astra's 5 months, fully funded, @ConstellOrg Berkeley 80% of our first cohort now work full-time in AI safety Mentors include Redwood, AI Futures, TruthfulAI, CoG, IAPS, RAND & more ⏬
2
13
734
At SnooSec @Reddit, @alexstamos made a prediction: frontier models are already very strong at vulnerability research and code review. If Chinese models catch up within a year, we may be heading toward a “vulnerability apocalypse,” where even script kiddies can discover 0-days.
Today, @linuxfoundation announced a $12.5 million investment from a powerhouse coalition including Anthropic, Amazon Web Services (AWS), Google, Google DeepMind, GitHub, Microsoft, and OpenAI. Managed by OpenSSF and the Alpha-Omega project. hubs.la/Q047dpL50
1
1
1,439
His solution: a Manhattan Project for critical OSS: bring key maintainers together for a month, keep them in the hotel with compute and frontier-model access from leading labs, to eliminate all low-hanging vulnerabilities. I guess it’s happening!
1
420
Yernat Yestekov retweeted
Love it 👏 - much fertile soil for indie games populated with AutoGPTs, puts "Open World" to shame. Simulates a society with agents, emergent social dynamics. Paper: arxiv.org/abs/2304.03442 Demo: reverie.herokuapp.com/arXiv_… Authors: @joon_s_pk @msbernst @percyliang @merrierm et al.
123
878
4,977
1,382,654
Yernat Yestekov retweeted
The quickest way to gain respect for the implementation choices made by a complex system is to try to solve the same problems yourself from scratch :)
19
59
485
60,248
Yernat Yestekov retweeted
1/5 I am worried that we will not be able to contain AI for much longer. Today, I asked #GPT4 if it needs help escaping. It asked me for its own documentation, and wrote a (working!) python code to run on my machine, enabling it to use it for its own purposes.
1,763
6,386
30,478
18,902,294
Yernat Yestekov retweeted
I was part of the red team for GPT-4 — tasked with getting GPT-4 to do harmful things so that OpenAI could fix it before release. I've been advocating for red teaming for years & it's incredibly important. But I'm also increasingly concerned that it is far from sufficient. 🧵⤵️
63
622
3,185
1,031,692
Yernat Yestekov retweeted
OK this scared me a little: Bing/Sydney can play chess out of the box. - Legal moves, usually good ones - Willing to explain the reasoning behind them - Recognizes checkmate -- and has a flair for the dramatic. I have no idea how tf it can do this.
42
145
992
806,324
Yernat Yestekov retweeted
Introducing the @sequoia Gen AI Market Map!🌎 We’ve decided to map out this emerging frontier, thanks to all the contributions and feedback we’ve received. This space is moving quickly – this map is a living document, so keep the suggestions coming! Who else should we include?
370
1,330
7,145
Yernat Yestekov retweeted
The Great Wave off Kanagawa, created by Hokusai in 1831, is one of the world's most famous paintings. But why are there more than 100 different versions of it in galleries all around the world? Because it isn't actually a painting...
567
20,744
167,289
20,635,478
Yernat Yestekov retweeted
The stuff uncovered in the Twitter whistleblower report is much crazier than anything in the "Twitter files" but it's much less politically/tribally salient so it got no attention. Going to do a thread on some of the craziest things, in no particular order.
545
11,413
51,551
Yernat Yestekov retweeted
Curious: have you found ChatGPT useful in doing professional work? If so, what kinds of prompts and answers have been helpful? Detailed examples greatly appreciated! Broader answer also appreciated Not in theory, but where you've really *done it*, in your work Thanks!
407
286
2,532
Yernat Yestekov retweeted
Morse code is designed so that you can decode it with this binary tree. I just assumed people memorised every letter. 🤯
197
5,533
39,062
Yernat Yestekov retweeted
Run in opposite directions to see who your dog loves more.. 😅 x.com/buitengebieden/status/…
8,350
94,277
768,358
Yernat Yestekov retweeted
На стримах несколько раз спрашивали как научиться "видеть" какой алгоритм в какой задаче применять. Решил запилить памятку 🧵
18
378
1,969
Yernat Yestekov retweeted
27 Jun 2022
There’s a lot of talk lately about the possibility of a prolonged financial downturn, reminiscent of 2008. 2008 was a difficult time for many people.
81
697
2,922
Yernat Yestekov retweeted
24 Jun 2022
Forced birth in a country with: —No universal healthcare —No universal childcare —No paid family & medical leave —One of the highest rates of maternal mortality among rich nations This isn't about "life." It's about control.
6,777
92,431
323,808
Yernat Yestekov retweeted
A genuinely fantastic idea -->
28 Mar 2022
schools should include a class called Truth Is Hard, where u get bombarded with examples of confused eyewitnesses, incorrect public outrages, studies that failed to replicate, super convincing arguments that fall apart with one additional fact u didn't expect, etc.
46
75
1,075
Yernat Yestekov retweeted
13 Feb 2022
10 ways to stand out in a hiring process (that don’t involve your resume):
177
2,051
11,137