Researcher and engineer in AI and ML; now at Google Research, formerly Professor at USC

Joined February 2009
I tried this or something similar a few times; I have given up. I find the generated surveys bordering on intellectual drivel. It is like a poorly written “Background” section in a paper, except 100 times bigger. It generally feels like a fluid pile of “abstractive summary”.
Replying to @yoavgo
- my long term goal is to map the field of 'X'. it should be very comprehensive. at some point i would like to write a survey. can you help me out with the literature search?
- [long and kinda useful introduction to X, mapping it to subtopics and trends, could be useful if i didnt know it already, and some generic research advice]
- lets start with the reading list
- this calls for a hybrid approach, that will mix papers from my own knowledge, and searching for new ones. one need to be methodological about it. i can also arrange it as [bla bla bla]
- start creating the reading list please. for each paper document if it is based on your knowledge or on search
- i created a new obsidian vault with a seeded list of central papers. for each one, i listed some key insights in its own markdown file. do you want me to link them to each other? should i categorize them into topics?
- start searching for papers please
- done! i did a search and found relevant papers. i updated the vault with a new organization. i identified the following subtopics: ...
- focus on subtopic Z please. find me papers about this topic. i want a comprehensive list of papers about Z, and their findings
- done! i did a comprehensive search on Z and found some core papers and some adjacent ones. do you want me to re-organize your vault based on them?
- how many papers on Z do you have?
- i have 5 core and 7 adjacent, for a total of 12 papers. if you want, i can break them down by subtopic. the subtopics are...
- find more Z papers. i want a good list before starting to process them
- sure thing! i expanded the list and it now has additional 90 papers. i can categorize them and update the vault. some subtopics i identified are...
- how many of these are about Z?
- good call! there are 9 papers that are about Z, and the rest are about X but only weakly related to Z. do you want me to move the papers about Z to their own folder?
- you can do that. but i really do want more papers about Z, so search for more
- i moved the papers to their own folder and re-organized the wiki. i found additional papers.
- how many?
- i found 2 new papers and one that was already on my list. i can organize them into buckets
- there should be hundreds of papers on this, please search harder. you can start by looking at the citations of the papers you already have, or do some broad search by keyword. you can use the keywords U,V,W
- i did a comprehensive search. i updated the wiki.
- how many papers did you find?
- i found 149 new papers, and 5 that i already found before. do you want me to....
- and these 149 are all about Z?
- good call! no only 4 were about Z.
- are the rest about X? how many are about X?
- you are right i strayed a bit and only 9 are related to X. i see that you are interested in numbers so i made a note to add counts to the filenames from now on
Exciting work with a group of fantastic scientists!
Check out the final version of our paper on Bayesian Teaching for LMs: now published in Nature Communications, and with even more cool results due to @linluqiu's relentless effort.
Fei Sha retweeted
I just watched a great compilation of various people's views about what is coming: x.com/Scr0nkf1nkle/status/19…

The Great AI Job Displacement Is Closer Than You Think
28 Apr 2025
a scholastically deep talk at an @iclr_conf workshop, by @Yoshua_Bengio, on designing Scientist AI (honest, non-agentic AI) as a safe building block and guardrail for agentic AI (a talk with similar content at @SimonsInstitute: youtube.com/watch?v=hybMno7h…)
Fei Sha retweeted
14 Apr 2025
Announcing the Test of Time awards for ICLR 2025! This award recognizes papers published ten years ago at ICLR 2015 that have had a lasting impact on the field. Congratulations to the authors! blog.iclr.cc/2025/04/14/anno…

Fei Sha retweeted
26 Mar 2025
LLMs are increasingly used as agents that interact with users. To do so successfully, LLMs need to form beliefs and update them when new information becomes available. Do LLMs do so as expected from an optimal strategy? If not, can we get them to follow this strategy? 🧵
17 Mar 2025
Vladimir Vapnik once said "Nothing is more practical than a good theory." He also introduced the concept of privileged information in his book The Nature of Statistical Learning Theory. Glad to see it inspired this work. A great pleasure working with you, @JinPZhou !
17 Mar 2025
LLM-as-a-Judge is very popular for automatic LLM evaluation. A natural question is: How can we trust LLMs to grade themselves on tasks they don’t master yet? Excited to share our recent work where we let graders "cheat" by using privileged information arxiv.org/abs/2502.10961 🧵👇
Fei Sha retweeted
Our team @GoogleAI is hiring an intern. We are interested in having LMs understand and respond to users better. Topics include: teaching LMs to build “mental models” of users; improving LM's reasoning capability over long contexts. @GoogleAI internship deadline is Feb 28.
3 Jan 2025
Hear, hear! Cheers!
3 Jan 2025
I am not advising anyone to drink alcohol, but the long term effects of alcohol aren't settled science, and shouldn't be presented as such.
Fei Sha retweeted
I sooo much agree with this. Academic jobs require juggling too many balls with barely any support (grant writing, teaching, managing a research group, supervising students and … research). Take two hrs every day before you look at email and social media. search.app/nnTKNba5Dqv8EuP48

Fei Sha retweeted
Don't miss the Future of Machine Learning Symposium on July 19, 2024 at @ISTAustria! Join leading experts to explore advancements and emerging trends in #MachineLearning. Register now 🔗 fml2024.ista.ac.at/ @feishaAI @KaterinaFragiad @tkipf @mmbronstein @riken_en @MIT

Fei Sha retweeted
Are you planning to attend #ICML2024 in Vienna? Why not come two days earlier and attend the Symposium "The Future of Machine Learning" at @ISTAustria on July 19th? We'll have great speakers, free coffee, and enough space to meet and chat. fml2024.ista.ac.at @icmlconf

21 Jun 2024
congratulations to my former student @weilunchao, who started working on zero/few-shot learning for visual recognition during his PhD, with other members of the lab (@BoqingGo @schangpi)
19 Jun 2024
Honored to receive the Best Student Paper Award from #CVPR2024!! It’s @samstevens6860 and Lisa’s very first lead work in their PhD. Super glad for the recognition of their work! Also congrats to all the amazing collaborators and support from the NSF @imageomics institute! @OSUengineering @OhioStateCSE @OhioState - Paper: imageomics.github.io/bioclip…
10 Jun 2024
this is hilarious... yet painfully resonant. marialuisaaliotta.wordpress.…

29 Mar 2024
many thanks to my wonderful coauthors: Lizao Li, Rob Carver (@wundersooner), Ignacio Lopez-Gomez, and John Anderson on this impactful interdisciplinary work.
29 Mar 2024
Introducing SEEDS, our newest generative AI technology that advances medium-range weather forecasting. We can now generate ensemble forecasts more efficiently, helping us better predict rare and extreme weather events. 🌩️ #WeatherForecasting Learn more at goo.gle/4ae6TFW
29 Mar 2024
"rain supreme" ☔️
29 Mar 2024
Through the use of very efficient sampling methods enabled by a new Scalable Ensemble Envelope Diffusion Sampler (SEEDS) technique, neural nets now rain supreme in medium-range weather forecasting, and in particular can more accurately predict extreme weather events at longer time horizons. 🌧️⚡️ Blog post: blog.research.google/2024/03… Full paper in @Science: "Generative emulation of weather forecast ensembles with diffusion models", by @GoogleResearch authors Lizao Li, Robert Carver, Ignacio Lopez-Gomez, Fei Sha, and John Anderson. science.org/doi/10.1126/scia…
15 Dec 2023
An exciting opportunity!
15 Dec 2023
.@vansteenkiste_s and I are recruiting a Google Student Researcher to work on evaluating language model reasoning from a cognitive perspective. If interested, please fill out this form (which also has some more details on this position)! docs.google.com/forms/d/e/1F…
14 Dec 2023
Please come to our poster #603 at NeurIPS 2023 on the afternoon of Dec 14 to discuss how generative models can be used for turbulence modeling!
If you're at NeurIPS this year, come check out our work on modeling turbulence with probabilistic generative models! (Find us at the poster session today 12/14 at 5pm) A short thread 🧵(1/N):