Aji Ghose

Aji Ghose

26 Photos and videos

Tweets

Pinned Tweet

Aji Ghose @AI_Modeller

15 May 2019

On Monday I passed my PhD viva with flying colours - so happy! 🥳

122

Natasha Malpani 👁

Aji Ghose retweeted

Natasha Malpani 👁

@natashamalpani

18 Oct 2025

karpathy says we’re a decade away from AGI, because we don’t yet know how to make systems learn continuously. the deeper problem is that we’ve built this entire field on metaphors, not mechanics. we keep saying AI can think, reason, remember, create. but those are human verbs, not model capabilities. AI isn’t intelligent. it’s efficient. it doesn’t reason . it pattern-matches. it doesn’t remember. it reconstructs. it doesn’t reflect. it re-runs. we confuse language with understanding. just because a model can describe thought doesn’t mean it’s having one. real intelligence has intent. it knows why it’s thinking. AI predicts what comes next. and yet, even without intent, these systems are starting to functionally mimic cognition. they reason, recall, and reflect. not consciously, but effectively. that’s why both statements can be true. AI is a bubble. because capital, hype, and valuations have outpaced genuine capability. but it’s also here to stay. because the direction of progress is right. the crash will clear the noise. what remains will be systems that truly learn. memory that compounds, feedback that refines, intelligence that grows by living inside workflows. we’ll look back on this phase the way we look at the early web: messy and magical. the beginning of machines that finally learn, not just perform.

Dwarkesh Patel

@dwarkesh_sp

17 Oct 2025

The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self driving took so long 1:57:08 - Future of education Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

2:26:08

110

198

1,652

264,265

Richard Sutton

Aji Ghose retweeted

Richard Sutton

@RichardSSutton

27 Sep 2025

💯

Chris Hayduk

@ChrisHayduk

27 Sep 2025

Everyone posting about the Dwarkesh interview (including Dwarkesh himself!) is missing this subtle point. When LLMs imitate, they imitate the ACTION (ie the token prediction to produce the sequence). When humans imitate, they imitate the OUTPUT but must discover the action

660

101,278

Richard Sutton

Aji Ghose retweeted

Richard Sutton

@RichardSSutton

26 Sep 2025

Dwarkesh and I had a frank exchange of views. I hope we moved the conversation forward. Dwarkesh is a true gentleman.

Dwarkesh Patel

@dwarkesh_sp

26 Sep 2025

.@RichardSSutton, father of reinforcement learning, doesn’t think LLMs are bitter-lesson-pilled. My steel man of Richard’s position: we need some new architecture to enable continual (on-the-job) learning. And if we have continual learning, we don't need a special training phase - the agent just learns on-the-fly - like all humans, and indeed, like all animals. This new paradigm will render our current approach with LLMs obsolete. I did my best to represent the view that LLMs will function as the foundation on which this experiential learning can happen. Some sparks flew. 0:00:00 – Are LLMs a dead-end? 0:13:51 – Do humans do imitation learning? 0:23:57 – The Era of Experience 0:34:25 – Current architectures generalize poorly out of distribution 0:42:17 – Surprises in the AI field 0:47:28 – Will The Bitter Lesson still apply after AGI? 0:54:35 – Succession to AI

1:07:09

203

3,589

654,835

Dmitrii Kovanikov

Aji Ghose retweeted

Dmitrii Kovanikov

@ChShersh

11 Sep 2025

Many people think CS is just DSA. It's not. My CS curriculum had: 0. Data Structures and Algorithms 1. Calculus 2. Linear Algebra 3. Physics 4. Mathematical Physics 5. Differential Equations 6. Theory of Functions of a Complex Variable 7. Theory of Probability and Mathematical Statistics 8. Calculus of Approximations 9. Functional analysis 10. Numerical Methods 11. Optimisation Methods 12. Cryptography 13. Compression Algorithms 14. Game Theory 15. Algebra and Geometry 16. Discrete Mathematics 17. Automata-based Programming 18. Formal Language Theory 19. Compilers and Interpreters 20. Computational Complexity Theory 21. Math Logic 22. Type Theory 23. Lambda Calculus 24. Functional Programming 25. OOP Design Pattern 26. Machine Learning 27. AI 28. Hardware Architecture 29. Java 30. C 31. C 32. Computational Geometry 33. Operating Systems 34. Computer Networks 35. Databases, Relational Algebra, and SQL 36. Parallel Programming 37. Economics 38. Organisation and management of business processes 39. History 40. Philosophy 41. English 42. Health and Safety Training Course 43. PE

114

301

3,448

217,232

Jack Morris

Aji Ghose retweeted

Jack Morris

@jxmnop

2 Aug 2025

hypothetical two-year degree in AI: - Coding in Python - Semiconductors 101 - Intro to machine learning - Intro to data science - Data visualization & dimensionality reduction - Machine learning engineering - Language models - Deep learning (basics & architectures) - Reinforcement learning - Computer vision - Generative modeling - Robotics & Planning - LLMs 1 (Pre-training) - LLMs 2 (Post-training) - GPU Architecture & Intro to CUDA - Energy & Datacenters - AI Governance - AI Safety - Federated & private learning - Advanced CUDA

989

68,610

Georgios Voulgaris

Aji Ghose retweeted

Georgios Voulgaris @GeorgiosVoulga1

3 Aug 2025

Replying to @SchmidhuberAI @JFPuget @ylecun

The Neocognitron was trainable, but not via backpropagation. It used analogue threshold units, not ReLU, sigmoid, or tanh. CNNs became scalable and trainable with backpropagation, notably through LeNet.

4,555

JFPuget 🇫🇷🇺🇦🇨🇦🇬🇱

Aji Ghose retweeted

JFPuget 🇫🇷🇺🇦🇨🇦🇬🇱

@JFPuget

3 Aug 2025

Replying to @SchmidhuberAI

True, but these CNNs were not trainable. AFAIK @ylecun is the one training CNNs for the first time.

12,139

Ajit

Aji Ghose retweeted

Ajit @dead_relu

4 Jul 2025

The downfall is real 😬

204

22,438

Sherpa

Aji Ghose retweeted

Sherpa

@LLMSherpa

5 Jul 2025

If Claude is really doing so much of the coding for Anthropic, why haven't they used it to create a fucking ui for Claude Code? It's 2025. Why the fuck am I forced to use a cli for everything as if it were 1995?

606

168

6,931

1,032,821

Santiago

Aji Ghose retweeted

Santiago

@svpino

12 Apr 2025

"pip install" is dead. "uv add" is the new king.

185

229

3,574

512,488

@levelsio

Aji Ghose retweeted

@levelsio

12 Apr 2025

Sergey Brin and Larry Page must be rolling in their graves now

Klaas

@forgebitz

10 Apr 2025

google is entering its yahoo phase

275

433

12,264

1,607,421

Olivia Guest · Ολίβια Γκεστ

Aji Ghose retweeted

Olivia Guest · Ολίβια Γκεστ @o_guest

11 Apr 2025

Out last week in @cogsci_soc! @samhforbes and I ”urge extreme caution in the use of LLMs in classrooms lest we further normalize: pupils losing their privacy, reducing contact between learner and educator, deskilling teachers & polluting the environment”‼️ PDF/DOI links below.

Huettig and Christiansen in an earlier issue argue that large language models (LLMs) are beneficial to address declining cognitive skills, such as literacy, through combating imbalances in educational equity. However, we warn that this technosolutionism may be the wrong frame. LLMs are labor intensive, are economically infeasible, and pollute the environment, and these properties may outweigh any proposed benefits. For example, poor quality air directly harms human cognition, and thus has compounding effects on educators' and pupils' ability to teach and learn. We urge extreme caution in facilitating the use of LLMs, which like much of modern academia run on private technology sector infrastructure, in classrooms lest we further normalize: pupils losing their right to privacy and security, reducing human contact between learner and educator, deskilling teachers, and polluting the environment. Cognitive scientists instead can learn from past mistakes with the petrochemical and tobacco i

ALT Huettig and Christiansen in an earlier issue argue that large language models (LLMs) are beneficial to address declining cognitive skills, such as literacy, through combating imbalances in educational equity. However, we warn that this technosolutionism may be the wrong frame. LLMs are labor intensive, are economically infeasible, and pollute the environment, and these properties may outweigh any proposed benefits. For example, poor quality air directly harms human cognition, and thus has compounding effects on educators' and pupils' ability to teach and learn. We urge extreme caution in facilitating the use of LLMs, which like much of modern academia run on private technology sector infrastructure, in classrooms lest we further normalize: pupils losing their right to privacy and security, reducing human contact between learner and educator, deskilling teachers, and polluting the environment. Cognitive scientists instead can learn from past mistakes with the petrochemical and tobacco i

3,431

Indian Tech & Infra

Aji Ghose retweeted

Indian Tech & Infra

@IndianTechGuide

5 Apr 2025

🚨 A message in startup mahakumbh event at Bharat Mandapam, New Delhi.

629

3,178

23,950

1,317,323

Jeremy Howard

Aji Ghose retweeted

Jeremy Howard

@jeremyphoward

6 Apr 2025

Replying to @JeffDean

Yes I can; as can you. But I'm primarily interested in what's widely available in the community, where a single 4090 GPU machine is already a very rich investment. Remember also that 3090s were the last consumer card with nvlink, so 4090 and 5090 cards aren't good at multi gpu

214

15,028

Pedro Domingos

Aji Ghose retweeted

Pedro Domingos

@pmddomingos

29 Mar 2025

Neural networks don’t have to be distilled into other neural networks. They can be distilled into decision trees or sets of rules. And then interpretability becomes dramatically easier.

1,333

146,189

MIT CSAIL

Aji Ghose retweeted

MIT CSAIL

@MIT_CSAIL

21 Mar 2025

Happy birthday Joseph Fourier, whose 1822 equation allows us to listen to mp3s today: bit.ly/22kbNfi

602

2,937

254,199

Sanchit Ahuja

Aji Ghose retweeted

Sanchit Ahuja @SanchitAhuja7

12 Mar 2025

Our team at Microsoft Research India is looking for a Research Intern for a 6 month position. The position will be on-site in BLR. You will get to work on multilingual data, modelling and evals. Please DM me with a short blurb about yourself and your CV/Resume.

202

15,797

Micah G. Allen

Aji Ghose retweeted

Micah G. Allen

@micahgallen

10 Mar 2025

Can we enhance interoception by controlling the heart? Thrilled to share our new study, led by @AshleyTyrer, where we use computational modeling to show that blockading peripheral noradrenaline uniquely alters awareness of heart rate & breathing! biorxiv.org/content/10.1101/… 🧵👇

10,267

Richard Sutton

Aji Ghose retweeted

Richard Sutton

@RichardSSutton

7 Mar 2025

I am pretty happy with this 30-minute summary of my views on the current state of AI and alignment. youtube.com/watch?v=w177Ov-Y…

PTJC 20th Anniversary: Distinguished Lecture: Dr. Rich Sutton

Dr. Richard Sutton is Professor of Computing Science at the Univers...

youtube.com

103

723

122,758

Jack Morris

Aji Ghose retweeted

Jack Morris

@jxmnop

7 Mar 2025

someone made a Lego Transformer

1,043

72,423