There is a lot of unconscious emphasis on the DeepSeek model being "Chinese", with an implicit connection to the Sino-US relationship or GPU power.
In my eyes, the success of DeepSeek has little to do with that. It is simple intelligence and pragmatism at work: given limited computation and manpower, produce the best outcome with smart research. The same was true of AlexNet, when Alex Krizhevsky needed to make magic with 2 GPUs and not a supercluster.
There are a lot of super smart AI people and companies in the world. Among people of Chinese ethnicity, those I have had the privilege to work with include (but are not limited to):
- Kaiming He who is the OG of modern computer vision.
- Song Han who founded DeePhi and OmniML and is now a professor at MIT.
- the DMLC folks who created early frameworks like MXNet and TVM.
- Bing Xu who did MXNet, was a coauthor of GAN, founded HippoML and is now at NVIDIA.
- Orbeus, a startup on early CV applications and now the foundation of AWS Rekognition.
And many more. They excel at the frontier of AI, whether it's research, product, small startups, or big companies.
AI should bring us closer together rather than drive us apart. I was saddened by the discriminatory comments made by Professor Rosalind Picard at NeurIPS, but was too busy to put my thoughts together and say something. Looking back at 2024, I think what really stood out is the fundamental quest for AI breakthroughs - collect what we have, use our brains, and achieve our best. It's like the Olympics: faster, higher, stronger, together.