ML Engineer & PhD in NLP

Joined August 2009
32 Photos and videos
7 Apr 2024
Now, with voice technology and and AI for grammar correction, I find I can produce 3-4x more content without worrying about minor language issues — between 2,000 to 3,000 words! 🚀This has allowed me to express my ideas more fully and creatively.
1
1
152
7 Apr 2024
Try out Voice Writer today: efficientnlp.com/voice-write…

1
120
7 Apr 2024
I've been using Voice Writer for my blog posts, and I am much more productive. 🌟 Before, my posts averaged around 750 words.
1
121
26 Mar 2024
I built a voice writer tool to help you write things quickly. ⚡️ It uses AI for speech recognition and grammar correction. I have been using it for my book reviews, emails, Slack messages, and more. Here is a demo video. 😊 Try it out here: efficientnlp.com/voice-write…
1
146
1 Mar 2024
In this video, I cover the top 10 most cited papers in the history of natural language processing, ranked by number of Google Scholar citations. 📚 We cover milestones like the Transformer model, RNN, word vectors, and even go back to the roots with WordNet!
1
91
21 Jan 2024
🎥 New Video! In this video, we train a speech recognition model (using OpenAI's Whisper) to recognize our family's Chinese dialect, Teochew, or Chaozhou dialect (潮州话). It has about 10 million speakers and is a part of the Min Nan language family. youtube.com/watch?v=JH_78KmP…

1
2
196
21 Jan 2024
Challenges we faced: - Teochew is related to Mandarin, a high-resource language, but how do we apply transfer learning? - With zero resources for training, we had to build our dataset from scratch. 🛠️ - Teochew doesn't even have a writing system! How do we model that? 🤔
96
6 Nov 2023
📹 New Video! Ever had trouble deciding which AI to use for your projects? Let's solve that with AI. In this video, I will build an AI to find the best AI for you 🤯 youtu.be/2r-SqtxhgmY
1
1
140
6 Nov 2023
More seriously - we'll use the RAG pattern, indexing HuggingFace metadata, integrating OpenAI embeddings with pgvector and chat models. I'll also explain some tips on how to rerank the chatbot's suggestions, deploy the project efficiently, and more.
127
12 Oct 2023
📹 New Video! #LLMs can be slow, so this @GoogleDeepMind paper proposed to speed it up by running two LLMs at the same time. 😕 Wait what?
1
123
12 Oct 2023
It's a new technique called speculative sampling. A smaller LLM generates the easier tokens and a larger LLM checks them. And using a rejection sampling trick, there is no difference in accuracy! Check out my video on how this works ➡️youtube.com/watch?v=S-8yr_Ri…
113
29 Aug 2023
Just published a comprehensive video highlighting EVERY area of Natural Language Processing research, in 24 categories. From Phonology to Translation to Summarization to LLMs, explore all of of NLP in 30 minutes!
88
8 Aug 2023
Ever since its introduction in 2017, the transformer architecture has remained largely unchanged...until now. youtube.com/watch?v=o29P0Kpo…

1
2
126
8 Aug 2023
In this video, I explain RoPE - Rotary Positional Embeddings. Proposed in 2022, RoPE is making its way into LLMs like Google's PaLM and Meta's LLaMa. I unpack the magic behind rotary embeddings and reveal how they combine the strengths of both absolute and relative embeddings.
2
115
22 Jul 2023
📹 New Video: Understanding the KV Cache Why is it so difficult to run LLMs with longer context windows? 🤔 The culprit behind this memory hog is none other than the KV cache. In this video, I explain this essential component and its impact on GPU memory during inference.
1
1
147
22 Jul 2023
Watch the video here ➡️ youtu.be/80bIUggRJf4 📈 Don't forget to subscribe and hit that like button if you find this content informative!
1
92
30 Jun 2023
💡 New Video! How to Optimize Neural Networks for Inference: 4 Top Strategies 🎥
1
105
30 Jun 2023
In this video, I discuss techniques to boost the speed and compress the size of your AI model for inference: from quantization to pruning, knowledge distillation, and engineering optimizations like GPU acceleration and fused kernels.
1
85