We designed a GPT-1 style transformer model from scratch and ran our first pre-training experiment.
Today, I interacted with the model for the first time.
In this video, I shared the first results, what worked, and where we’re heading next.
Big thank you to everyone supporting the research, contributing resources, donating, and following the journey this early.
We’re building a language model in public because Africa needs more people participating in the future of Artificial Intelligence. So we’re demystifying the process and open-sourcing what we learn so anyone can study it, replicate it, and build on it.
In this video, I break down the three stages of building an AI model and where we currently are in the journey.