16 best GitHub repos to build AI engineering projects!
(star bookmark them):
The open-source AI ecosystem has 4.3M repos now.
New repos blow up every month, and the tools developers build with today look nothing like what we had a year ago.
I put together a visual covering the 16 repos that make up the modern AI developer toolkit right now.
The goal was to cover key layers of the stack:
1) OpenClaw
↳ Personal AI agent that runs on your devices and connects to 50 messaging platforms
2) AutoGPT
↳ Platform for building, deploying, and running autonomous AI agents
3) Hugging Face Transformers
↳ The model framework for state-of-the-art ML across text, vision, audio, and multimodal
4) Ollama
↳ Run powerful LLMs locally on your hardware with a single command
5) LangChain
↳ The foundational framework for building agents and LLM-powered applications
6) Open WebUI
↳ Self-hosted, offline-capable ChatGPT alternative with built-in RAG and plugin system
7) ComfyUI
↳ Node-based visual workflow builder for AI image and video generation
8) Sim
↳ Open-source drag-and-drop workflow builder for creating and deploying AI agent pipelines
9) Opik
↳ Open-source platform to trace, evaluate, and monitor LLM apps and agentic workflows
10) Firecrawl
↳ Turn any website into LLM-ready markdown or structured data
11) Airweave
↳ Open-source context retrieval layer that syncs 50 data sources for AI agents
12) vLLM
↳ High-throughput, memory-efficient LLM serving engine for production deployments
13) Unsloth
↳ Fine-tune and run open models 2x faster with 70% less memory
14) OpenPipe ART
↳ Train multi-step AI agents for real-world tasks using reinforcement learning
15) OpenCode
↳ Open-source, provider-agnostic AI coding agent built for the terminal
16) Chandra OCR (by Datalab)
↳ State-of-the-art OCR model for complex tables, forms, handwriting, and 90 languages
These aren't just repos with high star counts but rather the building blocks behind most AI products shipping today.
If you're building anything with LLMs, agents, or RAG in 2026, you're probably already using a few of these.
P.S. This visual was inspired by a similar one from the ByteByteGo team. I extended it to be more engineering and builder-focused.
The AI industry has a dirty little secret:
Half of “AI engineering” is just fighting CUDA errors.
Wrong PyTorch wheels.
Broken NVIDIA drivers.
Mismatched CUDA versions.
Dependency hell that destroys your entire environment overnight.
Meanwhile you’re sitting there wondering why your GPU suddenly stopped existing.
This open-source project fixes that chaos in one command.
It scans your entire AI stack:
GPU → CUDA → cuDNN → PyTorch → TensorFlow → Docker
Then tells you exactly what’s broken and how to fix it.
Honestly one of the most useful AI dev tools I’ve seen this year.
github.com/mitulgarg/env-doc…