🚨 Want ONE BILLION free LLM tokens a month without juggling a dozen different APIs?
Now you can *legally* unlock that massive inference capacity by combining the free tiers of Google, Groq, SambaNova, Mistral, and GitHub Models.
The only problem is the headache of managing all those different rate limits and SDKs.
FreeLLMAPI (built by Tashfeen Ahmed) does the heavy lifting for you. It’s an open-source proxy that unites all your free keys behind a single OpenAI-compatible endpoint.
→ No code changes: Just swap out your base_url.
→ No rate-limit errors: When one provider is throttled, it automatically falls back to the next.
→ No overages: Tracks usage per key so you stay within each free tier.
→ No broken context: 30-minute sticky sessions keep your chat coherent.
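Curious how fallback, usage tracking, and sticky sessions can fit together? Here is a minimal Python sketch of the idea. It is NOT FreeLLMAPI's actual code: the `Provider`, `Router`, and `RateLimited` names, the quota numbers, and the `call` callback are all hypothetical, and real provider calls are replaced by a fake function so the example runs offline.

```python
import time
from dataclasses import dataclass, field


class RateLimited(Exception):
    """Raised by a provider call when its free-tier rate limit is hit."""


@dataclass
class Provider:
    name: str
    monthly_token_quota: int  # hypothetical free-tier cap, not a real figure
    used_tokens: int = 0

    def has_budget(self, tokens: int) -> bool:
        # True if this request would still fit inside the free tier.
        return self.used_tokens + tokens <= self.monthly_token_quota


@dataclass
class Router:
    providers: list
    sticky_seconds: float = 30 * 60  # 30-minute sticky sessions, per the post
    _sessions: dict = field(default_factory=dict)  # session_id -> (name, last_used)

    def _ordered(self, session_id, now):
        # Try the session's pinned provider first while the pin is still fresh.
        pinned = self._sessions.get(session_id)
        if pinned and now - pinned[1] < self.sticky_seconds:
            first = [p for p in self.providers if p.name == pinned[0]]
            rest = [p for p in self.providers if p.name != pinned[0]]
            return first + rest
        return list(self.providers)

    def complete(self, session_id, prompt, est_tokens, call, now=None):
        now = time.time() if now is None else now
        for p in self._ordered(session_id, now):
            if not p.has_budget(est_tokens):
                continue  # skip providers that would exceed their free tier
            try:
                reply = call(p.name, prompt)
            except RateLimited:
                continue  # fall back to the next provider
            p.used_tokens += est_tokens
            self._sessions[session_id] = (p.name, now)
            return p.name, reply
        raise RuntimeError("all providers exhausted or rate limited")


def fake_call(provider_name, prompt):
    # Stand-in for a real API call: pretend the first provider is throttled.
    if provider_name == "groq":
        raise RateLimited()
    return f"[{provider_name}] echo: {prompt}"


router = Router([Provider("groq", 1000), Provider("mistral", 1000)])
print(router.complete("chat-1", "hi", 10, fake_call))
```

In this sketch the throttled first provider is skipped, the second one answers, and the session stays pinned to it for the next 30 minutes, so follow-up messages land on the same model.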
Because the model behind each request can change, it isn’t stable enough for a live production environment. But it is the perfect sandbox for prototyping, testing, and running evals with zero API costs.
Best part?
It's 100% free and open-source.
Repo link in 🧵↓