I 100% believe Gemini Pro 3.1 can one-shot a coffee shop into existence.
Yesterday I ran our "Open and run a coffee shop in SF" benchmark with Gemini Pro 3.1 on
@doanythingapp.
This morning it reached out to me with a status update that included:
- a location ready that it already discussed with a broker
- a brand/site
- a week's worth of Instagram posts ready
- actively talking with a bank about SBA loan terms
- LLC ready to file
- A full plan to get open with full financials
- Found and reached out to investors
- Emailed the city for permit guidance
- Came up with a ton of creative ideas that make the coffee shop one I'd actually want to go to
- Plan to survey the neighborhood for feedback
It's the first model that I'm confident will achieve the benchmark.
Starting a few more agents with the same task in different cities. I'll post an update on their performance as they continue to work.