The problem: GUI models are all different. Some want pixel coordinates, others want percentages, some use weird tokens like <|loc095|>. We solved this with a universal interface that works the same across Anthropic, OpenAI, Hugging Face, and more:
agent = ComputerAgent(
model="anthropic/claude-3-5-sonnet-20241022",
tools=[computer]
)
2/5