State of Mind
Posts
Computer Use and the Future of Autonomous Commerce Operations

Computer Use and the Future of Autonomous Commerce Operations

Inside Anthropic's Builder Day: The Future of AI Agents Is Closer Than We Think

Dom Steil
November 05, 2024

This past weekend I attended Builder Day at Anthropic HQ and spent some time hacking on the new Computer Use Demo.

Saturday morning at Anthropic HQ

Computer Use

TLDR: The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can manipulate a computer desktop environment. There will now be a combination of API based function calling & browser based computer use agents. Both being orchestrated in a multi-agent system to create the future of business automation.

Imagine telling your AI assistant to handle that tedious onboarding paperwork you've been putting off, or to automatically fill out those repetitive forms across different systems or to select a contact reason and auto-close tickets from a queue that has been building up over the weekend.

That's not science fiction anymore – it's what Anthropic demonstrated with their new Computer Use capabilities.

The goal is the ability to take human intent and translate those to agentic outcomes. We can think of these new forms of agents as virtual coworkers that work along side us.

At StateSet we are building the best AI agents to power the fastest growing DTC brands in the world. We have designed the perfect engine for deterministic workflows and generative, personalized responses.

Computer Use API… Something New

Unlike traditional RPA and BPM tools that take a while to setup and need complex API integrations, this new Computer Use AI can work with existing websites and legacy systems directly through the browser. Directly through the browser!!! It’s pretty amazing and I am already seeing endless use cases of how we can apply this internally and to our customers.

Our mission is to create the Autonomous Commerce Operating System and Computer Use API is going to be a part of that going forward.

Think of it as having a highly competent virtual coworker who can handle repetitive tasks in the browser while you focus on more strategic work. It is early days but the core use case there.

Tips & Limitations of Computer Use Today

No account creation or social media posting (safety first)
Keyboard shortcuts preferred over mouse movements (reliability)
Built-in verification steps (trust but verify)
Focus on non-time-critical tasks (practical boundaries)

These aren't just limitations – they're features that make the technology more reliable and trustworthy in real-world applications. So it’s important we have this used for repetitive tasks to free up time with the right guardrails in place.

Real World Applications of Computer Use

Automated onboarding processes
Automatically Closing Tickets
Repetitive data entry tasks
Monitoring Social Media Comments
Form filling across multiple systems
Legacy system integration without API development

Practical Implementation Tips:

Break down complex tasks into well-specified steps
Verify outcomes explicitly rather than assuming success
Leverage keyboard shortcuts over mouse movements for UI interactions
Include example screenshots for repeatable tasks

What This Means For The Future of Autonomous Commerce Operations

The implications are profound. We're moving beyond the era of AI assistants that can only chat or generate content. We're entering a world where AI can actually do things in the interfaces we use on a daily basis. We are already using AI Agents on a daily basis for coding, search, updating data and document processing. We have autonomous AI Agents that can complete tasks, handle workflows, and interact with existing systems that may not have APIs available or that may just be browser based.

At StateSet, we are moving towards are world where we will be orchestrating multiple AI Agents on our behalf of our customers to give them the next-generation technology that will power them as they scale. This will be across every modality and now as we can see in the browser as well. It’s sort of like an RPA driven macro machine but with the additional intelligence of being able to perceive the browser and take the next best action.

My general thoughts on Computer Use

- Start with well-defined, repeatable tasks that take time

- Build verification into your workflows.

- Use screenshot examples in your prompts.

- Focus on non-critical but repetitive processes first

Looking Ahead

What we saw at Builder Day isn't just another AI update – it's a glimpse into a future where AI agents can truly understand and execute human intent. But perhaps more importantly, it's a future being built with careful consideration of both capabilities and limitations.

The future of human-AI collaboration isn't just coming – it's already here. And if Builder Day is any indication, it's going to be more practical, more reliable, and more transformative than anyone expected.