Latest benchmarks show Tessa leads on success rate by 12%. Try it now!
If you have tried building browser agents into your vertical AI application, you know how tricky they are. The last thing you want is for the agent to “act up” in your browser and do something other than what you asked for. You need a web agent that is smart, reliable, and accurate.
You need the best one out there, and according to the latest benchmarks, that agent is Tessa.
For 8 months running, we have maintained state-of-the-art performance among browser agents. We first achieved a SOTA success rate of 93% on the WebVoyager benchmark in December 2024, before OpenAI's Computer Use Agent and the current wave of browser agents entered the market.
Now, with intense competition and industry attention, our latest benchmarking shows we are still leading by a significant margin.
The REAL benchmark evaluates web agents in complex, real-world scenarios and is scored on mostly classical, deterministic criteria. The latest results show Tessa achieves 54.5% success rate—a 12% lead over Anthropic’s computer use. If you are looking for a browser agent for your vertical AI application, Tessa is your best bet to get the task done.
The following two graphs plot each model's performance against its cost. We recommend Sonnet 4 for most cases, but Gemini 2.5 Flash, o3, and Opus 4.1 are all good choices along the price-performance frontier.
We pioneered a unique inference-time scaling architecture for browser agents, named RAVE (Reason, Act, Verify, and Evaluate), which helped us maintain the lead. Read our technical deep-dive blog post for the full methodology here.
We have been using it to build transformative workflows for our clients over the past few months. Now you have access to it too through a simple API.
import requests

YOUR_API_KEY = "..."  # replace with your own API key
url = "https://api.heytessa.ai/v1/run_browser_agent"
headers = {"Authorization": f"Bearer {YOUR_API_KEY}"}
directive = "Go to news.ycombinator.com and get the titles and points of the top 3 stories on the homepage"
result = requests.post(url, headers=headers, json={"directive": directive}).json()
print(result['history_url'])  # link to the recorded run
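For production use you will probably want a timeout and explicit error handling around that call. The following is a minimal sketch, not an official client. The endpoint, the `directive` field, and the `history_url` key come from the snippet above. The helper names, the 600-second timeout, and the assumption that failures surface as HTTP error codes or as a response missing `history_url` are illustrative guesses.

```python
API_URL = "https://api.heytessa.ai/v1/run_browser_agent"  # endpoint from the snippet above


def build_request(directive: str, api_key: str) -> dict:
    """Assemble keyword arguments for requests.post (hypothetical helper)."""
    if not directive.strip():
        raise ValueError("directive must not be empty")
    return {
        "url": API_URL,
        "headers": {"Authorization": f"Bearer {api_key}"},
        "json": {"directive": directive},
        # Assumption: a browser run can take minutes; tune this value yourself.
        "timeout": 600,
    }


def extract_history_url(payload: dict) -> str:
    """Return the history_url field, or raise if the response lacks it."""
    try:
        return payload["history_url"]
    except KeyError:
        raise RuntimeError(f"unexpected response keys: {sorted(payload)}") from None


def run_browser_agent(directive: str, api_key: str) -> str:
    """Send one directive and return the URL of the recorded run."""
    import requests  # imported here so the helpers above work without the dependency

    response = requests.post(**build_request(directive, api_key))
    response.raise_for_status()  # raise on HTTP 4xx/5xx instead of parsing an error body
    return extract_history_url(response.json())
```

Calling `run_browser_agent("Go to news.ycombinator.com and get the titles and points of the top 3 stories on the homepage", YOUR_API_KEY)` should then return the same `history_url` that the shorter snippet prints. Keeping request construction and response parsing in separate functions makes both easy to unit-test without touching the network.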
Legal AI: Research case law across Westlaw, LexisNexis, court databases, or your internal databases to retrieve information.
Financial Applications: Aggregate real-time data from trading platforms, regulatory filings, and financial databases.
Healthcare: Navigate patient portals, insurance systems, and medical databases with the reliability healthcare workflows demand.
E-commerce Intelligence: Monitor competitor pricing, inventory, and product data across hundreds of sites.
Research Platforms: Gather information from academic databases, journals, and specialized repositories.
Try our API today and see why teams choose Tessa for production web automation.