Hermes Agent + Steel Browser: Let Your AI Agent Browse the Web

Hermes Agent + Steel Browser: Let Your AI Agent Browse the Web

Apr 7, 2026

Apr 7, 2026

/

San Francisco

/

Dane Wilson

Nikola Balic

Nikola Balic

We added Steel as a cloud browser provider in Hermes, the open source AI agent from Nous Research. New SteelProvider, a steel_scrape tool, and setup wizard support. Set STEEL_API_KEY, pick steel as your browser provider, and Hermes routes all browser sessions through Steel with proxy and CAPTCHA solving built in.

You can try it today from the branch, or wait for the merge into main. We couldn't wait. Our team is already running it in their deployed Hermes agents, and we took it for a spin to show what it looks like.

Setup

Run hermes setup. The parts that matter:

  1. Pick your model.

  2. Under browser automation, select Steel.

  3. Paste your Steel API key. Get one free at app.steel.dev.

What happened

We ran Hermes and gave it one task:

Find me the best hotel near Grand Central New York and show me Steel's session URL so we can observe what the agent is doing.

Agent started working. It went to google.com/travel, searched for hotels, then navigated to Booking.com on its own. Nobody told it to go there. It decided that's where the data was.

This part was slow. LLMs are slow between actions. The agent was navigating, clicking through hotels, ruminating. The web is hard. For doing anything meaningful on it, you need a proper browser.

While this was happening, the live session was visible in Steel's dashboard.

Watching it work

You don't need to watch every click. But when something goes wrong, when the agent misclicks or gets lost in some modal, you need to see what it was looking at. That's the difference between debugging and guessing.

The integration pipes Steel's live viewer URL back into Hermes responses, so you get the link right in the agent's output. And if the agent hits bot detection, it knows to start stealth session or fall back to steel_scrape instead of retrying the same approach.

The agent built its own skill

After finishing the hotel search, Hermes created a skill called "hotel search" on its own. Nobody asked it to. It had lived through the whole navigation sequence and decided to package that up for next time.

We've seen the same pattern with Claude Code and Codex. Manual run, then script, then automation. The difference here is Hermes skipped being asked and just did the second step itself.

The scrape tool

The PR also includes steel_scrape, a dedicated tool for server-side content extraction. No browser session needed. Hit Steel's /v1/scrape API and get back pure markdown.

Markdown that an agent can actually consume. Give it a list of URLs, have it ingest them, build a knowledge base. The scrape tool works separately from browser sessions, so use whichever fits.

More than one session

This was one agent in one session. Steel runs browsers in the cloud. Let loose a fleet of agents doing parallel sessions at the same time.

With Steel you can bring your own credentials. You can keep sessions alive. Import local browser profiles with steel profile import and the agent inherits whatever accounts you're already logged into. Or create a dedicated profile with limited access.

Any agent, any model

Hermes is open source, in our case we are running an open model from Z.ai. Steel doesn't care what's behind the decisions. GLM 5.1 worked. GPT would work. Llama would work. If your agent can make tool calls, it can use Steel.

The browser is the browser.

Try it

Try it from the branch today, or wait for the merge into main.

Ready to

Build with Steel?

Ready to

Build with Steel?

Ready to

Build with Steel?

Ready to Build with Steel?

A better way to take your LLMs online.

© Steel · Inc. 2025.

All Systems Operational

Platform

Join the community