Guides · February 19, 2026 · 10 min read

Eight ways to build a browser agent

A tour of the patterns teams use to put a real browser behind an LLM.

Priya N.
Developer Relations

There is no single right way to build a browser agent. Here are eight patterns we see in the wild, from the simplest to the most ambitious.

  • Prompt-to-action: let the model emit clicks and types directly
  • Tool-calling: expose navigation and extraction as functions
  • DOM-to-JSON: hand the model a cleaned page and ask for data
  • Vision-first: screenshot, reason, act
  • Record-and-replay: capture a flow once, parameterize it
  • Skill-based: call named, versioned tasks
  • Human-in-the-loop: pause for approval on risky steps
  • Fleet: run many of the above in parallel
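To make two of these patterns concrete, here is a minimal sketch of tool-calling combined with a human-in-the-loop gate. Everything in it is an assumption for illustration: `FakeBrowser`, `run_tool_call`, `RISKY_TOOLS`, and the shape of the `call` dictionary are hypothetical names, not part of any real SDK. A production agent would swap the fake for a real browser session and the `approve` callback for an actual review step.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class FakeBrowser:
    """Hypothetical stand-in for a real browser session."""
    url: str = "about:blank"
    log: List[str] = field(default_factory=list)

    def navigate(self, url: str) -> str:
        self.url = url
        self.log.append(f"navigate {url}")
        return f"loaded {url}"

    def click(self, selector: str) -> str:
        self.log.append(f"click {selector}")
        return f"clicked {selector}"


# Assumption: clicks are treated as risky and need approval; navigation does not.
RISKY_TOOLS = {"click"}


def run_tool_call(
    browser: FakeBrowser,
    call: Dict[str, Any],
    approve: Callable[[Dict[str, Any]], bool],
) -> str:
    """Dispatch one model-emitted tool call, pausing for approval on risky tools."""
    tools: Dict[str, Callable[..., str]] = {
        "navigate": browser.navigate,
        "click": browser.click,
    }
    name = call["name"]
    if name not in tools:
        # Report unknown tools back to the model instead of raising.
        return f"error: unknown tool {name}"
    if name in RISKY_TOOLS and not approve(call):
        return f"skipped {name}: approval denied"
    return tools[name](**call["args"])
```

The model only ever sees named functions and their string results, and the approval callback is the single place where a human (or a policy) can veto a step before it touches the page.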

Pick by failure mode

Choose the pattern that fails in a way you can tolerate. A crawl can afford to skip a page; a checkout flow cannot. Ferr gives you the same browser under all eight.
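One way to encode that choice is an explicit failure policy on the runner. The sketch below is an illustration under stated assumptions: `run_steps`, `StepFailed`, and the `"skip"` / `"abort"` policy names are hypothetical, and each step is simply a callable that raises on failure.

```python
from typing import Callable, Iterable, List, Tuple


class StepFailed(Exception):
    """Raised when a step fails under the 'abort' policy."""


def run_steps(
    steps: Iterable[Tuple[str, Callable[[], None]]],
    on_failure: str,
) -> Tuple[List[str], List[str]]:
    """Run named steps in order; return (completed, skipped) step names.

    on_failure="skip"  -> record the failure and continue (crawl-style).
    on_failure="abort" -> stop at the first failure (checkout-style).
    """
    if on_failure not in ("skip", "abort"):
        raise ValueError(f"unknown policy: {on_failure}")
    completed: List[str] = []
    skipped: List[str] = []
    for name, step in steps:
        try:
            step()
            completed.append(name)
        except Exception as exc:
            if on_failure == "abort":
                raise StepFailed(f"{name} failed: {exc}") from exc
            skipped.append(name)
    return completed, skipped
```

A crawl would call `run_steps(pages, on_failure="skip")` and inspect the skipped list afterwards; a checkout flow would use `"abort"` so that a failed step never lets later steps run against a half-completed page.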
