Papers
arxiv:2602.14296

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Authors:
,
,
,
,
,
,
,
,
,
,
,

Abstract

AutoWebWorld synthesizes verifiable web environments using finite state machines and coding agents, enabling efficient training of autonomous Web GUI agents through automated trajectory generation and verification.

AI-generated summary

The performance of autonomous Web GUI agents heavily relies on the quality and quantity of their training data. However, a fundamental bottleneck persists: collecting interaction trajectories from real-world websites is expensive and difficult to verify. The underlying state transitions are hidden, leading to reliance on inconsistent and costly external verifiers to evaluate step-level correctness. To address this, we propose AutoWebWorld, a novel framework for synthesizing controllable and verifiable web environments by modeling them as Finite State Machines (FSMs) and use coding agents to translate FSMs into interactive websites. Unlike real websites, where state transitions are implicit, AutoWebWorld explicitly defines all states, actions, and transition rules. This enables programmatic verification: action correctness is checked against predefined rules, and task success is confirmed by reaching a goal state in the FSM graph. AutoWebWorld enables a fully automated search-and-verify pipeline, generating over 11,663 verified trajectories from 29 diverse web environments at only $0.04 per trajectory. Training on this synthetic data significantly boosts real-world performance. Our 7B Web GUI agent outperforms all baselines within 15 steps on WebVoyager. Furthermore, we observe a clear scaling law: as the synthetic data volume increases, performance on WebVoyager and Online-Mind2Web consistently improves.

Community

Paper author Paper submitter

We introduce AutoWebWorld, a framework that synthesizes controllable and intrinsically verifiable Web GUI environments by specifying websites as Finite State Machines (FSMs) and translating them into runnable interactive sites with coding agents. AutoWebWorld enables an automated search-and-verify pipeline (BFS over the known transition graph plus execution filtering), producing 11,663 reproducibly verified trajectories across 29 diverse synthetic websites at about $0.04 per trajectory. Training on this data substantially improves real-world web agent performance (our 7B agent outperforms all baselines within 15 steps on WebVoyager), and we observe consistent scaling trends as synthetic data increases on WebVoyager and Online-Mind2Web. Code and resources are available on the project page; feedback and contributions are welcome.

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2602.14296 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2602.14296 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2602.14296 in a Space README.md to link it from this page.

Collections including this paper 1