OpenAI has officially launched its multimodal AI agent product, capable of autonomously navigating websites, filling out forms, and completing multi-step tasks on behalf of users. The agent combines vision, language understanding, and action capabilities to interact with web interfaces in real time.
In demonstrations, the agent successfully booked travel arrangements, compared insurance quotes, and filed government forms with minimal human intervention. OpenAI reports that the system achieves a 92% task completion rate on standardized web navigation benchmarks.
Privacy and security experts have raised concerns about the implications of AI agents operating independently on the open web, particularly around credential management and the potential for automated systems to be exploited by malicious actors.