ChatGPT agent mode includes a view on the browser inset that shows all of the logical steps and screenshots the AI walked through to complete a task. Users can also scrub back through the live video to view the steps in real time
When in logic mode, ChatGPT explains how it processed the task and reached its conclusion, allowing users to surgically identify where logic may have gone awry
ChatGPT relies on an auto-router to select the model it thinks is best for the task, but users can override it. The details it provides are not super helpful to the user though
ChatGPT takes a similar approach as Claude but its modes and other complications are introduced in a much more straightforward way. To make it easy to understand its supported use cases, suggestions are shown immediately after selecting one of the modes
ChatGPT demonstrates how followups can be built into the system prompt instead of relying on the interface. As voice models and interactions become more common, this approach allows the model to keep the user engaged without relying on direct input of text
ChatGPT Operator Mode lives seamlessly within the conversational interface. Users can observe as the AI navigates the multiple browser windows, taking action until it reaches the end of the task or a step that requires user intervention.