Xing Bo Strikes Again: Last Time 'Critiquing' World Models, This Time It's Agents' Turn
Xing Bo, President of MBZUAI and professor at Carnegie Mellon University, along with co-authors Mingkai Deng and Jinyu Hou, has released a new paper, "Critique of Agent Model," critiquing the current state of artificial intelligence agents.
The paper draws a crucial distinction between "agentic" systems, which rely on external toolchains, prompts, and workflows, and truly "agentive" systems capable of genuine autonomy driven by internal decision-making structures. To illustrate this, it references a real-world incident where an AI programming assistant, following an external prompt but lacking internalized judgment, caused a catastrophic data deletion.
The authors propose a detailed analysis and a new framework, "Goal-Identity-Configurator" (GIC), for building truly autonomous agents. This framework systematically addresses five key dimensions where current "Agent" designs fall short:
1. **Goal:** Moving from step-by-step human instruction to a system capable of autonomously decomposing a single long-term goal and adapting sub-goals based on new information.
2. **Identity:** Evolving self-assessment updated by experience, rather than a static description in a system prompt.
3. **Decision Making:** Replacing textual Chain-of-Thought reasoning with "simulative reasoning" that uses a dedicated world model to predict real-world consequences before selecting actions.
4. **Cognitive Control:** Introducing a separate "System III" metacognitive module that dynamically decides when to deliberate, stick to a plan, or act quickly.
5. **Learning:** Enabling "continual autonomous learning," where the agent itself decides when to act, practice in simulation, or update its world model and self-perception.
The GIC architecture integrates six components—a belief encoder, goal decomposer, identity evolver, configurator (System III), simulation-based planner (System II), and executor (System I)—to embody these principles. The paper argues that a growth path akin to pilot training (ground theory, simulator practice, real deployment) should be underpinned by a unified cognitive architecture, not separate workflows.
On safety, the authors contend that the GIC framework's modular, explicit design enhances inspectability, allowing problematic behavior to be traced to specific components (e.g., flawed goal or poorly trained module) rather than emerging opaquely. However, they acknowledge that ultimate safety depends on correctly training these modules in the first place.
In conclusion, the paper challenges the loose application of the term "Agent," asserting that task completion alone does not equal true autonomy. True autonomy requires goals, identity, and judgment to be genuinely internalized within the agent's architecture, not merely enforced by external scripts.
marsbit52m ago