Five Core Forms of AI Agent in YC's Eyes

marsbitОпубликовано 2026-05-20Обновлено 2026-05-20

Введение

The article outlines five core architectural patterns for effective AI Agents, emerging from tools like Codex and Claude, that move beyond simple prompts towards reusable, process-based capabilities. 1. **Skills**: Reusable, parameterized workflows that function like method calls, allowing a single process (e.g., "/investigate") to handle various tasks based on input parameters. 2. **Thin Harness**: A lightweight execution framework (~200 lines) that manages the AI model's "hands and feet"—handling loops, file I/O, and context—without becoming bloated. 3. **Resolvers**: Routing tables that map tasks to specific Skills, preventing "context corruption" when managing dozens of Skills and ensuring outputs go to the correct locations. 4. **Latent vs. Deterministic Layer**: A critical separation where LLMs handle judgment, synthesis, and pattern recognition, while deterministic code handles tasks requiring precision, consistency, and low cost (like calculations). 5. **Memory**: A persistent, accumulating knowledge base (e.g., a markdown folder) with a "current trusted conclusion" section and an append-only timeline, enabling the system to learn and retain context over time. Together, these patterns create a "process power"—a durable competitive advantage. Unlike one-off prompt-based applications whose value quickly commoditizes, a well-designed AI Agent system encodes experience into reusable, parameterized workflows, offloads stable rules to code, and continuously learns thr...

Editor's Note: As AI Agents move beyond one-off prompts and vibe coding into more complex workflow stages, the truly important question is no longer 'Can the model complete the task?' but rather 'Can we solidify AI capabilities into reusable, accumulable process assets?'

This article starts from Garry Tan's GBrain and summarizes five core forms that many are converging on when using Agent tools like Codex, Claude Code, Hermes: parameterizable Skills, lightweight execution frameworks (Thin Harness), routing Resolvers, an execution layer separating model judgment from deterministic code, and Memory for long-term context accumulation.

These modules, combined, point towards a new kind of 'process capability': codifying experience into workflows, abstracting tasks into parameters, handing stable rules to code, entrusting judgment and synthesis to models, and continuously depositing learnings through a memory layer. Compared to one-off generated applications or prompts, this kind of system is harder to replicate and more likely to become the foundation for individuals, small teams, and even companies to form a long-term competitive advantage in the AI era.

The original text follows:

I spent some time studying Garry Tan's GBrain. As someone without a technical background and not working in venture capital, I want to distill a few general structural forms I see in it, and where its real significance lies.

I believe many people are gradually converging on the same set of core structures. They can roughly be summarized into 5 forms, which also represent the natural evolution direction in how agentic AI tools like Codex, Claude Code, Hermes, OpenClaw are being used.

Related Reading: 'Thin Harness, Fat Skills: The Real Source of 100x AI Productivity'

Skills: From SOP to 'Method Call'

Skills are almost everyone's most natural starting point. Even without prompting, users intuitively start building them because their form feels very familiar. I initially understood it as a kind of SOP (Standard Operating Procedure), a documented process for getting something done. The user provides 'what to do,' and the Skill provides 'how to do it.'

Tan's understanding is that a Skill is more like a 'method call.' In programming, a method call means invoking a programmatic flow with parameters. The same code runs every time; what changes are the parameters: what data, what question, what objective. For example, the same process_invoice function can handle every invoice in the system, not just the one it was originally written for.

Skills are a similar structure. A Skill named /investigate might contain seven fixed steps; these steps don't change. What changes are the parameters: TARGET (who or what to investigate), QUESTION (what you want to figure out), DATASET (where to look for information). Point it at a whistleblower case in healthcare, and it works like a research analyst; point it at SEC filings, and it works like a legal investigator. The same file, the same seven steps; the difference is provided by the external world.

This is different from traditional SOPs. Most SOPs are written for a specific role or task, like 'Processing Accounts Payable.' Each use case gets its own set of documents. Skills are more abstract; the same process can handle a category of problems. A well-designed Skill can do the work of dozens of SOPs because the case-specific information is extracted from the document and moved into parameters. In practice, some Skills are closer to SOPs, others closer to method calls.

Thin Harness: Model is the Brains, Harness is the Hands and Feet

Models, like Opus, GPT-5.5, etc., are raw intelligence; Harnesses, like Claude Code, Codex CLI, Hermes, OpenClaw, are the execution frameworks that give models real 'hands and feet.' They handle loop execution, reading/writing files, managing context, enforcing safety constraints. Their core code is around 200 lines.

Garry mentions a common mistake most people make: continuously stuffing more things into the Harness. I did the same. I ended up with 100 tool definitions and a pile of MCP servers. The result was the context window being filled with tool descriptions irrelevant to the current task. The model started confusing which tool to use, latency increased, accuracy dropped, eventually leading to so-called 'context corruption.'

Resolvers: Using Routing Tables to Solve Context Corruption

The solution to context corruption is to build a routing table. A Resolver's job is to map 'the incoming task type X' unambiguously to 'Skill Y should be called.' When you only have 5 Skills, you don't need a Resolver; but when you have 100 Skills, the descriptions become fuzzy, and the model easily fails to call the right Skill at the right time. Resolvers replace fuzzy pattern matching with explicit rules.

Tan also runs a similar Resolver-like mechanism for files: a separate routing table to decide where the output of a Skill should land in the filesystem. This is the same 'audit–route' structure applied to a different problem. This ensures outputs consistently go into the correct folders, not where the model temporarily guesses.

Skillify is another supporting idea of his: it's a quality loop to turn one-off Skills into long-term reusable infrastructure. Tan describes a 10-step process including: contract definition, using deterministic code where appropriate, unit testing, integration testing, LLM-as-judge evaluation, Resolver entry, audit script, checking which Skills have no call path, and end-to-end smoke tests. The test criterion is simple: if you have to ask the model the same question twice, it's a failure.

Latent vs. Deterministic: Judgment to the Model, Deterministic Tasks to Code

It's crucial to distinguish which work should be given to the LLM and which to a deterministic system. LLMs excel at judgment, synthesis, pattern recognition, and reading between the lines; but they are not good at arithmetic, combinatorial optimization, or anything requiring the same answer every time. LLMs are inherently probabilistic; when a deterministic solution can solve the problem, don't use an LLM.

Most non-technical people often underestimate the value of the deterministic layer. The default reaction is to throw everything at the model. But if something can be done deterministically, you almost always should. And you don't need to be a programmer because the model can write the code for you. What really needs training is a discipline: each time, ask yourself, can this be done reliably and cheaply with code? If the answer is yes, have the model write that code.

Memory: Making the System Truly Accumulative

For a system to be useful, it must have some form of memory. I'm not yet sure what the correct form is; many people are building it in different ways: vector embeddings, semantic similarity, knowledge graphs, hybrid storage, etc. Tan's approach is the same as mine: just a folder of markdown files.

His structure is: one page per person, one page per company, one page per concept. The top of each page is the 'Current Believed Conclusion,' a synthesized judgment that is continuously rewritten and updated as new evidence arrives; the bottom is an append-only timeline.

Choosing markdown yields several consequences. First, the files are the system's primary record, not some export. You can open it in VS Code, edit it manually, and the Agent will automatically read those changes. Second, typed relationships, like works_at, invested_in, founded, attended, advises, are automatically extracted via regex each time something is written, so the knowledge graph can connect itself without consuming tokens. This specific schema fits his work, but for others, it likely needs to be re-tailored based on one's own profession and business context.

Furthermore, a signal detector runs in the background. If a person is mentioned once, a stub page is created; if they are mentioned three times across sources, it triggers a web info completion process; after a meeting, a full process runs. A nightly 'dream cycle' scans conversations, completes outdated entity information, and fixes broken references. The base layer is text; everything built on top is cheap and composable.

There are, of course, more details underneath, but I believe these are the most important contours, and they are to a considerable degree universal.

Personally, I've already built about half of such an architecture. Previously, I hadn't reached the scale necessitating a true Resolver, but now I have, so I recently did a minor refactor to make my system model-agnostic and built a Resolver into it. The key part I haven't built yet is the background-running signal detector and the nightly dream cycle—the automatic info completion and organization mechanisms—which is what I want to try adding next.

I suspect that different builders converging on a similar structure is itself a signal: while this form might not work for everyone, it's broadly likely useful. Even if specific implementation details vary importantly, this overall structure is being independently discovered by more and more people.

A question I've been asking myself lately is: How do you build a sustainable competitive advantage with AI?

Everyone is excited about vibe-coded apps and one-off prompts, which are of course very cool. I myself started that way and got hooked. But anything that can be built with a one-off prompt will, in equilibrium, have its price driven down to the cost of the tokens required to build it—just a few cents.

For example, someone replicating MyFitnessPal, selling it at half the price, and making a million dollars is impressive. But soon, someone else will replicate it and sell it cheaper. The cycle continues until the profit margin is completely squeezed.

The truly sustainable thing is a kind of 'process capability.' Using the framework from Hamilton Helmer's 7 Powers, the architecture described above embodies process power.

7 Powers posits that companies can sustain above-average returns over the long term because they possess one of seven structural powers. Any advantage not rooted in these powers will eventually be competed away.

For small and medium businesses and early-stage companies, five of Helmer's seven powers are essentially closed doors. Economies of scale require scale; network effects and switching costs can be built but require a large user base first; counter-positioning or exclusive resources often mean patents or similar assets, which most companies don't have; brand typically takes a decade to build and can't be shortcut.

The remaining two are counter-positioning and process power.

Counter-positioning refers to a business model existing giants cannot copy because doing so would harm their own core business. Such opportunities exist sometimes but are not always available.

Thus, the most realistic path left is process power. And a well-designed AI system is precisely a tool for generating process power.

This is essentially the same work as building high-quality SOPs or proprietary software in-house: processes are encoded, cases are parameterized, the underlying deterministic systems are fast and reliable, and the memory layer continuously absorbs past learnings. It amplifies the 'productization of services': you can offer a service or product at lower cost or higher quality because the entire job has been structured.

Imagine an accountant building such a system. The memory layer is a folder; each client has a markdown file containing a current believed conclusion—like entity structure, annual tax positions, ongoing audits—and a timeline logging meetings, decisions, and changes.

She has Skills, like /year-end-review, /quarterly-estimate, /audit-prep. The same process can be executed parametrically for different clients.

She has a deterministic layer including tax forms, depreciation schedules, IRS documents, client historical returns, etc.

Add a mechanism like log tidying or a dream cycle. For example, the system automatically notices at night that a partner's K-1 allocation dropped 40% without a stated strategy change; or notices that a certain client's home office deduction structure could be migrated to another client—the structure is reusable, but the identity and privacy stay put.

Thus, she can charge a small premium, serve more clients per year, and competitors find it hard to replicate because this structure didn't appear out of thin air after her success; it was accumulated from the start.

On the surface, the tool is just a folder of markdown files. But every line in every file comes from a great deal of intentional testing, building, and iteration. What forms the competitive moat isn't the files themselves, but the process capability they embody.

Связанные с этим вопросы

QWhat are the five core structural forms that AI Agent tools like Codex, Claude Code, Hermes are converging towards, as discussed in the article?

AThe five core forms are: 1) Parameterizable Skills, which function like method calls. 2) Thin Harness, the lightweight execution framework for the model. 3) Resolvers, which are routing tables to prevent context corruption. 4) A separation layer for Latent (LLM) vs. Deterministic (code) tasks. 5) Memory, a system for accumulating long-term context and knowledge.

QAccording to the article, what is the key difference between a traditional SOP and a Skill in an AI Agent system?

AA traditional SOP is written for one specific job or task, like 'processing accounts payable,' with each use case having its own process. A Skill is more abstract, akin to a method call. The same Skill process (e.g., a fixed 7-step investigation) can handle a class of problems, with the case-specific information moved from the document into parameters like TARGET, QUESTION, and DATASET.

QWhat problem does the 'Resolver' component address, and how does it work?

AThe Resolver addresses 'context corruption,' which occurs when an AI harness has too many tool/Skill definitions, confusing the model and degrading performance. It works as a routing table, using explicit rules to map an incoming task type 'X' directly to the specific 'Skill Y' that should be invoked, replacing fuzzy pattern matching with clear directives.

QBased on the article, which of Hamilton Helmer's '7 Powers' is most accessible for creating a sustainable competitive advantage with AI for most small teams or early companies?

AThe most accessible power is 'Process Power.' The article argues that a well-designed AI system that encodes workflows, parameterizes tasks, separates deterministic code from LLM judgment, and accumulates memory becomes a tool for building process capability. This creates a competitive moat that is difficult to replicate, unlike easily copied one-off prompts or applications.

QHow is Memory implemented in Garry Tan's system as described, and what are two key benefits of this approach?

AMemory is implemented as a folder of markdown files: one page per person, company, or concept. Each page has a 'Current Believed Truth' section at the top (rewritten as new evidence arrives) and an append-only timeline at the bottom. Key benefits are: 1) The files are the system's primary record, editable manually and read automatically by the Agent. 2) Typed relationships (e.g., works_at, founded) are auto-extracted via regex on write, building a knowledge graph without consuming LLM tokens.

Похожее

From Farm to Entrepreneur: After Building Flying Cars, He Bet on the Robot Sector and Created a $39 Billion Giant

From Farm to Tech Tycoon: Brett Adcock's Journey to a $39B Robot Giant Brett Adcock, a serial entrepreneur from an Illinois farm, has built his third major venture, humanoid robotics company Figure AI, into a $39 billion behemoth backed by NVIDIA, Intel, and others. His path began with the sale of his recruiting platform Vettery for $110 million in 2018. He then co-founded and took electric air taxi company Archer Aviation public in 2021 before departing over strategic differences. Adcock founded Figure AI in 2022 with a 30-year vision to create general-purpose humanoid robots that can work in human-designed environments, aiming to address labor shortages in manufacturing, logistics, and retail. The company gained attention through live-streamed robot sorting challenges and a viral demo, though it faces skepticism over its high valuation versus early commercial progress. A key moment was Figure's split from OpenAI in 2024 after a brief collaboration, with Adcock claiming OpenAI provided limited value and deciding to develop AI models internally—a move OpenAI contested. Adcock's pattern is tackling capital-intensive, long-term tech frontiers, moving from software to aviation to robotics, betting on AI and automation as the future of labor.

marsbit19 мин. назад

From Farm to Entrepreneur: After Building Flying Cars, He Bet on the Robot Sector and Created a $39 Billion Giant

marsbit19 мин. назад

How to Become a Pro Claude User in 30 Days?

"How to Become a Claude Power User in 30 Days" outlines a structured month-long program to transform from a casual user into someone who leverages Claude as a core productivity system. The first week focuses on mastering prompt structure (Role, Context, Task, Format, Constraints), understanding context windows, and setting up foundational Projects and Memory for personalized context. The second week involves building reusable workflows for research, writing (using a two-step outline-first process), and decision-making. The third week shifts to automation: using Claude Cowork for autonomous file tasks, connecting tools like Google Drive and Slack, and setting scheduled automation. The final week is for compounding gains: optimizing all workflows based on feedback, building a personal knowledge repository, teaching the system to a colleague, and designing a complete, ideal "Claude Operating System." By day 31, the user operates with an autonomous assistant that handles routine tasks with deep context, freeing them for high-value creative and strategic work. The core message is that consistent, systematic configuration over 30 days creates a powerful, personalized productivity advantage.

marsbit1 ч. назад

How to Become a Pro Claude User in 30 Days?

marsbit1 ч. назад

Data: 75% of Traders on Hyperliquid Are Losing Money. What Are the Profitable Ones Using?

On Hyperliquid, approximately 75% of addresses are losing money, indicating that manual traders are increasingly competing against automated systems. Profitable traders primarily employ one of three approaches: 1) Running systematic, high-frequency strategies (e.g., one address executed 261k trades with a 64.75% win rate), 2) Placing high-conviction, asymmetric bets with large positions (e.g., 50 trades yielding $4.48M at a 28% win rate), or 3) Using algorithms for execution while making manual macro judgments. The article argues that traditional methods like chart patterns or reacting to social media news are often already priced in by bots, making such traders the "exit liquidity" for systematic players. Success now depends on unique narrative timing, structural insights, or conviction during market capitulation.

marsbit1 ч. назад

Data: 75% of Traders on Hyperliquid Are Losing Money. What Are the Profitable Ones Using?

marsbit1 ч. назад

Tiger Research: On-Chain Risk Operators, The Market Cap Gap Between 147 Trillion and 70 Billion

This report by Tiger Research examines the evolution of risk management in decentralized finance (DeFi) lending. It highlights a power shift from protocol developers to specialized professional risk operators who manage on-chain capital. The era of protocols and community governance solely dictating DeFi lending is ending. A new professional asset management layer has emerged. While the sector is nascent, capital and distribution channels are rapidly consolidating around top risk operator teams, whose past performance is now a key criterion for institutional entry. The industry's development, accelerated by modular infrastructures like Morpho, has led to a clear division of labor mirroring traditional finance: distribution channels (e.g., exchanges), strategy/risk management (the risk operators), and product infrastructure/asset custody (smart contract protocols). This structure lowers the entry barrier for traditional institutions. Currently, the total value managed by risk operators is approximately $70 billion, dominated by a few leading teams like Steakhouse (RWA focus), Sentora (AI models), and Gauntlet (crisis management). Competition now centers on collateral standards, distribution access, and crisis response capabilities. The report outlines three primary entry paths for institutions: 1) **Distribution Model**: Leveraging external risk operators as backend service providers (common for exchanges). 2) **Asset Supply Model**: Onboarding real-world assets to DeFi as collateral. 3) **Independent Operator Model**: Building an in-house team to become a risk operator (e.g., Bitwise). The core opportunity lies in the strategy/risk management layer, where traditional financial institutions can leverage their existing expertise in due diligence and risk assessment without deep technical development. A vast opportunity gap exists: the global traditional asset management industry manages ~$147 trillion, while the entire DeFi sector is only ~$800 billion, with the risk operator niche at ~$70 billion. This disparity signifies immense growth potential. Once robust risk frameworks and clearer regulations are established, even a minor allocation from traditional markets could trigger exponential DeFi growth. Early movers who help build these foundational systems will gain significant rule-setting influence and first-mover advantages.

marsbit1 ч. назад

Tiger Research: On-Chain Risk Operators, The Market Cap Gap Between 147 Trillion and 70 Billion

marsbit1 ч. назад

Interview with Circle's Chief Economist: USDC's Entry into Hyperliquid Benefits Circle and HYPE, Stablecoins Are Becoming Marginal Buyers of U.S. Treasuries

In an interview with Circle's Chief Economist Gordon Liao, the conversation covers the strategic significance of USDC replacing USDH as the reference asset on the decentralized perpetual exchange Hyperliquid. This shift, facilitated by Coinbase as the reserve manager and Circle providing technical infrastructure, aims to capture net interest income for the platform, with 90% of reserve earnings directed back to Hyperliquid for HYPE token buybacks. Liao discusses how stablecoins like USDC, with their substantial on-chain settlement volumes (e.g., $21 trillion in Q1 2026), are emerging as marginal buyers of U.S. Treasuries, concentrating on short-term debt and effectively reducing the weighted duration of the market, which may provide underlying support for long-term rates. The dialogue also explores the evolving nature of stablecoins as both a medium of exchange and a vehicle for capital and collateral liquidity. Additionally, the panel touches on the CLARITY Act's legislative progress, noting compromises around "activity-based rewards" and remaining hurdles like ethics concerns. On AI, there's debate over value capture, with predictions that distribution and application layers, rather than foundational model companies like OpenAI, will accrue most value. Regarding the bond market, Liao attributes the rise in 30-year yields primarily to an increased term premium (around 80 bps) driven by supply-demand dynamics, including fiscal expansion and changing investor demand, rather than expectations of Fed rate hikes.

marsbit1 ч. назад

Interview with Circle's Chief Economist: USDC's Entry into Hyperliquid Benefits Circle and HYPE, Stablecoins Are Becoming Marginal Buyers of U.S. Treasuries

marsbit1 ч. назад

Торговля

Спот

Фьючерсы

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на CORE (CORE) представлены ниже.

Five Core Forms of AI Agent in YC's Eyes

Введение

Skills: From SOP to 'Method Call'

Thin Harness: Model is the Brains, Harness is the Hands and Feet

Resolvers: Using Routing Tables to Solve Context Corruption

Latent vs. Deterministic: Judgment to the Model, Deterministic Tasks to Code

Memory: Making the System Truly Accumulative

Связанные с этим вопросы

Похожее

From Farm to Entrepreneur: After Building Flying Cars, He Bet on the Robot Sector and Created a $39 Billion Giant

How to Become a Pro Claude User in 30 Days?

Data: 75% of Traders on Hyperliquid Are Losing Money. What Are the Profitable Ones Using?

Tiger Research: On-Chain Risk Operators, The Market Cap Gap Between 147 Trillion and 70 Billion

Interview with Circle's Chief Economist: USDC's Entry into Hyperliquid Benefits Circle and HYPE, Stablecoins Are Becoming Marginal Buyers of U.S. Treasuries

Торговля

Популярные статьи

Как купить CORE

Обсуждения

Топ вопросы

Популярные категории

Популярные теги