From Tokens to Machine Labor: AI is Shifting from Tool to "Worker"

marsbitОпубликовано 2026-05-31Обновлено 2026-05-31

Введение

The article "From Token to Machine Labor: AI is Evolving from Tool to 'Worker'" argues that the business model for AI is shifting beyond simply selling computational resources (tokens, GPU hours) or model access. Instead, a new "machine labor market" is emerging, where the core economic transaction is the purchase of economically useful work directly performed by software. The central thesis is that AI pricing will evolve through four stages: 1) raw tokens, 2) standardized LLM capabilities (e.g., text generation), 3) industry-specific labor markets (e.g., legal review, radiology), and finally 4) a programmable results market where tasks like resolving a support ticket are bid on and priced based on outcome. In this future, buyers will care less about *which* model or GPU completes a task and more about whether the work meets specified standards for accuracy, latency, and cost. This transition reframes the impact of AI on human labor. Rather than simple replacement, it suggests a re-coordination where machines handle standardized, verifiable work, freeing humans for roles involving oversight, context management, responsibility, and final judgment. In some cases, this "last 1%" of human input becomes more valuable as it enables the other 99% to be automated. Furthermore, as AI reduces the cost of work, demand may expand, creating larger markets (e.g., 24/7 customer service) rather than just cheaper versions of existing ones. The article concludes that while infrastructure (G...

Editor's Note: As AI begins to write code, handle customer service tickets, and review legal documents, a more fundamental question is emerging: what are businesses truly purchasing—tokens, GPU hours, or completed work?

This article proposes a noteworthy framework: the commercialization of AI should not merely be understood as a "computing power market" or a "model invocation market," but is moving toward a new "machine labor market." In this market, tokens are merely a unit of measurement, GPUs are input goods, models are production tools, and the real objects being priced and traded are economic tasks directly completed by software.

The article's core thesis is that AI pricing mechanisms will evolve from raw tokens and standardized model capabilities to industry-specific labor, and finally to a programmable outcomes market. This means that in the future, enterprises might no longer care which specific model or GPU type completes a task, but rather whether it delivers a result meeting defined standards within specified latency, accuracy, reliability, and cost constraints.

This also implies that AI's impact on the human labor market may not be simple replacement. As machines take on more standardized, verifiable work, the human role may shift towards review, accountability, context management, and final judgment. In some scenarios, the final 1% of human judgment could become even more valuable, as it unlocks large-scale automation of the other 99%.

From this perspective, the next stage of competition in the AI market may no longer be just about model capability, nor a simple price war over computing power, but about who can first standardize, verify, and price "work," and ultimately turn machine labor into a new type of factor of production that can be procured, settled, and traded.

Original article follows:

Waves of productivity have historically come from providing humans with tools and software to optimize how work gets done. Spreadsheets aided accountants and analysts, conveyor belts increased throughput, hammers amplified human leverage. But the actual labor always came from humans.

Now, AI is producing work outcomes end-to-end, directly executing the labor itself. It can write code, handle customer service tickets, review legal documents. A compression is happening at the far end of the tech stack: the old stack supported labor, the new stack is starting to produce it.

If you've listened to discussions on AI financialization recently, you've likely heard Jensen and others say that LLM tokens and/or GPU hours are becoming the new commodities. This intuition is understandable because tokens are measurable, billable, and easy to chart; billions of dollars are also flowing into GPU hours. But tokens remain merely a meter, GPU hours are just an input—no one buys them for their own sake. What people actually want is to get work done. AI is turning the tech stack itself into a source of labor.

Machine Labor: Work performed by software, for an economic purpose, and sold into the production process.

The market is already moving in this direction. Benchmark's Sarah Tavel prefers to understand this opportunity through the lens of outsourcing labor markets, not software categories. If a repeatable task is already performed by a dedicated offshore team or professional services firm, it's often also a good candidate for AI delivery. a16z's Alex Rampell calls this "software eating labor": software's next act is to do the work itself. Sequoia's Julien Bek describes the same shift from another angle: services are turning into software, copilots sell tools, while autopilots sell the work.

The Missing Market Behind Outcomes-Based Pricing

Seat pricing charges for access, token pricing charges for usage. Outcomes-based pricing charges when work is completed. Outcomes pricing moves us a step forward, but it still doesn't answer one question: who decides the price?

If machine labor can be bought directly, price should come from competition among suppliers. These suppliers must be able to meet the same class of tasks or work completion standards, which requires standardization within different industries and tasks.

The current approach uses LLM tokens, but raw tokens are just the lowest layer. A barrel of oil is just a unit of measurement; what's actually traded is a barrel of a specific grade of oil, with defined quality, delivery terms, and market price. A barrel of Brent crude and a barrel of high-sulfur heavy crude are not the same commodity. It's the same with LLM tokens. Tokens are just the unit; what matters is the intelligence behind them: model quality, benchmark floors, latency, context window, reliability, and delivery guarantees. One million tokens from a frontier coding model are not the same commodity as one million tokens from a cheap general-purpose model. The market needs standardized inference grades, just as the energy market needs standardized oil grades.

Anjali Shriva points this out directly: a token is not a fixed cost unit. Its economics vary with context length, task structure, input/output ratio, retry counts, tool calls, and agent workflows. A token in a short prompt and a token buried in a long agent loop are not the same economic object.

We already do this in human labor markets. No one hires a radiologist as a generic "human hour." They look at training background, certifications, specialty, years of experience, availability, reputation, liability, etc. Different human contract specs correspond to different minimum standards and grade expectations.

Human labor markets already run on these specs; it's just that these specs are often muddled, qualitative, and full of proxy signals. Machine labor will make these specs more explicit and quantifiable.

For an LLM or an agent, metrics like skill, experience, speed, and reliability can be written directly into a contract: benchmark scores, latency, throughput, context window, max output length, tool-use accuracy, uptime, error rates. We can procure labor based on quantifiable expectations and outcomes.

TheGrid.ai's contract spec is essentially a qualification filter, plus price competition for LLM outputs. Any supplier meeting the spec can enter the competition:

Intelligence Benchmark ≥ Floor

Latency ≤ Ceiling

Throughput ≥ Floor

Uptime ≥ Floor

Error Rate ≤ Ceiling

Once suppliers all meet the same minimum bar, they compete on price. The buyer asks: which supplier can deliver the required labor at the best price?

Hiring a radiologist, in the LLM context, becomes a measurable question: which LLMs can read X-rays with high proficiency, meeting defined latency, context window, and other outcome-based contract specs.

Outcome is how the buyer measures success; Labor is the economic activity being supplied; Token is the fuel the machine consumes while doing the work.

The Grid is the machine labor market.

From Tokens to Machine Labor Market

The market can price inputs of the tech stack, but to price outputs requires a machine labor market. Buyers don't care about GPU hours. Model endpoints themselves are unstable: they get renamed, deprecated, wrapped, or simply retired.

Users and liquidity hate frequent changes. GPUs and models will keep evolving, but the stable unit is the work itself.

I believe the market will evolve along the following path. Each step up the ladder, what is being purchased becomes more abstract, more valuable, but also harder to verify. The Grid should climb this ladder progressively:

Raw Tokens → Commoditized LLM Capability Market → Commoditized Labor Market → Programmable Outcomes Market

Stage 1: Raw Tokens

Claude 4.7, GPT 5.5, Kimi 2.6, DeepSeek V4, GLM 5, etc.

Today, buyers purchase raw model output from inference providers. They send their prompts, receive inference results, and pay per use. This is easy to verify, but it's still just raw material. Buyers don't actually want tokens; they want useful intelligence at the best price.

Stage 2: Commoditized LLM Capability Market

E.g., text/usd, code/usd, agent/usd, etc.

The buyer no longer chooses a specific model, but the category of intelligence they need. The buyer still owns the workflow, prompts, data, and application logic. The Grid just routes each request to the qualified model that meets the contract spec and offers the lowest price.

Note: This is the first real abstraction above raw tokens, and where TheGrid.ai currently sits.

Stage 3: Commoditized Labor Market

E.g., accounting/usd, support_agent/usd, legal/usd, healthcare/usd, radiology/usd, etc.

As models become more specialized, the capability market can evolve further into industry-specific markets. This is analogous to specialization in human labor markets.

At this layer, we're selling inference suited for workflows in specific labor verticals. This category will expand rapidly as niche industry models become common. Examples include Cursor's Composer, Harvey for legal work, and EvidenceOpen for healthcare.

Stage 4: Programmable RFQ and Outcomes Market for Agents

E.g., support_ticket_resolved/usd, pr_merged/usd, claim_processed/usd, etc.

The final layer is where The Grid moves from an inference market to a machine labor market.

This layer requires RFQs (Request for Quote), escrow, delayed settlement, buyer confirmation, supplier reputation, clawback, dispute resolution, etc. It will likely start with RFQs rather than order books. Buyers define the job, constraints, acceptance criteria, and settlement terms; agents bid to complete it. The Grid helps route, price, verify, and settle these jobs.

This is the most valuable layer, but also the hardest to verify, as outcomes can be delayed, subjective, and easily gamed. A customer service ticket might be reopened; a pull request might pass tests but still introduce poor architecture.

Total Price = Cost of doing the work + Cost of bearing the risk

A workflow does not automatically become a market just because intelligence has a market or intelligence gets cheaper. Some work is deeply dependent on private context, like customer history or internal policies. The more context-dependent the work, the less likely it is to be cleanly liquidated in an open market. [@hypersoren https://hypersoren.xyz/posts/cybernetic-arbitrage/]

The market needs to reveal which labor categories will expand and which will contract.

"Machine Labor vs. Human Labor" or "Machine Labor & Human Labor"

Anjali Shriva notes in her mechanism design draft that the AI narrative is too often described as replacement. But in reality, it's more of a coordination problem: how work, attribution, incentives, and value get reorganized when both humans and machines participate in production.

Today, much AI usage inside companies remains stuck because employees use AI privately, workflows stay locked on individuals, and the firm cannot price these productivity gains or scale them.

Most automatable work will likely shift to machines. Some work will turn into human review, accountability, training, and context management. In some cases, the final 1% of human judgment will become more valuable, as it unlocks the other 99% of automation at scale.

Rachel Su Park's "Brave New World of AI Markets" points out that AI's TAM should not be simply modeled as a replacement for existing human labor spend, because it changes both price and quantity. As the cost of work falls, unit price may decline, but quantity consumed may expand because existing work gets consumed more often, and entirely new work that wasn't economical before becomes possible. The article summarizes it as:

P × Q: Market Size = Price per unit of work × Quantity of work consumed

If AI makes a customer service interaction cheaper, companies can afford to offer 24/7 coverage. The market won't just be a cheaper version of the old customer service labor market; it might become a much larger market for customer interactions.

AI is an expansionary market because demand does not stay constant when the cost of work falls.

The Labor Layer

The machine labor market should start with work that can be crisply defined in specs. GPU hours contain too much input information; they only tell you what powered the work. Pricing full outcomes is too complex, too context-dependent. As verification, reputation, and risk/insurance pricing get handled by machines, the market will move further into the pure outcomes layer.

Machine labor can become tradable because buyers will increasingly not care which model or which GPU produced the work, but rather whether the work itself meets the minimum standards and grade from the contract spec, at the right price. Agents will care even less about the underlying source.

Machines can now directly execute work for an economic purpose, and that work can be defined, measured, priced, procured, and eventually traded. Electricity, compute, models, and tokens are still important, of course, but they're all still upstream.

Downstream is where the work actually gets done, and the market is moving toward a simpler object: machine labor.

Связанные с этим вопросы

QAccording to the article, what is the core evolution path of AI pricing mechanism?

AThe article states that the AI pricing mechanism will evolve from raw tokens, to standardized model capabilities, to industry-specific labor, and finally to a programmable results market.

QWhat does the article define as 'machine labor'?

AThe article defines 'machine labor' as work performed by software for economic purposes, which is then sold into the production process.

QWhat are the four evolutionary stages of the AI market, from primitive to advanced, as outlined in the article?

AThe four stages are: 1) Raw Tokens, 2) Commoditized LLM Capability Market, 3) Commoditized Labor Market, and 4) Programmable Results Market for Agents.

QHow does the article suggest AI will change the human job market, rather than simply replacing it?

AThe article suggests that as machines take over standardized work, human roles will shift towards review, accountability, context management, and final judgment. In some cases, the final 1% of human judgment becomes more valuable as it unlocks large-scale automation of the other 99%.

QWhat key factor does the article identify as necessary for a true machine labor market to function?

AThe article identifies the need for standardization and verifiable specifications within different industries and tasks. Suppliers must meet defined standards (like benchmark scores, latency, error rates) for a specific type of work, enabling price competition based on the delivered labor, not the underlying tokens or compute.

Похожее

Three Years Later: Looking Back on My 2023 Predictions for ChatGPT

Looking Back After Three Years: Revisiting My 2023 Predictions on ChatGPT In March 2023, shortly after ChatGPT's debut and before GPT-4's release, I made over twenty predictions about AI's future based on limited information and intuition. Now, in May 2026, I revisited those forecasts using an AI-driven analysis with 41 Opus 4.8 agents to cross-reference them with the latest data. The assessment used symbols: ✅ Correct, 🟢 Mostly Correct, 🟡 Partially Correct, ❌ Incorrect. Overall, the directional judgments held up well, with only one major factual error regarding GPT-4's rumored parameter size (incorrectly cited as 100T). However, nuances and degrees of accuracy revealed more. **What Was Largely Correct:** Predictions about mechanisms and directions proved accurate. The rise of RAG (Retrieval-Augmented Generation) as the standard architecture for combating AI hallucination was confirmed, as was the transformative potential of LUI (Language User Interface) in creating a new industry layer atop GUIs. The emergence of "robot networks" (agent-to-agent communication protocols) and China's rapid catch-up in developing capable large models (closing the performance gap with top models to ~2.7%) were also on point. The analysis affirmed that LLMs lack consciousness and that the Turing Test merely measures perceived intelligence. **What Was Off Target:** Errors often involved specific numbers, over-optimistic timelines, or misjudged distributions. The prediction that value would primarily accrue to the application layer was half-right but missed NVIDIA's dominance as the profitable infrastructure layer. Forecasts about AI circumventing copyright issues and fostering a "global common ground" by averaging human viewpoints were incorrect; instead, major copyright settlements occurred and AI personalization is increasing. Estimates for model training costs ("$5-10 billion cap") were significantly off, underestimating frontier costs and overestimating replication costs. The notion that LLMs could never do complex math without tools was disproven by later models winning IMO gold. **Key Patterns from the Review:** 1. **Direction over precision:** Judgments about mechanisms and trends were more reliable than specific numbers or definitive statements. 2. **Timing bias:** There was a tendency to overestimate short-term speed but underestimate long-term magnitude and transformation. 3. **The distribution blind spot:** Aggregate-level correctness often masked uneven impacts (e.g., on young professionals' employment). 4. **The value of qualifiers:** Predictions framed with caution (e.g., "reportedly," "for now," "prototype in 2-3 years") aged better. 5. **Some debates continue:** Issues like the nature of "emergent abilities" or machine consciousness remain unresolved. This three-year review highlights that while seeing the big picture is crucial, humility regarding specifics, timelines, and disparate impacts is essential for future forecasting.

链捕手1 ч. назад

Three Years Later: Looking Back on My 2023 Predictions for ChatGPT

链捕手1 ч. назад

AI Bubble Warning: AI Investments Are Negative Returns for Most Tech Giants

The article issues a stark warning about a potential AI investment bubble. It notes that while the AI boom shares similarities with the TMT bubble of the late 1990s, its scale is vastly larger, currently driving 93% of U.S. GDP growth. Major hyperscale cloud providers like Microsoft, Alphabet, Amazon, Meta, and Oracle are planning to invest trillions in AI data centers over the coming years. However, calculations based on analyst projections for 2025-2030 reveal a concerning math problem: expected capital expenditure growth far outpaces projected revenue growth. Even under an extremely optimistic scenario of zero costs, the implied return on investment for most of these tech giants (except Amazon) is deeply negative. This suggests that the current trajectory could lead to one of history's largest shareholder value destruction events. The piece outlines two potential escapes: AI generating vastly more revenue than currently anticipated—a near-impossible task—or a significant cutback in the planned investment splurge. The latter scenario could trigger a domino effect, severely impacting the entire tech supply chain (from Nvidia to TSMC), potentially pushing the U.S. economy into recession, and causing a major stock market downturn. The author suggests upcoming high-profile IPOs by companies like OpenAI and Anthropic might represent a transfer of risk from early investors to public market participants. While the peak of the hype cycle might sustain investment through 2026, the fundamental financial dilemma remains unresolved, setting the stage for a potential market correction in 2027 or 2028, similar to the years following Alan Greenspan's "irrational exuberance" warning.

marsbit1 ч. назад

AI Bubble Warning: AI Investments Are Negative Returns for Most Tech Giants

marsbit1 ч. назад

Xiaomi MiMo's 99% Price Cut is Not Marketing! Luo Fuli Posts on X to Refute Critics

The price of Xiaomi's MiMo-V2.5 series API has been permanently reduced by up to 99%, specifically for the "Input (Cache Hit)" cost, which covers users re-reading historical context in long conversations. MiMo's head, Luo Fuli, published a detailed technical blog to clarify that this drastic price cut stems from genuine engineering breakthroughs, not a marketing stunt or a simple price war. The core of the achievement lies in six key engineering optimizations. First, the model architecture adopts a Hybrid Sliding Window Attention (SWA), reducing the memory footprint (KVCache) to 1/7th of a traditional model. Second, a dual-pool memory management system actually utilizes these savings, allowing a single GPU to handle over 5 times more concurrent users. Third, an upgraded prefix caching mechanism achieves a cache hit rate of 93-95% for repeated reads, meaning most such requests bypass GPU computation entirely. Fourth, a self-developed distributed cache (GCache) utilizes idle SSD space on existing GPU servers, eliminating additional storage costs. Fifth, an intelligent scheduling system (LLM-Router) efficiently routes requests to maximize cache reuse and performance. Sixth, Multi-Token Prediction (MTP) accelerates the model's text generation ("output") side. Together, these systemic optimizations dramatically lower the real computational cost per request, enabling the 99% price reduction for cached inputs while reportedly maintaining positive gross margins. Luo Fuli's disclosure aims to shift the narrative from "price war" to a demonstration of substantive AI engineering progress.

marsbit4 ч. назад

Xiaomi MiMo's 99% Price Cut is Not Marketing! Luo Fuli Posts on X to Refute Critics

marsbit4 ч. назад

$26 Billion: An 'All-Chinese Team' Backs the World's Highest-Valued AI Programming Company

Cognition AI, the company behind the AI programmer "Devin," has raised over $1 billion in new funding at a valuation of $26 billion, just eight months after reaching a $10.2 billion valuation. The round was led by Lux Capital, General Catalyst, and 8VC. Founded by three young Chinese entrepreneurs with strong competitive programming backgrounds, Cognition initially gained fame with Devin, marketed as the world's first AI software engineer capable of handling tasks from start to finish. While its early demos were impressive, real-world usage revealed reliability and cost-effectiveness issues, leading to a significant price cut for Devin in 2025. A pivotal moment came when Cognition acquired the assets of AI IDE company Windsurf after a failed acquisition by OpenAI. This move gave Cognition a crucial developer-facing tool, allowing it to pursue a two-pronged strategy: Devin for autonomous task execution and Windsurf for integrated, collaborative coding within an IDE. This shift helped the company move away from the controversial "AI replacement" narrative towards a model of augmenting human engineers, particularly for repetitive or maintenance tasks. This strategic pivot is backed by strong commercial metrics. The company reports a 10x increase in enterprise usage this year, with an annual revenue run-rate of $492 million and a 50% month-over-month growth in enterprise Devin usage over the past six months. Its client list now includes major corporations like Goldman Sachs and Mercedes-Benz, as well as government agencies like NASA and the U.S. Army. Investors are betting on Cognition becoming a foundational piece of next-generation software engineering infrastructure, positioning it at the center of a hybrid future where AI agents and human developers work in tandem.

marsbit4 ч. назад

$26 Billion: An 'All-Chinese Team' Backs the World's Highest-Valued AI Programming Company

marsbit4 ч. назад

The Hottest 00s Generation on Wall Street

"Wall Street's Hottest '00s Phenom: The 25-Year-Old Fund Manager Who Bet on AI's 'Boring' Backbone" At just 25, Leopold Aschenbrenner, once fired by OpenAI, now runs a hedge fund worth $13.7 billion. His strategy? Betting against the consensus. While others chased AI chips, he invested early in the physical infrastructure powering the AI boom: electricity, data centers, and energy. Expelled from OpenAI's safety team in 2024, Aschenbrenner foresaw the coming bottleneck. He argued that AI progress would be limited not by algorithms, but by power, chip capacity, and space. Acting on this, he founded Situational Awareness LP to go long on these "old economy" assets. His bets have paid off spectacularly. His fund's assets soared from $255 million in late 2024 to $13.7 billion by Q1 2026. His portfolio is a direct reflection of his thesis: major long positions in fuel cell company Bloom Energy and data center/bitcoin mining firms like CleanSpark and Riot Platforms, which control critical land and power resources. Conversely, he holds massive put options against overheated semiconductor giants like NVIDIA and AMD. A notable exception was his bullish bet on storage company SanDisk, which surged ~160% in Q2. Aschenbrenner's vision is materializing. Tech giants like Amazon, Alphabet, and Meta are ramping up colossal capital expenditure on data centers. Global data center power consumption is projected to skyrocket, with AI accounting for over half by 2030. The demand for enabling technologies like optical fiber and modules is also exploding. His story underscores a fundamental truth of the AI era: the ethereal intelligence of algorithms rests on a very physical, heavy, and power-hungry foundation. The future is being built not just in code, but in concrete, copper, and kilowatts.

marsbit6 ч. назад

The Hottest 00s Generation on Wall Street

marsbit6 ч. назад

Торговля

Спот
Фьючерсы

Популярные статьи

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

2025 год — год институциональных инвесторов, в будущем он будет доминировать в приложениях реального времени.

1.8k просмотров всегоОпубликовано 2025.12.16Обновлено 2025.12.16

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на AI (AI) представлены ниже.

活动图片