# Automation Related Articles

HTX News Center provides the latest articles and in-depth analysis on "Automation", covering market trends, project updates, tech developments, and regulatory policies in the crypto industry.

Robots Begin to 'Consume Data': The Hidden Production Chain from Indian Data Factories to Billion-Dollar Humanoid Robots

Robots have started to 'consume data,' driving the formation of a new industrial supply chain focused on producing training data for embodied AI. Unlike large language models, which are trained on vast internet text corpora, embodied AI models face a 'data desert' in the physical world. This has created a massive demand for first-person perspective video data (Ego Data), captured by workers wearing cameras in places like Indian garment factories. Companies like Neocambrian AI are establishing 'data factories' where workers perform standardized tasks (e.g., sorting clothes, kitchen organization) to generate thousands of hours of video. Research, such as NVIDIA's EgoScale, demonstrates that scaling this human demonstration data predictably improves robot performance, particularly for dexterous manipulation. This has validated a training path combining large-scale human data for pre-training with smaller amounts of robot-specific data for fine-tuning. The value of different data types varies significantly, forming a 'data pyramid.' The base consists of low-cost, large-scale internet and Ego Data. Higher layers include more expensive motion-capture data (e.g., from data gloves), simulation/synthetic data, and the most costly and scarce layer: real robot teleoperation data. This demand has spawned a layered ecosystem of data suppliers: low-cost data factories, motion capture and alignment specialists, robot-native teleoperation service providers, simulation data companies, and platforms aiming for data standardization. Robot companies themselves are adopting a 'layered procurement' strategy: outsourcing generic Ego Data while building in-house capabilities for robot-specific adaptation data and the critical deployment/failure data generated in real-world applications. The industry is shifting focus from hardware and basic mobility to the data pipelines required for general-purpose capability. While parallels exist to data labeling companies like Scale AI in the LLM boom, the physical complexity of robot data—involving action success ambiguity and sim-to-real gaps—requires more integrated solutions for data collection, annotation, and a continuous feedback loop. The race is on to build the data engines that will teach robots to operate reliably in the unstructured real world.

marsbit12h ago

Robots Begin to 'Consume Data': The Hidden Production Chain from Indian Data Factories to Billion-Dollar Humanoid Robots

marsbit12h ago

The Recursive AI Anthropic Warned About: Tian Yuandong's New Company Has Just Taken the "First Step"

Anthropic recently highlighted the rapid progress toward "recursive self-improvement," where AI systems autonomously design and train their successors. In response, Recursive Superintelligence, a new company co-founded by former Meta researcher Tian Yuan Dong, has publicly demonstrated its first step toward automating AI research. The company released a system designed to autonomously execute the full AI research cycle: generating ideas, implementing code, running experiments, and learning from results. It validated this approach by achieving state-of-the-art results on three diverse benchmarks: 1. **NanoChat Autoresearch:** Optimizing a small language model's validation loss under a fixed 5-minute GPU budget, improving upon the community's best result. 2. **NanoGPT Speedrun:** Reducing the time to train a GPT model to a specific loss on 8 H100 GPUs from 79.7 seconds to 77.5 seconds, beating a highly optimized, human-driven community effort. 3. **SOL-ExecBench:** Improving the overall score on NVIDIA's suite of 235 GPU kernel optimization tasks by 18%, closing the gap to the hardware limit. The system discovered novel optimizations in this highly specialized domain without direct human expertise. Recursive's system operates as a general framework, capable of parallel exploration and cross-task knowledge transfer while incorporating safeguards against reward hacking. The company, backed by $650M in funding and a star-studded team including Richard Socher and Alexey Dosovitskiy, aims to create AI that recursively enhances its own research capabilities. This development represents an early but concrete move toward a new paradigm where AI accelerates its own advancement. It occurs alongside Anthropic's warnings about the need for industry coordination and potential pauses when recursive self-improvement thresholds are reached, highlighting the dual trajectory of rapid technical progress and growing calls for careful stewardship.

marsbitYesterday 04:12

The Recursive AI Anthropic Warned About: Tian Yuandong's New Company Has Just Taken the "First Step"

marsbitYesterday 04:12

Public Version of Mythos Officially Launched: Analyzing the Advantages and Limitations of AI Smart Contract Auditing

Publicly available Mythos, Anthropic's AI model, has officially launched, demonstrating both significant potential and limitations in smart contract security auditing. The article analyzes its capabilities through real-world cases. AI excels in identifying subtle, low-level vulnerabilities through pattern recognition and large-scale code screening. A key example is detecting a storage slot collision between a custom rewards mapping and a third-party library's ReentrancyGuard, a vulnerability easily missed in manual audits. In the recent Zcash incident, AI also rapidly discovered a critical soundness bug that had remained hidden for years. However, AI currently struggles with complex, interconnected scenarios. When tested on the Curve LlamaLend sDOLA exploit, which involved manipulating prices across multiple protocols (Curve pools, lending markets) to trigger liquidations, Fable 5 failed to identify the core cross-protocol attack vector. These scenarios require a deep understanding of DeFi economic models and multi-contract interactions. In conclusion, while AI tools like Mythos significantly boost efficiency in finding standardized, syntactic vulnerabilities, they cannot yet replace expert analysis for complex, business-logic, and cross-protocol attacks. An effective audit workflow combines AI's speed for initial screening with human expertise for in-depth, holistic analysis.

marsbit2 days ago 08:06

Public Version of Mythos Officially Launched: Analyzing the Advantages and Limitations of AI Smart Contract Auditing

marsbit2 days ago 08:06

AI Investors' 2026 Anxiety: When Models Devour Everything, What Moat Is Left for Startups?

In 2026, a wave of investor anxiety questions the defensibility of AI startups as models improve, fearing that most companies are just "thin wrappers" destined to be absorbed by foundation models or chipmakers. The author argues against this despair, positing that true moats lie not in benchmark performance but in areas models cannot easily reach. The logic of despair is that if models excel at all measurable tasks, only compute and cutting-edge model weights hold lasting value. However, the essay contends that the most valuable work is inherently "untrainable." Benchmarks measure what can be measured and thus optimized for, but real-world correctness often resides in private, complex systems. Examples include legacy codebases, intricate legal transactions, or hospital workflows. This kind of correctness is proprietary, costly to establish, and cannot be validated quickly—it requires time and trust within an organization. As models commodify visible, measurable tasks from both above (labs absorbing scaffolding) and below (saturation by cheaper models), value shifts to "untrainable ground." This encompasses work where correctness is a private truth, locked behind integration barriers, licenses, liability frameworks, and entrenched user habits. Trust and adoption are slow, human-centric processes that smarter models cannot accelerate. Successful companies defend their position by embedding deeply into client operations, owning the definition of "good" within a specific domain (e.g., Harvey in law, OpenEvidence in medicine), and pricing on outcomes rather than tokens. While labs compete fiercely, they are incentivized to keep the application layer vibrant. The future belongs not to those competing on generic benchmarks but to those navigating unscoreable terrain, doing the "unsexy work" of translation between models and messy human realities. The most cited benchmark scores are thus maps of territory about to become worthless, signaling who will lose the right to define what counts as good.

marsbit2 days ago 03:34

AI Investors' 2026 Anxiety: When Models Devour Everything, What Moat Is Left for Startups?

marsbit2 days ago 03:34

How to Conduct Deep Research Using Claude's Dynamic Workflows

The article "How to Use Claude's Dynamic Workflows for Deep Research" discusses overcoming the pitfalls of technical research, where both humans and AI can get overwhelmed by information, leading to vague conclusions. It introduces Claude Code's new "Dynamic Workflows" feature, which automatically designs and executes task-specific workflows before starting a task, unlike simpler "planning modes." This approach incorporates validation, result convergence, and adversarial verification from the outset. The core of Dynamic Workflows is six predefined scheduling patterns that address how to decompose tasks and synthesize results: 1. **Classify-and-Act (Routing):** An agent classifies the task and routes it to the most suitable specialist agent for execution. It's precise and efficient but struggles with ambiguous tasks. 2. **Fan-out & Merge:** The task is split into parallel, independent subtasks whose results are later merged. It's fast and isolates contexts but is more expensive and challenging to synthesize. 3. **Adversarial Verification:** Multiple "challenger" agents critique a worker agent's conclusion, requiring majority approval. This counters confirmation bias and self-assessment errors but relies on verifiable facts. 4. **Generate & Filter:** Multiple agents generate many candidate solutions, which are then filtered against a rubric to output only the best. It fosters diversity but depends heavily on the filter's quality. 5. **Tournament:** Multiple agents compete on the same task, with pairwise comparisons eliminating contestants over rounds to select the best. This offers stable relative judgment but is complex. 6. **Loop:** An agent iteratively attempts a task, learning from errors and adjusting until a stop condition is met. It handles tasks with unknown scope but risks infinite loops without proper design. The author compares their own custom deep-research system, which involved multi-agent analysis and deduplication but lacked goal-oriented convergence, to Claude's built-in workflow. The official workflow adds critical layers: initial problem decomposition, credibility assessment of sources, cross-agent voting to delete weak conclusions (not just averaging), and output tightly focused on the user's original goals and actionable recommendations. This structurally addresses common AI issues like goal drift, premature stopping, context pollution, and output bias. In summary, Dynamic Workflows represent a shift from smarter single conversations to a structured research process, compressing what used to require many dialogues into 3-4 interactions, albeit at higher token cost. The author notes remaining challenges for their specific domain (blockchain research): the need for fact-based verification over official documentation, depth in truly novel interdisciplinary thinking, the practical validation of proposed solutions, and tailoring information density to the audience.

marsbit06/09 03:07

How to Conduct Deep Research Using Claude's Dynamic Workflows

marsbit06/09 03:07

Recursive Self-Improvement AI Gains Traction, Google Pours Cold Water, While DeepSeek and Others Approach the Fringes

The term "recursive self-improvement" (RSI), where AI improves itself autonomously, is gaining momentum in the AI industry. Startups like Recursive Superintelligence and projects such as Andrej Karpathy's Auto-Research aim to create systems where AI designs, implements, and validates its own research, moving toward superintelligence. While Google CEO Sundar Pichai cautions that such exponential acceleration is not yet a reality, progress is evident. For instance, Anthropic reported its Claude Code writes nearly 100% of the team's code, though it still lacks true self-direction. Analysts frame RSI development in stages: "adequacy" (systems functioning without humans), "parity" (matching human research quality), and "supremacy" (exceeding human-AI collaboration). Reaching parity could trigger rapid, unpredictable advancement due to AI's continuous operation. In China, companies like DeepSeek and Baidu incorporate self-optimization techniques without explicitly branding them as RSI, focusing on algorithmic efficiency and reinforcement learning. However, challenges remain, including "model collapse" from training on AI-generated data and the immense computational and open-collaboration requirements. Ultimately, RSI represents a trend of increasing automation in AI development, potentially reducing human oversight in the creation process itself.

marsbit06/06 23:25

Recursive Self-Improvement AI Gains Traction, Google Pours Cold Water, While DeepSeek and Others Approach the Fringes

marsbit06/06 23:25

Exclusive from Yingke | Tang Wenbin's 'Yuanli Lingji' Merges with Logistics Robotics Company, and Secures Investment from Zhipu, SenseTime, Jieyue, and Others

Exclusive report: Embodied AI company "Yuanli Lingji" recently completed a new round of financing from major AI model firms including Zhipu AI, Stepfun, and SenseTime, alongside continued investments from industrial backers like Huaqin and SAIC Hengxu. Founded in March 2025 by Tang Wenbin, former co-founder and CTO of Megvii, Yuanli Lingji is a general-purpose embodied AI model company. In a notable move, the company has merged with logistics robotics firm "Atomix" (formerly known as Yuanli Juhe) through a share acquisition. Atomix, which originated from Megvii's logistics robotics business led by Tang in 2016 and was spun off in July 2024, has grown to become the world's second-largest supplier of pallet shuttle robots, with annual revenue nearing 1 billion RMB and over 500 projects globally for clients like Uniqlo and CATL. This merger aims to break the industry's "data deadlock" by combining Atomix's extensive real-world operational data from more than 20 countries with Yuanli Lingji's model training capabilities. The company's embodied AI model "DM0" utilizes a cross-domain training approach, integrating internet semantics, autonomous driving rules, and robotics data to achieve hardware-agnostic, precise manipulation even with a compact 2.4B parameter size. The collective investment from key AI players and the strategic merger signal a shift in the competitive landscape, as major model companies pivot from language tokens to physical actions ("from Token to Action"). The industry is entering a consolidation phase where hardware, AI models, data, and application scenarios converge to scale embodied intelligence, a trend mirrored by recent moves from giants like ByteDance and Skild AI.

marsbit06/05 01:07

Exclusive from Yingke | Tang Wenbin's 'Yuanli Lingji' Merges with Logistics Robotics Company, and Secures Investment from Zhipu, SenseTime, Jieyue, and Others

marsbit06/05 01:07

The Merger of Codex and ChatGPT Marks the Beginning of a Major Reshuffle in Programming Tools

OpenAI is shifting its strategic focus from ChatGPT to Codex, merging them along with the browser tool Atlas into a unified desktop super-app. This move signals an internal belief that Codex, originally a programming tool, represents the next evolution of AI more than conversational models like ChatGPT. Over the past year, Codex's weekly active users have surged past 5 million. The key distinction is that while ChatGPT answers questions, Codex executes tasks. Enterprises increasingly value this ability to get work done over simply receiving advice. Consequently, Codex is attracting professionals beyond developers, including analysts, bankers, marketers, and product managers. OpenAI's reorganization and increased investment in Codex stem from recognizing that the future of AI competition lies in execution capabilities, not just conversation. The company is launching role-specific plugins (e.g., for data analysis, sales, design) to transform Codex into a broad knowledge work platform that automates and redefines white-collar workflows. Beyond being a tool, Codex reflects OpenAI's ambition to redefine software. New features like "Sites"—which generates interactive websites from documents—and collaborative "Annotations" aim to create a paradigm where the AI understands the goal and handles the tools and steps, functioning more like a digital colleague than traditional software. The ultimate goal is a unified experience where the user cares only about the completed task.

marsbit06/04 11:32

The Merger of Codex and ChatGPT Marks the Beginning of a Major Reshuffle in Programming Tools

marsbit06/04 11:32

GitHub, Transfixed by AI

On the night of February 9th, GitHub suffered a major outage caused by a simple configuration change—reducing a cache refresh interval from 12 to 2 hours—that triggered a cascade of failures. This was not an isolated event, but part of a broader pattern. In early 2026, GitHub experienced at least 8 major incidents, failing to meet its promised 99.9% availability. These outages stemmed from structural issues: explosive growth in load, tight service coupling, and insufficient protection against abnormal traffic. This unprecedented load is driven by AI Agents. In 2025, GitHub handled ~1 billion commits. By 2026, weekly commits reached 275 million, projecting to ~14 billion for the year—a 14x increase. AI tools like Claude Code now contribute 4.5% of all public repository commits, with weekly submissions surging 25x in just three months. AI-generated pull requests jumped from 4 million to 17 million per month in half a year. Unlike human developers, AI Agents work continuously, generating commits at a scale that overwhelms infrastructure designed for human rhythms. The surge also shattered GitHub's business model. Copilot's flat-rate pricing, based on assisting human developers, became unsustainable as Agentic AI sessions consumed resources worth hundreds of dollars for a few dollars in fees. In response, GitHub imposed usage limits and, by June 1st, shifted to a pay-per-use "AI Credits" system. Facing this new reality, GitHub realized a 10x scaling plan was insufficient. It announced a need to *redesign* its architecture for 30x current scale—decoupling services, adding fault isolation, and improving change management to prevent cascading failures. Other platforms like Stripe and AWS are facing similar challenges with AI Agents. Fundamentally, GitHub is transitioning from a human collaboration platform to an "exhaust pipe" for automated AI workflows. Its detailed post-mortem reports aim to maintain trust during this turbulent rebuild. The February outage was not just a technical glitch, but a signal of the software industry's entry into a new, AI-driven era.

marsbit06/04 10:40

GitHub, Transfixed by AI

marsbit06/04 10:40

活动图片