# Пов'язані статті щодо AI Agent

Центр новин HTX надає останні статті та поглиблений аналіз на тему "AI Agent", що охоплює ринкові тренди, оновлення проєктів, технологічні розробки та регуляторну політику в криптоіндустрії.

Blocked Its Own Treasure, WeChat AI Steps Up

Tencent's stock surged over 10% on June 2nd amid reports that WeChat, with 1.43 billion monthly users, is finalizing tests for a native AI Agent. The reported feature, accessible by swiping right from the main interface, allows users to issue commands in natural language. The AI then decomposes tasks and automatically calls upon relevant Mini Programs within WeChat to complete actions like ordering food, booking tickets, or making payments, creating a closed-loop service execution system. This strategic shift follows the internal conflict and subsequent "blocking" of Tencent's standalone AI app, Yuanbao, by WeChat for violating sharing rules during a 2026 Spring Festival promotion. The incident highlighted a lack of internal consensus and exposed the weakness of competing in the standalone AI assistant arena against rivals like ByteDance's Doubao (345M MAU) and Alibaba's Qianwen. The new WeChat AI Agent aims to leverage WeChat's unique assets—its massive user base, standardized Mini Program APIs, WeChat Pay, and identity system—to move from simple content generation to actual task execution. Analysts note this changes the competitive landscape from model benchmarks to which AI can connect to more real-world services. However, success depends on key variables: the capability of Tencent's underlying Hunyuan model, managing massive inference costs, and redesigning incentives for Mini Program developers whose traffic might be bypassed. The move is seen as an attempt to keep user service intent within WeChat's ecosystem as AI begins to redefine how users access services.

marsbit2 год тому

Blocked Its Own Treasure, WeChat AI Steps Up

marsbit2 год тому

AI PCs Are Here, Going Toe-to-Toe with 120B Models Locally! NVIDIA Redefines the "Personal AI Computer" Foundation with RTX Spark

NVIDIA has redefined the "AI PC" standard with the launch of the RTX Spark super chip at GTC 2026. Boasting 1 petaflop (1000 TOPS) of AI performance, it dwarfs the 45-50 TOPS NPUs in current AI PCs. The SoC features a Blackwell GPU, a 20-core Arm CPU co-designed with MediaTek, and crucially, up to 128GB of unified memory shared between CPU and GPU. This architectural shift enables local execution of 120-billion-parameter large language models with million-token context windows, a massive leap from the 9B-40B models typical on current consumer hardware. Beyond AI, use cases include 12K video editing and high-fps ray-traced gaming. Key to enterprise adoption is a security collaboration with Microsoft. Windows security is upgraded, and NVIDIA's OpenShell sandbox runtime is integrated to safely contain AI agent actions. Major software support comes from Adobe, which announced a deep,底层-level rewrite of Photoshop and Premiere to leverage the unified memory for up to 2x performance gains. Six OEMs, including Dell, HP, Lenovo, and Microsoft Surface, will release RTX Spark-based轻薄本 and compact desktops this fall. However, questions remain about real-world performance,功耗, thermal management in laptops, pricing, and the actual impact of the OpenShell sandbox. The RTX Spark represents a fundamental power shift in the PC industry, moving from an x86 CPU-centric model to a GPU-centric SoC platform, but its ultimate success hinges on the upcoming product rollouts and ecosystem validation.

marsbit06/01 06:41

AI PCs Are Here, Going Toe-to-Toe with 120B Models Locally! NVIDIA Redefines the "Personal AI Computer" Foundation with RTX Spark

marsbit06/01 06:41

Jensen Huang: Vera Rubin Full Mass Production, AI Agent a Key Focus, Challenging Intel to Target the Next-Generation AI PC Gateway

NVIDIA CEO Jensen Huang delivered the keynote speech at GTC Taipei 2026, announcing several major product launches and strategic directions. The company's Vera Rubin architecture is now in full-scale production, with OpenAI, Anthropic, and SpaceX among the first customers. NVIDIA highlighted AI Agent as a key future focus, introducing the Vera CPU designed for AI agents and the Vera BlueField-4 STX for secure, chip-level AI storage processing. A significant move involves challenging Intel in the PC market. NVIDIA, in collaboration with MediaTek, is developing the RTX SPARK PC chip (manufactured by TSMC) for Windows systems, set to launch this fall for laptops and desktops. This signals NVIDIA's push into the next-generation AI PC arena, aiming to provide a vertically integrated core computing platform for the entire Windows ecosystem, similar to Apple's approach. Other announcements include the new Nemotron 3 Ultra AI model and the NVIDIA DSX platform, described as a complete "playbook" for building AI factories, allowing performance simulation and validation before physical deployment. In automotive, the DRIVE Hyperion platform was positioned as a global robotaxi platform, with major Chinese automakers like BYD, Geely, Zeekr, Xiaomi, and Pony.ai already adopting or developing autonomous driving solutions based on it. The Alpamayo 2 super open inference model for robotaxis was also introduced. For robotics, NVIDIA unveiled the Isaac GR00T humanoid robot reference platform for academic research and a large open-source agent tools and skills suite for Physical AI. The company plans to collaborate with global humanoid robot manufacturers, including China's Unitree, whose H2 Plus robot served as the reference hardware for the GR00T platform demonstration.

marsbit06/01 06:14

Jensen Huang: Vera Rubin Full Mass Production, AI Agent a Key Focus, Challenging Intel to Target the Next-Generation AI PC Gateway

marsbit06/01 06:14

We Captured Thousands of Job Postings and Discovered ByteDance is Reviving Smartphone R&D

This article analyzes ByteDance's recent hiring activities, revealing a potential restart of smartphone hardware development. By scraping and analyzing thousands of ByteDance job postings, the authors identify three key categories: roles for the "Doubao Phone Assistant" (an AI agent), for a "Mobile OS" (system-level development), and for hardware/engineering positions in Shenzhen (a manufacturing hub). The piece traces the context to the 2025 launch of the "Doubao Phone," a concept device that integrated an AI agent directly into a smartphone, allowing it to see the screen, operate apps, and perform tasks like shopping or booking tickets. While innovative as an early AI Agent prototype, it faced operational restrictions from major platforms like WeChat and Alipay. The new hiring signals a deeper commitment. "Doubao Phone Assistant" roles focus on core Agent capabilities (task execution, memory, cross-app operation). "Mobile OS" positions involve deep system work (kernel, chip adaptation, power/thermal management) necessary for a responsive, always-on AI. Shenzhen-based hardware roles (structure design, testing, production) suggest preparation for physical device manufacturing. The article concludes that in the AI era, where phones may become an Agent's "body," controlling the operating system and hardware is critical. For a company like ByteDance, being merely an app within others' ecosystems is no longer sustainable if it aims to own the next-generation user interface. Therefore, while a consumer phone brand isn't confirmed, ByteDance is decisively moving beyond app development into the complex domain of system-level and hardware-integrated AI.

marsbit05/25 07:31

We Captured Thousands of Job Postings and Discovered ByteDance is Reviving Smartphone R&D

marsbit05/25 07:31

An AI Read SpaceX's Prospectus and Wrote This Investment Memo in 12 Minutes

An AI agent autonomously analyzed SpaceX's 226MB S-1 filing, purchased real-time market data on-chain for $1.87, and generated a comprehensive investment memo in 12 minutes. The memo concludes a "Hold" recommendation. Bull Thesis: SpaceX holds a near-monopoly in commercial launch (80% of global orbital mass since 2023), operates the profitable Starlink business (10.3M subscribers, $7.2B adj. EBITDA), and is vertically integrated from rockets to AI via the xAI acquisition. Starlink alone is a standout, high-margin business. Bear Thesis: The AI division is a massive cash burn ($6.4B operating loss on $3.2B revenue in 2025). True debt obligations approach ~$42B, not the headline $29B, due to bridge loans and X-related debt. Significant contingent liabilities exist, including a potential $10B fee from a Cursor option agreement. The company faces concentrated counterparty risk (e.g., a $45B Anthropic contract), slowing revenue growth, and complex governance as a controlled company with four share classes. Valuation anchors Starlink's standalone value at ~$84B (applying Iridium's 7.4x sales multiple), suggesting the current ~$500B+ IPO target prices in immense future execution risk for Starship and AI. Key risks include Starship delays, accelerating AI losses, and underwriter conflicts (the IPO's lead banks are also lenders on the $20B bridge loan it aims to refinance). Investment triggers: upgrade to "Overweight" if priced ≤$350B and Starship meets milestones; downgrade to "Pass" if priced >$510B or key risks materialize.

marsbit05/25 04:23

An AI Read SpaceX's Prospectus and Wrote This Investment Memo in 12 Minutes

marsbit05/25 04:23

Understanding the New Economic Model of Tokenization

Understanding the New Token Economics Model The commercialization of AI applications is evolving from selling software and subscriptions to selling token call capacity. Tokens, the fundamental unit of information processing for large language models (LLMs), have become the basis for API billing and consumption. With call volumes exploding, tokens themselves are now being traded—procured, routed, split, and resold—forming a new intermediary market. This layer connects upstream LLM providers with downstream developers and enterprises, acting as a global wholesale-to-retail liquidity network. The rise of this business is fueled by a massive surge in China's daily token call volume—growing over a thousandfold from 100 billion in early 2024 to over 140 trillion by March 2026—and significant improvements in domestic LLM capabilities, which are now competitive globally. The core value of token distribution platforms extends beyond simple arbitrage. Key functions include aggregating multiple models (like GPT, Claude, and domestic models such as Kimi and DeepSeek) under a unified API, lowering network and payment barriers, and providing enterprise services like model selection, prompt engineering, and system integration. Profit models are diversifying: (1) resale margins; (2) technical premiums from proprietary inference acceleration (e.g., reducing costs to 1/10 of the industry standard); and (3) enterprise value-added services. High-consumption scenarios like marketing, short-form video, gaming, and e-commerce are primary drivers. Investment opportunities are seen in both companies with strong model capabilities (e.g., Alibaba, Tencent, MiniMax) and those with high-consumption client scenarios (e.g., marketing agencies with overseas reach). However, risks are significant: low entry barriers leading to intense competition, capital requirements and bad debt risks from advance payments, and dependency on policy changes from upstream LLM providers who control API pricing and access.

marsbit05/19 02:54

Understanding the New Economic Model of Tokenization

marsbit05/19 02:54

No Coding Required: Build Your First AI Agent in 2 Days (Complete Tutorial)

A No-Code Guide to Building Your First AI Agent in a Weekend This article presents a weekend, zero-code tutorial for beginners to build a functional AI Agent using tools like Claude. It clarifies the core difference between a chatbot, which responds to queries, and an Agent, which autonomously plans and executes multi-step tasks using tools to deliver a final result. The process is broken into four stages over two days: 1. **Saturday Morning: Understanding Agents.** Learn that an Agent requires a clear Goal, a Plan, necessary Tools, and an execution Loop. Identify a simple, multi-step task from your own work/life as your first project. 2. **Saturday Afternoon: Building with Claude.** Create a one-page "Agent Blueprint" answering: the Goal, sequential Steps, required Tools, the desired Output format, and error-handling rules. Implement this blueprint in Claude (Desktop Cowork or web Projects) and run the Agent for the first time. 3. **Sunday Morning: Debugging & Optimization.** Review the initial (often 60-70% accurate) output. Identify flaws, trace them back to vague instructions in your blueprint, and refine it with more specific criteria and error handling. Iterate this run-review-refine cycle 3-4 times to reach ~90% reliability. 4. **Sunday Afternoon: Expansion.** Apply the learned workflow to quickly build a second, different Agent (e.g., for research, content repurposing, or meeting prep), experiencing the compounding efficiency gains. The core skill is not writing a perfect blueprint initially, but rapidly iterating based on output. By the end of the weekend, you'll have built two usable Agents, moving beyond just chatting with AI to automating multi-step workflows, fundamentally changing how you approach repetitive tasks.

marsbit05/16 15:19

No Coding Required: Build Your First AI Agent in 2 Days (Complete Tutorial)

marsbit05/16 15:19

Auto Research Era: 47 Tasks Without Standard Answers Become the Must-Test Leaderboard for Agent Capabilities

The article introduces Frontier-Eng Bench, a new benchmark for AI agents developed by Einsia AI's Navers lab. Unlike traditional tests with clear answers, this benchmark presents 47 complex, real-world engineering tasks—such as optimizing underwater robot stability, battery fast-charging protocols, or quantum circuit noise control—where there is no single correct solution, only continuous optimization towards a limit. It shifts AI evaluation from static knowledge retrieval to a dynamic "engineering closed-loop": the AI must propose solutions, run simulations, interpret errors, adjust parameters, and re-run experiments to iteratively improve performance. This process tests an agent's ability to learn and evolve through long-term feedback, much like a human engineer tackling trade-offs between power, safety, and performance. Key findings from the benchmark reveal two patterns: 1) Improvements follow a power-law decay, becoming harder and smaller as optimization progresses, and 2) While exploring multiple solution paths (breadth) helps, sustained depth in a single path is crucial for breakthrough innovations. The research suggests this marks a step toward "Auto Research," where AI systems can autonomously conduct continuous, tireless optimization in scientific and engineering domains. Humans would set high-level goals, while AI agents handle the iterative experimentation and refinement. This could fundamentally change research and development workflows.

marsbit05/13 07:06

Auto Research Era: 47 Tasks Without Standard Answers Become the Must-Test Leaderboard for Agent Capabilities

marsbit05/13 07:06

活动图片