# Сопутствующие статьи по теме LLM

Новостной центр HTX предлагает последние статьи и углубленный анализ по "LLM", охватывающие рыночные тренды, новости проектов, развитие технологий и политику регулирования в криптоиндустрии.

From a Lunch Table to an Infinite Universe: Fei-Fei Li Bets on AI's Next Dimension

From a Lunch Table Conversation to an Infinite Universe: Fei-Fei Li Bets on AI's Next Frontier - Spatial Intelligence In an era dominated by large language models, AI pioneer Fei-Fei Li argues that true understanding requires spatial intelligence — the ability to perceive, reason, and interact within the physical 3D/4D world. She points to evolutionary history: spatial perception drove the Cambrian explosion 540 million years ago, while language is a far more recent, inherently "lossy" way to encode reality. Current models struggle with basic spatial tasks a child can do, like counting chairs in a video. Her company, World Labs, is pioneering this shift with "Marble," a model that generates navigable, consistent 3D worlds from text, images, or simple 3D inputs—distinct from video generators like Sora. Though smaller than models like GPT-5, due to scarce 3D data and early-stage scaling laws, Marble is already used in gaming, robot training (by NVIDIA), architectural design, and personalized therapy for conditions like OCD and acrophobia. Li envisions this technology enabling "infinite universes" for creativity, social interaction, and more. However, she cautions against utopian or dystopian extremes, advocating for a measured vision where AI enhances human dignity and prosperity, akin to how electricity transformed civilization. The journey is long — as evidenced by the 20-year path to viable autonomous vehicles — but the direction is clear: for AI to move from merely talking about the world to truly understanding and acting within it.

marsbit05/27 00:14

From a Lunch Table to an Infinite Universe: Fei-Fei Li Bets on AI's Next Dimension

marsbit05/27 00:14

From Power Infrastructure to Token Economy: The 'Seven-Layer Cake' of the AI Industry Chain

From Power Grid to Token Economy: The AI Industry's "Seven-Layer Cake" The AI industry is shifting from a "model-centric" paradigm focused on massive training to a "token-centric" industrial era driven by inference demand. This new phase revolves around the production, distribution, scheduling, and consumption of tokens—the units of computation used by AI agents for every interaction and task. The article proposes a "seven-layer cake" framework for the AI economy: 1. **Power**: The foundational energy source, with competition shifting to securing stable, low-cost electricity. 2. **AIDC (AI Data Centers)**: Large-scale "Token factories." A trend toward smaller, modular, and regionally deployed AI Factories is emerging for efficiency and proximity to users. 3. **GPU**: The core production hardware for tokens. While NVIDIA dominates, competition exists from AMD, ASIC makers, and Chinese chipmakers, with a growing focus on inference efficiency. 4. **LLMs**: The "engines" that generate tokens. The competition is evolving beyond model size to prioritize factors like token cost, inference efficiency, and operational synergy with infrastructure. 5. **Token Distribution**: The "grid" that allocates and rents out compute resources, led by cloud giants and specialized AI-native platforms. 6. **Token Optimization & Intelligent Scheduling**: The critical "brain" layer that intelligently routes tasks (e.g., to local, cloud, or edge models) for optimal cost, latency, and privacy—maximizing the value of each token. 7. **AI Agents & Models**: The end consumers of tokens. The vision involves billions of AI agents working and interacting concurrently, consuming vast amounts of tokens. Currently, the industry faces fragmentation and inefficiencies between these layers. The true "mass adoption era" of AI will begin only when this seven-layer infrastructure is fully integrated and operates as a cohesive, intelligent network—transforming AI from a software tool into a global industrial system spanning energy, hardware, and compute logistics.

marsbit05/26 05:43

From Power Infrastructure to Token Economy: The 'Seven-Layer Cake' of the AI Industry Chain

marsbit05/26 05:43

Investors Frantically Snap Up AI Firms with 'No Profits': A High-Stakes Gamble on 'the Right to Define the Future'

"Investors are pouring billions into Chinese AI startups with no profits, betting on the future of the industry. A state-backed fund is reportedly in talks to lead DeepSeek's funding at a $45B valuation, just weeks after it was valued at $10B. Along with companies like Zhipu AI, MiniMax, and Kimi (backed by Meituan and Alibaba), their combined valuation exceeds $140B. This isn't a typical venture capital play. Investors are paying for 'future definition rights'—a chance to set the standards for the next tech era. Morgan Stanley notes a 6-12 month window for this scarcity premium before more AI companies go public. Despite massive losses, these companies show strong growth. Zhipu AI's API revenue grew 60x, Kimi's annual recurring revenue doubled to $200M in a month, and MiniMax turned its gross margin positive, with over 70% of revenue from overseas. Their valuations vastly exceed profitable firms like iFlytek. Crucially, technical progress underpins this growth. DeepSeek's latest model boasts costs just 1% of a leading competitor's, while Zhipu AI has raised API prices due to high demand. However, gaps with top global models remain. Tech giants like Tencent and Alibaba, investing heavily while describing their own AI efforts as 'leaky boats,' are also investing in these startups as a hedge. Key risks loom: the closing scarcity window, computing power bottlenecks limiting growth, and the sustainability of DeepSeek's cost-advantage model. With state capital now a major player, the success of these companies has become a strategic national concern. The next year will test if their soaring valuations can be justified by future profits."

marsbit05/26 02:06

Investors Frantically Snap Up AI Firms with 'No Profits': A High-Stakes Gamble on 'the Right to Define the Future'

marsbit05/26 02:06

TechFlow Intelligence Report: Huawei Unveils "Tao" Law, Semiconductor Sector Surges; Meta Cuts 10% of Workforce

"TechFlow Intelligence Brief": Huawei's new "Tau Law" in semiconductors and Meta's 10% layoffs headline today's tech landscape. In AI, breakthroughs include an AI solving 9 high-difficulty pure math problems for just a few hundred dollars each, and DeepSeek's new Reasonix programming agent challenging commercial models. However, research highlights a "constraint decay" issue in LLM-generated backend code. Open-source model Qwen 3.6 27B achieves high speeds on older GPUs, sparking debate on NVIDIA's future dominance. In Crypto/Web3, Ethereum Foundation plans to downsize, possibly reducing ETH selling pressure. Fake news about CZ ignited a meme coin frenzy, showing the market's sensitivity to celebrity narratives. DeFi sees a new trend in HELOC-backed Real World Asset (RWA) pools. The chip sector is stirred by Huawei's proposed "Tau (τ) Law," aiming for 1.4nm-equivalent performance by 2031 through architectural innovation, causing related stocks to surge. A report notes memory now constitutes nearly two-thirds of AI chip cost. Meanwhile, executives at 7 Chinese semiconductor firms sold shares after price peaks. Meta announces 10% layoffs as it pivots to AI. Google's CEO faced student protests over AI ethics during a speech, and the company controversially published a Chromium exploit before patching was complete. Xiaomi permanently banned installers for AC installation fraud. In US stocks, AMD is seen as a potential challenger to NVIDIA, while a survey reveals 99% of CEOs expect AI-driven layoffs within two years. Palantir secured a government contract for employee monitoring, raising privacy concerns. Macro developments include a 6% drop in WTI crude oil on hopes for reopened Hormuz Strait, and silver prices rising over 4%. Global oil inventories are nearing critical lows. New trends highlight a "audio prompt injection" attack targeting AI voice assistants via hidden commands, and CBS pausing takedowns of pirated Stephen Colbert episodes after public pushback. The underlying narrative connects AI's cost-effective problem-solving, widespread planned job displacement, and Huawei's challenge to Western tech hegemony, framing the AI and chip race as a broader contest over employment, geopolitics, and the very definition of intelligence.

marsbit05/25 10:50

TechFlow Intelligence Report: Huawei Unveils "Tao" Law, Semiconductor Sector Surges; Meta Cuts 10% of Workforce

marsbit05/25 10:50

Agentic Design Patterns: A Book That Made Me Re-Understand "What Is an Agent, Really?"

"Agentic Design Patterns" is a 2025 book by Antonio Gullí, a Google engineering director, which offers a systematic framework for AI Agent development through 21 design patterns. A core contribution is the "Four Levels of Agency": Level 0 (bare LLMs) are not true agents. Level 1 agents actively decide when and how to use tools. Level 2 agents engage in strategic planning, context engineering (curating and filtering information), and self-reflection. Level 3 involves multi-agent collaboration with defined communication topologies. The book introduces **Context Engineering** as a superset of prompt engineering, managing four layers of information for the agent: system prompts, external data, implicit context (user history, environment), and feedback loops for automated optimization. A key pattern is **Reflection (Producer-Critic)**, where two distinct agents with different prompts collaborate iteratively—one produces output, the other critiques it—until quality is satisfactory or a max iteration limit is reached. For **Memory**, a three-layer model is proposed: Session (ephemeral conversation context), State (temporary task data), and Memory (persistent, long-term storage). Regarding **Multi-Agent Systems**, the book advises against unnecessary complexity, recommending simple topologies like Supervisor or Peer-to-Peer based on task needs. It emphasizes perfecting a single Level 2 agent before moving to multi-agent setups. The author concludes with three actionable takeaways: 1) Add a Critic agent to existing workflows, 2) Practice Context Engineering beyond simple prompts, and 3) Avoid premature multi-agent complexity; first master a robust single agent. The book provides a practical map, codifying common challenges like reflection, memory, and coordination into reusable patterns, saving developers from reinventing foundational solutions.

链捕手05/25 04:43

Agentic Design Patterns: A Book That Made Me Re-Understand "What Is an Agent, Really?"

链捕手05/25 04:43

DeepSeek Announces Permanent Price Cut, But Liang Wenfeng Is Not Trying to Be a "Cyber Bodhisattva"

DeepSeek has announced a permanent 75% discount on its V4-Pro API, significantly reducing its token prices. This move stands out as a major industry-wide price cut while competitors like Anthropic, OpenAI, and Google have been quietly raising theirs. The article contrasts this strategy with the broader trend of AI becoming more expensive, citing examples of companies like Microsoft and Uber struggling with high token costs as usage soars. While CEO Liang Wenfeng is hailed by some as a "Cyber Bodhisattva" for this普惠 approach, the article argues this is a strategic business choice, not mere altruism. DeepSeek's ability to maintain low prices is attributed to several structural advantages: lower-cost AI talent in China, the impending use of domestic昇腾 hardware for further cost reductions, and, most critically, access to China's cheaper and more abundant energy infrastructure, which drastically reduces the electricity costs dominating AI operations. The analysis suggests that for many commercial applications, a "good enough" model that is radically cheaper (e.g., 1% to 11% of GPT-5.5's cost) is more valuable than the absolute top-tier model. This allows for vastly more experimentation and iteration within a budget. Therefore, as AI generally becomes more expensive, DeepSeek's cost-competitiveness—rooted in China's energy and talent advantages—becomes its core strategic value and differentiator in the global market.

marsbit05/24 12:19

DeepSeek Announces Permanent Price Cut, But Liang Wenfeng Is Not Trying to Be a "Cyber Bodhisattva"

marsbit05/24 12:19

Why Did Zhipu Surge Nearly 30% in a Single Day?

"Global AI Model Unicorn" Zhipu's stock surged nearly 30% in a single day, reaching a new market cap high. The catalyst was the launch of its GLM-5.1-highspeed API, boasting a generation speed of **400 tokens per second**, setting a new global benchmark. This speed, roughly 3-5 times faster than industry leaders like OpenAI's GPT-4o and Anthropic's Claude, is achieved **without compromising the full-scale model's capabilities**. In the era of AI Agents requiring dozens of self-calls, such latency reduction is critical, transforming speed from a system metric into a determinant of intelligence limits. The breakthrough stems from a three-layer technical overhaul: 1. **TileRT Inference Engine**: Compiles the entire model into a continuous, always-on computation pipeline using "Warp Specialization," minimizing GPU idle time by having different processor groups handle data loading, computation, and communication in parallel. 2. **Heterogeneous Parallelism for MLA**: To efficiently run the GLM-5.1 model using the MLA attention mechanism, TileRT employs a heterogeneous strategy. One GPU handles sparse indexing/routing, while the others perform dense computation, optimizing for MLA's unique workflow. 3. **ZCube Network Architecture**: Replaces the standard Spine-Leaf (ROFT) network topology with a flat, dual-group interconnect. This design creates a single optimal path between any two GPUs, eliminating network congestion at scale and reducing latency. The business impact is significant: a 15% increase in cluster throughput (free extra capacity), a 40.6% reduction in tail latency (improved stability), and a one-third cut in networking hardware costs. Long-term, this innovation challenges the dominance of NVIDIA's integrated hardware-software stack (GPU+NVLink+InfiniBand), potentially benefiting manufacturers of high-density Leaf switches and optical modules while lowering the software barrier for domestic AI chips like Huawei's Ascend. The innovation proves that more can be achieved with the same compute, reshaping the infrastructure beyond just GPUs.

marsbit05/23 01:23

Why Did Zhipu Surge Nearly 30% in a Single Day?

marsbit05/23 01:23

Detained for 37 Days: The First Wave of People Who Got Rich from 'AI Gateways' Are Starting to Go to Jail

A prominent AI proxy service operator was reportedly detained for 37 days and is now on bail pending trial, highlighting the legal risks in China's booming but unregulated AI intermediary market. These services act as "AI scalpers," providing domestic users with access to restricted overseas models (like OpenAI, Claude) by bundling APIs, handling payments, and bypassing network blocks, all for a fee. Their controversial profitability stems from practices like bulk-registering accounts to resell free credits, exploiting refund policies, overcharging for tokens, substituting cheaper models, and illegally selling user conversation data. Major figures, including cryptocurrency entrepreneurs, are now entering this space. Legally, these operations face severe risks. Their core model often involves unauthorized API access and operating without required telecom licenses, potentially constituting illegal business operations. They fail to meet data security obligations for the vast amounts of user data they process, risking charges for failing to fulfill network security duties. Crucially, the unauthorized collection and sale of user data, which can include personal and commercial secrets, easily meets the threshold for the crime of infringing on personal information. The case underscores a critical juncture for the AI industry. While proxies lower access barriers, they expose user data to unsecured middlemen and undermine the business models of AI developers, forcing them to divert resources to security and distorting market value perceptions. The article argues that the industry's sustainable future depends on building trust, protecting data, and fostering compliant competition, moving away from its current "wild growth" phase.

marsbit05/21 14:40

Detained for 37 Days: The First Wave of People Who Got Rich from 'AI Gateways' Are Starting to Go to Jail

marsbit05/21 14:40

活动图片