# Speed İlgili Makaleler

HTX Haber Merkezi, kripto endüstrisindeki piyasa trendleri, proje güncellemeleri, teknoloji gelişmeleri ve düzenleyici politikaları kapsayan "Speed" hakkında en son makaleleri ve derinlemesine analizleri sunmaktadır.

Why Did Zhipu Surge Nearly 30% in a Single Day?

"Global AI Model Unicorn" Zhipu's stock surged nearly 30% in a single day, reaching a new market cap high. The catalyst was the launch of its GLM-5.1-highspeed API, boasting a generation speed of **400 tokens per second**, setting a new global benchmark. This speed, roughly 3-5 times faster than industry leaders like OpenAI's GPT-4o and Anthropic's Claude, is achieved **without compromising the full-scale model's capabilities**. In the era of AI Agents requiring dozens of self-calls, such latency reduction is critical, transforming speed from a system metric into a determinant of intelligence limits. The breakthrough stems from a three-layer technical overhaul: 1. **TileRT Inference Engine**: Compiles the entire model into a continuous, always-on computation pipeline using "Warp Specialization," minimizing GPU idle time by having different processor groups handle data loading, computation, and communication in parallel. 2. **Heterogeneous Parallelism for MLA**: To efficiently run the GLM-5.1 model using the MLA attention mechanism, TileRT employs a heterogeneous strategy. One GPU handles sparse indexing/routing, while the others perform dense computation, optimizing for MLA's unique workflow. 3. **ZCube Network Architecture**: Replaces the standard Spine-Leaf (ROFT) network topology with a flat, dual-group interconnect. This design creates a single optimal path between any two GPUs, eliminating network congestion at scale and reducing latency. The business impact is significant: a 15% increase in cluster throughput (free extra capacity), a 40.6% reduction in tail latency (improved stability), and a one-third cut in networking hardware costs. Long-term, this innovation challenges the dominance of NVIDIA's integrated hardware-software stack (GPU+NVLink+InfiniBand), potentially benefiting manufacturers of high-density Leaf switches and optical modules while lowering the software barrier for domestic AI chips like Huawei's Ascend. The innovation proves that more can be achieved with the same compute, reshaping the infrastructure beyond just GPUs.

marsbit2 gün önce 01:23

Why Did Zhipu Surge Nearly 30% in a Single Day?

marsbit2 gün önce 01:23

YC Partner Reveals: Building an AI-Native Company from Scratch

"YC Partner Reveals: Building an AI-Native Company from Scratch" YC partner Diana Hu argues that true AI-native companies operate 1000x faster than incumbents, not by using AI for mere efficiency, but by making it the company's core operating system. This requires a fundamental shift: companies must become "queryable" to AI, with all workflows and communications generating data for AI to learn from, creating a "closed-loop" system for continuous optimization. For example, an AI agent with access to tickets, code, meetings, and customer feedback can analyze past performance and autonomously plan future engineering cycles, dramatically increasing output. In product development, the new paradigm is the "AI software factory": humans write specifications and tests, while AI agents generate the code. This transparent, data-driven model renders traditional middle management obsolete. Future AI-native companies will consist of three roles: Independent Contributors (who build/operate with AI), Directly Responsible Individuals (who own outcomes), and the AI Founder who leads by example. The critical shift is maximizing token usage over headcount. A small, AI-augmented team can outperform large traditional teams. Startups have a key advantage: they can design their entire culture and systems around AI from day one, unburdened by legacy processes. The core takeaway: Founders must personally experience AI's transformative power. The future belongs to those who embed AI into their company's DNA from the start.

marsbit05/15 01:12

YC Partner Reveals: Building an AI-Native Company from Scratch

marsbit05/15 01:12

活动图片