# GPU Related Articles

HTX News Center provides the latest articles and in-depth analysis on "GPU", covering market trends, project updates, tech developments, and regulatory policies in the crypto industry.

Bitcoin Mining Farms Are Becoming AI Factories

Bitcoin mines are transforming into AI factories. This shift is driven by the convergence of three key assets from the previous crypto cycle: infrastructure, talent, and capital. Crypto mining companies like Crusoe, CoreWeave, and Bitdeer are repurposing their core competency—securing power, land, and grid connections in remote locations—to build data centers for AI clients. These firms are signing multi-billion dollar, long-term contracts with companies like Anthropic, AWS, and Microsoft, as AI's demand for reliable, high-capacity compute surpasses the profitability of Bitcoin mining. Simultaneously, crypto entrepreneurs and engineers are applying their skills to new AI ventures. Examples include OpenSea's co-founder launching OpenRouter (an AI model aggregator), and former Coinbase engineers building Fal.ai (a generative media infrastructure platform). Their experience in building scalable, global software networks translates effectively to the AI space. Furthermore, capital accumulated during the crypto boom is now fueling AI. Figures like Jed McCaleb (co-founder of Ripple) funded Voltage Park, a large-scale GPU cloud provider. Notably, some crypto investments, like FTX's early bets on Anthropic and Cursor, have generated astronomical paper returns, demonstrating how high-risk crypto capital flowed into AI before it became mainstream. The transition is not just about repurposing hardware, but about redirecting critical resources—power infrastructure, distributed systems expertise, and venture funding—to the next technological frontier: artificial intelligence.

ChainCatcher 8h ago

Valuation $1 Billion, Nvidia Doubles Down! Is Prime Intellect Washing Off Its Web3 Label?

Prime Intellect, a decentralized AI infrastructure company founded in 2024, recently announced a $130 million Series A funding round at a $1 billion valuation, with investments from NVIDIA, Intel, and Dell's venture arms. The company claims that its annualized recurring revenue (ARR) exceeded $100 million within a year and that it serves over 6,000 enterprise clients. Initially rooted in Web3 and decentralized science (DeSci), Prime Intellect has evolved into a full-stack AI training and deployment platform. Its core technology enables distributed training of large language models across globally dispersed, heterogeneous GPU clusters. Key milestones include releasing open-source models such as INTELLECT-1 and INTELLECT-3, and launching Prime Intellect Lab, a platform that lets users train and optimize agentic models without managing their own GPU infrastructure. The company's collaboration with hardware giants, particularly NVIDIA, extends beyond investment to joint optimization of software (e.g., integrating NVIDIA Dynamo) and hardware systems. In one notable commercial case, fintech company Ramp used Prime Lab to train a specialized agent, demonstrating the platform's applied value. While growing rapidly on the commercial side, Prime Intellect has systematically played down its earlier Web3 roots and token-based incentives in its official documentation, repositioning itself as a mainstream AI infrastructure provider focused on enterprise adoption and a potential IPO.

Foresight News 07/13 02:33

Fable 5 Crafts First CUDA 'Megakernel' from Scratch, Achieves 18.7x Speedup in 2.5 Hours

AI model Fable 5 (a safety-limited version of Anthropic's Claude Mythos) has achieved a breakthrough in GPU kernel optimization. In the rigorous KernelBench-Mega benchmark—which requires fusing an entire model's compute block into a single kernel—Fable 5 autonomously wrote a highly optimized CUDA "megakernel." This kernel executes a complete Kimi-Linear W4A16 hybrid decoding task within a single GPU kernel launch, using 14 grid barriers to sequence operations. The result was a performance increase of 18.7x over the baseline on an RTX PRO 6000 GPU, significantly outpacing competitors like Claude Opus 4.8 (14.4x) and GPT-5.5 (4.34x). Notably, its performance advantage widened with longer context lengths. The model spent the majority of its 2.5-hour, 550k-token session analyzing benchmarks and theoretical limits before coding, leading to an exceptionally efficient final design. Anthropic co-founder Jack Clark described this as the beginning of a "recursive self-improvement (RSI) loop," where AI's ability to optimize its own underlying computational infrastructure could rapidly accelerate its own development cycle. This advance highlights AI's growing capability in complex, low-level engineering tasks that were previously a human stronghold.

marsbit 07/07 07:36

Meta Joins the 'Selling Shovels' Game, Zuckerberg: 'Models Can Be Slow, but GPUs Must Earn'

Meta pivots to selling AI infrastructure as internal model development lags. Facing setbacks like Gemini usage restrictions and slower-than-expected progress on its own AI agents, the company is reportedly launching "Meta Compute." This service would rent out its massive GPU capacity and data center resources—projected to reach over 10GW—to external clients. The move mirrors SpaceX's "neocloud" model, offering high-margin, flexible compute contracts. Meta also plans to host third-party models like Anthropic's Claude, creating a platform similar to Amazon Bedrock. This strategy provides immediate revenue while its in-house models (like the upcoming "Watermelon") continue development. Wall Street reacted positively, seeing it as a way to monetize heavy AI investments even if Meta's models aren't yet industry-leading.

marsbit 07/06 01:19

One Megawatt Sustains 60,000 Agents, NVIDIA GB300 Crushes Previous Generation by 20x

NVIDIA's latest GB300 NVL72 system achieves a 20x improvement in AI agent throughput per megawatt compared to its predecessor, the H200, according to a new industry benchmark called AA-AgentPerf. Where the H200 could handle roughly 2,600 concurrent agents per megawatt, the GB300 NVL72 can support approximately 61,400. The significance lies less in raw chip performance and more in the new benchmark itself. AA-AgentPerf, created by the independent firm Artificial Analysis, is the first benchmark designed specifically for "AI agent" workloads. Traditional benchmarks measure single, fixed-length requests, but AI agents operate in long, complex chains involving dozens of model calls, tool use, and ever-growing context. These create unique system pressures that older tests cannot capture. AA-AgentPerf replays real programming agent trajectories with lengthy sessions and varying input lengths. Its key metric is "agents per megawatt," measured under strict Service Level Objectives (SLOs) that guarantee a minimum token output speed per agent. It also allows real-world optimizations like KV cache reuse and speculative decoding, which older benchmarks often disable. The results highlight two key trends: rack-scale systems like the 72-GPU GB300 NVL72 are inherently more efficient than single nodes, and the architectural leap from Hopper to Blackwell (H200 to GB300) represents a systemic, not just incremental, performance gain. The GB300's advantage stems from its high-bandwidth NVLink fabric connecting all GPUs, allowing large MoE models to be efficiently distributed and parallelized. Important caveats include that the 61,400 figure represents simulated concurrent sessions, not independently running full models, and that benchmark results are a snapshot that will improve with software optimization. AA-AgentPerf is a new standard whose industry adoption remains to be seen.

marsbit 07/06 01:03
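
As a rough check on the "agents per megawatt" figures quoted in the summary above, the arithmetic can be sketched as follows. The two constants are the rounded numbers from the summary; the function names and the 10 MW example are hypothetical and only for illustration. The rounded figures imply a ratio somewhat above the 20x in the headline, and the summary does not say how the headline number was derived.

```python
# Rounded figures quoted in the summary (concurrent agents per megawatt).
H200_AGENTS_PER_MW = 2_600
GB300_NVL72_AGENTS_PER_MW = 61_400


def improvement_ratio(new: float, old: float) -> float:
    """Return how many times larger `new` is than `old`."""
    if old <= 0:
        raise ValueError("baseline must be positive")
    return new / old


def agents_supported(power_mw: float, agents_per_mw: float) -> float:
    """Agents a site could sustain, assuming throughput scales linearly with power."""
    return power_mw * agents_per_mw


ratio = improvement_ratio(GB300_NVL72_AGENTS_PER_MW, H200_AGENTS_PER_MW)
print(f"Implied improvement: {ratio:.1f}x")

# Hypothetical 10 MW site, linear scaling assumed (not claimed by the benchmark).
print(f"GB300 NVL72 at 10 MW: {agents_supported(10, GB300_NVL72_AGENTS_PER_MW):,.0f} agents")
```

The linear-scaling assumption is a simplification: the benchmark figure is measured under specific SLOs, so real deployments may not scale this cleanly.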

Zuckerberg Gave the AI Bull Market a Fright

Mark Zuckerberg and Meta inadvertently sent shockwaves through the AI stock market. News that Meta plans to sell its "excess" AI computing power to external clients triggered a trillion-dollar sell-off in AI infrastructure stocks like Nvidia and AMD, while Meta's stock rose. This seemingly simple business move—renting out idle resources—shook a core assumption underpinning the two-year AI bull market: the belief that computing power ("compute") would be perpetually scarce. This scarcity narrative had fueled valuations across the entire supply chain, from GPUs to power suppliers. Meta's motivations are layered: improving hardware utilization during non-peak R&D periods, executing a strategic pivot, and redefining AI infrastructure. Unlike rivals selling APIs, Meta's open-source approach with Llama appears aimed at building an ecosystem where it ultimately profits from the underlying compute, similar to how AWS transformed from Amazon's internal capacity. Meta is essentially offering an integrated "AI factory" service, not just raw GPU rental. The market's fear wasn't Meta selling a few chips, but the signal that GPU supply might become more shareable and efficient, transitioning the industry from a Capex-driven "hoarding" model to an Opex-driven "utilization" model. This could fundamentally reset valuation logic from scarcity to efficiency. While the sell-off reversed somewhat as investors realized this shift is long-term, the direction is set. The move marks a potential inflection point: the era of easy valuation gains from simply buying GPUs may be ending, giving way to an era where operational efficiency and return on AI assets take center stage.

marsbit 07/03 03:14

Upbit Lists Gensyn As AI Crypto Narrative Gets New Korean Market Boost

Upbit, a major South Korean crypto exchange, has announced trading support for Gensyn (GEN), a decentralized AI GPU compute network. The listing provides GEN/KRW, GEN/BTC, and GEN/USDT pairs, significantly increasing the project's exposure to South Korea's active retail market. This aligns with the persistent crypto-AI narrative, as Gensyn's model focuses on using blockchain to coordinate an open market for the vast computing power required by AI models. While such a high-profile listing can drive immediate attention and liquidity—often leading to sharp short-term price volatility—it does not guarantee sustained demand. The market will ultimately judge Gensyn based on real network activity, user adoption, and the longevity of the broader AI-in-crypto trend. The listing highlights that distribution and access via major exchanges remain crucial for altcoins, but long-term value depends on translating visibility into actual usage and utility.

bitcoinist 06/30 21:51

AI Sweeps the Globe, So Why Is Crypto + AI Facing Gloom?

The article "AI Sweeps the Globe, So Why Is Crypto + AI Facing Gloom?" analyzes the disconnect between the booming AI industry and the struggling crypto+AI sector. It argues that the problem is not flawed logic but a severe supply-demand mismatch across four key sub-sectors. First, decentralized compute and storage projects offer theoretical benefits like cost savings and data sovereignty but lack a decisive technical edge over entrenched cloud providers (AWS, GCP). Enterprises are unwilling to risk migrating to unproven infrastructure that cannot guarantee the performance and reliability needed for critical AI workloads. Second, ZKML and privacy solutions address important issues like model verification, but for most businesses these are non-urgent, long-term concerns; their current focus is core performance and ROI. Demand here is likely to be regulation-driven (e.g., EU AI Act) rather than organic. Third, AI agent infrastructure is developing foundational technology for a future multi-agent economy, but the current market phase is dominated by internal process automation within single companies, which makes this technology premature. Fourth, AI agent payments are highlighted as the only sub-sector where blockchain competes on a level playing field with traditional finance, as neither has adequately solved the challenges of machine-to-machine micropayments and real-time settlement. Overall, crypto+AI projects are building for future needs (data ownership, decentralization, transparency) that do not align with the industry's immediate priorities (performance, cost, stability). The absence of a flagship, large-scale use case further hinders mainstream adoption and capital inflow. The path forward requires either adapting to current market demands or patiently building the foundational infrastructure for the next phase of AI.

marsbit 06/29 06:45

NVIDIA's New Open-Source MoE: One Line of Import, Fine-Tuning Accelerated by 3.7x

NVIDIA has open-sourced NeMo AutoModel, a tool designed to significantly accelerate the fine-tuning of Mixture-of-Experts (MoE) large language models. By adding just one import line to existing code based on Hugging Face Transformers v5, users can achieve a 3.4x to 3.7x increase in training throughput and reduce GPU memory usage by 29% to 32% without altering their API. The key innovations include Expert Parallelism (EP) to distribute expert weights across GPUs, lowering memory pressure; DeepEP to fuse computation and communication; and TransformerEngine kernels for accelerated core operations. Benchmarks on models like Qwen3-30B-A3B show training throughput per GPU jumping from 3075 to 11340 tokens per second. The solution also enables the fine-tuning of very large models, such as the 550B parameter Nemotron 3 Ultra, which would exceed memory limits with the standard Transformers v5. Code and benchmarks are available on GitHub.

marsbit 06/26 07:29
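
To make the numbers in the summary above concrete, here is a minimal sketch that recomputes the quoted speedup from the two throughput figures and illustrates the general idea behind Expert Parallelism: each GPU holds only a slice of the experts. This is not NeMo AutoModel's API or implementation; `speedup` and `shard_experts` are hypothetical helpers, and the expert and GPU counts in the example are made-up values.

```python
def speedup(baseline_tps: float, optimized_tps: float) -> float:
    """Ratio of optimized to baseline throughput (tokens per second per GPU)."""
    if baseline_tps <= 0:
        raise ValueError("baseline throughput must be positive")
    return optimized_tps / baseline_tps


def shard_experts(num_experts: int, ep_size: int) -> list[list[int]]:
    """Toy expert-parallel placement: split expert indices into contiguous,
    equally sized blocks, one block per GPU rank."""
    if ep_size <= 0 or num_experts % ep_size != 0:
        raise ValueError("num_experts must be divisible by a positive ep_size")
    per_rank = num_experts // ep_size
    return [list(range(r * per_rank, (r + 1) * per_rank)) for r in range(ep_size)]


# Throughput figures quoted in the summary for Qwen3-30B-A3B.
print(f"Speedup: {speedup(3075, 11340):.2f}x")

# Hypothetical layout: 8 experts spread over 4 GPUs, so each rank stores
# only a quarter of the expert weights.
for rank, experts in enumerate(shard_experts(8, 4)):
    print(f"rank {rank}: experts {experts}")
```

The contiguous-block placement is just one simple choice; real systems also have to route tokens to whichever rank owns the selected expert, which is where the communication cost that the summary attributes to DeepEP comes in.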

Solstice and TensorX to Buy $1 Billion in AI Infrastructure to Support EU Sovereign AI Demand

Solstice and TensorX have announced a partnership to finance up to $1 billion in European sovereign AI infrastructure, including AI hardware and data-center build-out to meet EU demand for localized compute. TensorX operates a fleet of NVIDIA GPUs in EU data centers with strict data residency. Solstice, an onchain settlement protocol, will provide the financing and launch a new yield-bearing asset called aiUSX. This asset allows companies to deploy capital earmarked for future AI inference costs into infrastructure lending, generating yield while keeping funds liquid. Initially capped at $5 million, aiUSX aims to help companies offset rising AI expenses. Both companies are part of the Deus X Capital ecosystem.

TheNewsCrypto 06/25 16:41
