Smart Money Hoards $40 Billion in Cash, Retail Bets $2.6 Trillion on Calls: The Critical Moment of the US Stock Market's AI Narrative

marsbitPublicado a 2026-05-09Actualizado a 2026-05-09

Resumen

Title: Smart Money Hoards $40 Billion in Cash, Retail Traders Bet $2.6 Trillion on Call Options: The Tipping Point for the AI Narrative in U.S. Stocks The U.S. stock market is experiencing a striking divergence. While the S&P 500 hits new highs, the financial sector is down 6% year-to-date, underperforming more than during the 2008 and COVID crises. In contrast, a record $2.6 trillion in S&P 500 call options traded in a single day, and the Philadelphia Semiconductor Index RSI is at its highest since 1999. This reflects a clear split: "smart money" is retreating while retail traders chase gains. Key data points highlight this critical juncture in the AI-driven rally: 1. SoftBank had to cut its $10 billion loan target against its OpenAI stake to $6 billion, as lenders questioned the valuation of the private AI giant, signaling primary market skepticism. 2. The explosive $2.6 trillion daily options volume, with 60% being calls, is described by a Goldman Sachs partner as a "semi-irrational chase," drawing parallels to the 1999 tech bubble. 3. The financial sector's severe underperformance relative to the S&P 500 is a classic technical warning signal, indicating potential underlying economic stress. 4. Apollo Global Management, despite strong earnings, is building a $40 billion cash buffer in its insurance business, preparing for what its CEO calls a 30-35% probability of an exogenous shock from geopolitics, inflation, and AI's economic disruption. 5. Consumer behavior mirr...

Author: Deep Tide TechFlow

The US stock market is staging an exceptionally unusual divergence: while the S&P 500 repeatedly hits new highs, the financial sector has fallen 6% year-to-date, underperforming the broader market more severely than during the 2008 financial crisis and the Covid shock; meanwhile, the single-day notional trading volume of S&P 500 call options has exceeded $2.6 trillion, a record high, and the RSI for the Philadelphia Semiconductor Index has risen to its highest level since 1999. In the primary market, SoftBank's financing plan, using OpenAI equity as collateral, was forced to cut its target size from $100 billion to $60 billion; on the institutional side, Apollo has hoarded roughly $40 billion in cash within its insurance business. Smart money is pulling back, retail is piling in.

The narrative around AI computing power scarcity continues to drive the accelerated rise of the US tech sector, but the capital flow, sentiment, technicals, and institutional behavior are sending conflicting signals. This divergence itself is more worthy of scrutiny than any single data point.

Lenders Skeptical of OpenAI Valuation, SoftBank Cuts Financing Target by 40%

According to Bloomberg citing sources familiar with the matter, SoftBank has reduced the target size of its margin loan, collateralized by OpenAI equity, from $100 billion to a minimum of $60 billion, a 40% cut. The core resistance lies in valuation, with some investors being courted to participate harboring doubts about how to reasonably value this non-public company, OpenAI. Potential lenders involved in the discussions include private credit institutions, financial institutions, and hedge funds, with related negotiations starting as early as mid-March.

OpenAI's own fundamentals are also under pressure. The company failed to meet its monthly sales targets multiple times in early 2026, with rival Anthropic steadily eroding its share in programming and enterprise markets, and the internal goal of reaching 1 billion weekly active users for ChatGPT by the end of last year was also missed. OpenAI CFO Sarah Friar refuted this, stating the company is meeting its goals and seeing product demand "rise vertically."

SoftBank's own financial leverage is also at historically high levels. The group recently committed an additional $30 billion to OpenAI, having already invested over $30 billion cumulatively; the $40 billion loan completed this March set a record for its US dollar loan size, with part of the funds used to support the latest follow-on investment in OpenAI.

Capital market judgment on SoftBank has shown significant divergence. SoftBank's stock has risen 39% year-to-date, significantly outperforming the 12.3% gain of Japan's benchmark Topix index; however, its credit default swap spreads have widened by about 61 basis points this year. In March, S&P Global Ratings downgraded SoftBank's credit outlook from "stable" to "negative," citing that the investment in OpenAI could harm the company's liquidity and asset credit quality.

The pricing divergence for top AI assets in the primary market is manifesting in the most direct way: lenders are willing to lend 40% less than SoftBank wants to borrow.

Options Market Hits $2.6 Trillion in a Single Day, Goldman Sachs Partner Calls It "Semi-Irrational"

The secondary market presents another picture. On Thursday, the notional trading volume of S&P 500 index (SPX) call options broke through $2.6 trillion, setting a historical record, with nearly 60% of all SPX options that day being calls. Rich Privorotsky, head of Goldman Sachs's One-Delta trading desk, characterized the current US stock market as being in a "spot-up, vol-up chase mode."

The weekly RSI for the Philadelphia Semiconductor Index (SOX) has risen to its highest level since 1999. A Goldman Sachs partner stated bluntly: "It feels like we are in a semi-irrational chase mode." Privorotsky cited 1999 as a more fitting historical analogy, when numerous telecom equipment suppliers had overflowing order books, providing the "physical bottleneck narrative" supporting that rally, highly similar to the current logic around computing power scarcity and AI infrastructure deployment.

QQQ implied volatility has spiked sharply with the rally, with its spread over SPX volatility widening to over 6 volatility points. Goldman's volatility trading desk described this day as "one of the craziest trading days in the past few weeks." Notably, the number of S&P 500 constituent stocks experiencing single-day moves exceeding 3 standard deviations reached 35, the highest level since February 3rd this year.

Bank of America's global equity derivatives research team also pointed out that the S&P 500's latest record-breaking rally is reminiscent of the late 1920s and the 1990s internet bubble, but market pricing for "tail options" remains below the level implied by realized volatility. In layman's terms, the market is chasing the rally but unwilling to pay for downside risk.

Goldman Sachs warned that the dynamics of "spot-up, vol-up" have limited the room for systematic strategies to add further positions; Commodity Trading Advisors (CTAs) are already essentially back to full long positions, and as realized volatility in the upward direction rises, the marginal incremental demand from volatility control strategies is also waning. In other words, programmatic buying from the institutional side is nearing its limit, with subsequent upward momentum relying more on retail and sentiment-driven funds.

XLF Hits Weakest Relative to S&P 500 Since 1998, Financial Stocks Sound Alarm

If the options market is an extreme reading of sentiment, then the relative performance of the financial sector is a warning signal from the technical front.

The US financial sector has fallen about 6% year-to-date, while the S&P 500 index has risen 7% over the same period and closed at a record high 14 times in the past 17 trading days.

Related data was analyzed in The Crack Beneath the S&P's New High: Financial Sector Down 6% Year-to-Date, $2 Trillion Private Credit Undercurrent Spreading.

The financial sector is seen as a leading indicator due to its core role as a provider of economic liquidity. Concerns in the private credit market are believed to be a major reason for the pressure on the financial sector. Melissa Brown, Global Head of Investment Decision Research at SimCorp, noted that the financial system is highly interconnected, and related risks "have the potential to spread more widely than currently anticipated." He suggested investors might consider gradually "trimming chip stocks" rather than continuing to chase the rally, and certainly not injecting new funds into the market.

Apollo Hoards $40 Billion in Cash, Rowan Estimates 35% Probability of Exogenous Shock

Defensive moves have begun on the institutional side. Marc Rowan, CEO of Apollo Global Management, stated while announcing quarterly results that he estimates the probability of an exogenous shock occurring at 30% to 35%, far above the usual level.

Rowan attributed this risk to the convergence of three forces: a full-scale reset in geopolitics, inflationary pressures driven by trade tariffs and immigration policies, and the profound reshaping of the economic structure by AI. He characterized the current AI wave as "without question the biggest tech cycle" of his career and specifically highlighted the fragility of government finances—compared to corporations and consumers, government balance sheets are already under pressure.

Apollo has taken a series of defensive measures: shifting its fixed-income portfolio towards higher credit quality, reducing exposure to high-risk sectors like software, and hoarding about $40 billion in cash within its insurance business. "This means we are investing with an eye to protecting capital, making sure we can go through the cycle, and if there is an adjustment, we frankly expect it to happen."

Rowan saved his sharpest criticism for industry rivals. He warned that not all insurance companies are running their businesses as they should, with some relying on what he called "outrageous" practices, including Cayman Islands offshore structures, complex mortgage arrangements, and aggressive credit assumptions, making some balance sheets appear stronger than they actually are. "We are concerned about contagion," he said.

It's worth noting that Apollo's performance this quarter was stellar, with assets under management surpassing $1 trillion and fee-related earnings hitting a record high. Choosing its most significant defense at the peak of its own operational performance is, in itself, a judgment.

Consumer End's Tale of Two Cities, Confirming Macro Divergence

Consumer data provides micro-level evidence for the above macro judgment. Whirlpool (WHR) plunged 16% after hours on Thursday, with management describing the current environment as a "sharp deterioration in macroeconomic conditions" and announcing "decisive actions" like price increases and accelerated cost-cutting to restore profitability. The chill in the housing and major appliance sectors forms a stark contrast with the heat in the semiconductor sector.

In contrast, DoorDash stated the second quarter is "off to a good start," demand "remains quite strong," and its stock rose about 10%.

This divergence reflects the underlying logic of current consumer behavior: large expenditures (like renovation, appliances) feel like a recession, while small, immediate consumption (like food delivery) remains almost unaffected. Consumers haven't disappeared; they've just become highly selective, aligning perfectly with the corporate conclusion: AI infrastructure investment is accelerating, while traditional durable goods consumption is contracting.

Place these four sets of signals on the same chart: lenders unwilling to pay $100 billion for OpenAI, the options market betting $2.6 trillion on a single-day rally, the financial sector at its weakest relative level since 1998, and Apollo hoarding $40 billion in cash. This does not constitute a judgment of "imminent crash." Scott Brown himself also emphasized that similar warning signals sometimes persist for a long time before being digested by the market, or may not even materialize. But when the primary market, secondary market, leading sector, and top institutions simultaneously give conflicting readings, it at least means that the risk premium corresponding to current price levels has been compressed to a level worth vigilance.

Preguntas relacionadas

QWhat is the core reason for the 40% reduction in SoftBank's margin loan target, secured by OpenAI equity?

AThe core reason is valuation concerns. Some potential lenders are skeptical about how to determine a reasonable valuation for OpenAI as a private company, leading to reduced willingness to fund the full $100 billion target.

QAccording to the Goldman Sachs partner, what historical period does the current market rally closely resemble, and why?

AThe Goldman Sachs partner compares the current market to the 1999 period. The parallel is drawn because both eras are fueled by a narrative of physical bottlenecks — telecom equipment capacity back then and AI compute/scarcity today — providing a tangible, 'real' story to support the frenzy.

QWhat is the significance of the underperformance of the US financial sector (XLF) relative to the S&P 500, as highlighted in the article?

AThe financial sector's significant underperformance (down 6% vs. S&P 500 up 7%) is a leading indicator and a technical warning signal. As the primary provider of economic liquidity, its weakness, partly due to concerns in the private credit market, suggests underlying stress that contradicts the broader market's record highs.

QWhat defensive action has Apollo Global Management taken in its insurance business, and what probability does its CEO assign to an exogenous shock?

AApollo has accumulated approximately $40 billion in cash reserves within its insurance business as a defensive measure. CEO Marc Rowan estimates the probability of an exogenous shock occurring at 30-35%, which is significantly higher than the typical level.

QHow does the divergence in consumer spending between Whirlpool and DoorDash illustrate the current macroeconomic split?

AThe divergence shows that consumers are becoming highly selective. Large-ticket, durable goods spending (like Whirlpool appliances) is feeling recessionary pressures, while smaller, immediate consumption (like DoorDash delivery) remains strong. This mirrors the business split where AI infrastructure investment booms while traditional capital goods spending contracts.

Lecturas Relacionadas

AI PC Battle: Bet on the Toll Booth, Not the Camp

**Title:** The AI PC Battle: Don't Bet on Sides, Bet on the Tollbooth **Summary:** The AI PC competition is moving beyond simple "x86 vs. Arm" narratives. The core investment thesis should focus on identifying which players can sustain margins, cash flow, and pricing power throughout the upgrade cycle, rather than backing a particular architecture. The opportunity is analyzed in three layers: 1. **The Advanced Foundry Tollbooth:** TSMC is positioned to collect "tolls" regardless of which chip designer wins, due to its dominant ~70% share in advanced semiconductor manufacturing, which is essential for high-end AI PC chips. 2. **Compute & Platform Spillover:** AMD represents an offensive in the x86 CPU+GPU space, while NVIDIA leverages its GPU and CUDA software stack dominance. Both benefit from the demand for increased local AI compute. 3. **Architecture Diffusion & Turnaround Plays:** ARM and Intel offer potential for significant upside (elasticity), but investments here require stricter discipline due to higher execution risks and competitive challenges. The industry is transitioning from concept to shipment validation. While short-term forecasts for AI PC adoption have been revised down slightly due to tariffs and procurement delays, the long-term trend towards AI becoming a standard PC feature remains intact. The key driver for upgrade cycles will be whether compelling enterprise applications (e.g., privacy-sensitive computing, low-latency inference) emerge beyond consumer-focused features like meeting summarization. Investment strategy should prioritize companies with platform-level advantages and recurring revenue streams. TSMC offers high certainty as the foundational tollbooth. AMD presents a strong offensive play within the established ecosystem. ARM and Intel are higher-risk, higher-potential-reward turnaround bets. The report cautions against chasing short-term hype and emphasizes a disciplined, long-term approach focused on buying ecosystem strength and cash-flow certainty after market enthusiasm subsides. **Key Risks:** Underwhelming AI PC applications slowing upgrade cycles; slow improvement in Windows on Arm compatibility; macro/tariff impacts on PC demand; potential advanced node supply-demand mismatches affecting TSMC; high overall AI sector valuations making stocks vulnerable to a risk-off shift in markets.

marsbitHace 13 min(s)

AI PC Battle: Bet on the Toll Booth, Not the Camp

marsbitHace 13 min(s)

Ten-Thousand-Word Analysis: From $10 to $290, MRVL Wins the Entire AI Era by 'Not Making GPUs'

Marvell Technology's stock price surged from under $10 in 2016 to a record $290 in June 2026, fueled not by making GPUs, but by dominating AI infrastructure connectivity. This analysis argues the market misvalues MRVL as merely a smaller Broadcom in custom AI chips, overlooking its true, unique position. Marvell's core strength lies in enabling high-speed data flow for AI clusters through three interconnected businesses. First, it holds a commanding ~70% market share in high-speed optical DSPs (essential for data center light modules), a deep-moat business with accelerating growth. Second, its custom AI chip design business serves hyperscalers like AWS, Microsoft, and Google, with a significant revenue pipeline despite lower margins. Third, stable cash flows come from Ethernet switch chips and enterprise storage controllers. Together, they form a full-stack "AI data movement" platform. CEO Matt Murphy's transformative leadership since 2016, involving strategic divestments, key acquisitions (like Inphi for optical DSPs), and securing long-term agreements with major cloud providers, repositioned the company. A pivotal $2 billion strategic investment from NVIDIA in 2026 underscored Marvell's critical role in the AI ecosystem, particularly through collaborations like NVLink Fusion. While Marvell faces risks—including client concentration (losing the Amazon Trainium3 design), lower-margin business mix, competitive threats, insider selling, and complex supply chains—its fundamentals remain strong. The optical interconnect moat is widening with the acquisition of Celestial AI (photonics fabric), and financial metrics show accelerating revenue growth and operating leverage. With a PEG ratio suggesting undervaluation relative to its growth, the thesis is that the market undervalues Marvell's monopolistic position in AI "plumbing" while overemphasizing its competitive custom chip segment. The story transcends investing, symbolizing how in any complex system—from the internet to AI—the value of "connection" ultimately surpasses that of individual "nodes."

marsbitHace 43 min(s)

Ten-Thousand-Word Analysis: From $10 to $290, MRVL Wins the Entire AI Era by 'Not Making GPUs'

marsbitHace 43 min(s)

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

A discussion on Zhihu about "AI relay stations" shifted the niche developer topic of "cheap tokens" into broader user awareness. Users moved beyond simply questioning the legitimacy of these services to focus on practical concerns: Where do cheap tokens truly come from? Is the model being accessed the real one? Can relay stations see prompts, code, and API keys? For occasional users, are the risks worth it? The core debate centered less on price and more on trust. A primary worry is model authenticity—the risk of "model swapping," where users paying for a premium model might be routed to a cheaper one, creating an information asymmetry. Others argued that cost comparisons matter; while cheaper than official pay-as-you-go APIs, relay stations may not be the lowest-cost option versus subscriptions, domestic models, or free tiers, making user needs assessment crucial. Speculation about token sources ranged from legitimate bulk discounts to gray-area methods like account sharing or exploiting regional pricing. This opacity makes risk assessment difficult for users. Data security emerged as a critical concern, especially for enterprise use. When processing sensitive information like code, contracts, or client data, the inability to verify a relay station's data handling, retention, or access policies poses significant compliance and confidentiality risks. The evolving consensus suggests relay stations can be used cautiously for low-sensitivity, disposable tasks (e.g., summarizing public info, simple translation). However, they should not be the default for sensitive, professional, or production workflows involving proprietary data, Agents, or automated systems. Recommendations include avoiding large prepayments, not relying on a single service, using test prompts to monitor quality, anonymizing data where possible, and keeping official channels as backups. Ultimately, the discussion framed tokens not just as a billing unit but as a measure of real cost encompassing price, model integrity, data security, and service stability. The popularity of relay stations highlights user demand for affordable access, but the debate underscores a key trade-off: the savings from cheap tokens may come at the price of trust, transparency, and control over one's data and AI experience.

marsbitHace 1 hora(s)

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

marsbitHace 1 hora(s)

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

In 2026, the crypto industry is undergoing a profound infrastructure-level transformation—TradFi assets are migrating on-chain at an unprecedented pace. According to CoinGecko's Q1 2026 report, the total value locked (TVL) of tokenized real-world assets (RWA) has surpassed $31 billion, a nearly 4x increase from $7.8 billion at the beginning of 2025, with the sector’s aggregate market capitalization reaching $19.3 billion. Among these, the market cap of tokenized stocks surged from $2 million to $486 million, with Q1 spot trading volume reaching $15.1 billion—a single quarter already surpassing the entire second half of 2025. RWA perpetual contract Q1 trading volume reached a staggering $524.8 billion, far exceeding the $313 billion for all of 2025. Meanwhile, BlackRock's BUIDL fund has reached $2.3 billion in scale and has filed for two new tokenized funds, signaling that the world's largest asset manager's tokenization strategy is evolving from pilot to product suite expansion. HTX, as a core participant in the crypto exchange sector, officially launched TradFi perpetual futures products including NVDA, AAPL, MSFT, META, and SPY in 2026, enabling crypto users to gain 24/7 trading access to core U.S. equities. Boston Consulting Group predicts that global tokenized asset scale could reach $16 trillion by 2030, while McKinsey offers a conservative estimate of approximately $2 trillion. The on-chain migration of TradFi assets is no longer a "future narrative" but a structural transformation unfolding in real time, as crypto exchanges evolve from single crypto asset trading platforms toward "multi-asset-class trading infrastructure."

HTX LearnHace 1 hora(s)

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

HTX LearnHace 1 hora(s)

Trading

Spot
Futuros
活动图片