Computing Power Crisis: Google Quietly Imposes Usage Caps on Meta for Gemini
Google has quietly imposed usage caps on Meta's access to its Gemini AI models since around March due to surging demand overwhelming its computational infrastructure, according to a Financial Times report. The limits, which remain in place, have disrupted and delayed several of Meta's internal AI projects, forcing the social media giant to ration AI usage and improve efficiency.
This reflects a broader industry-wide shortage of AI inference capacity, as companies deploy more chatbots and AI agents. Google CEO Sundar Pichai acknowledged compute constraints are limiting cloud revenue growth. In response, Google recently signed a $920 million monthly compute leasing deal with SpaceX to expand capacity.
The restrictions have accelerated Meta's shift toward its own AI models, such as Muse Spark, to reduce dependence on external providers like Google. While other Google clients also face limits, Meta's vast scale made it particularly affected. The situation highlights how the AI infrastructure bottleneck has shifted from model training to inference, requiring massive new capital investments to resolve.
marsbitHace 10 hora(s)