Google DeepMind Releases Lyria 3 Pro: AI Music Transitions from "30-Second Previews" to Full Songs

marsbitPublished on 2026-03-26Last updated on 2026-03-26

Abstract

Google DeepMind has launched Lyria 3 Pro, a significant upgrade from the previous version released just six weeks prior. The key improvement is the extension of AI-generated music from 30 seconds to 3 minutes, alongside a new "structure-aware" capability. This allows the model to understand and construct a song's internal composition—such as intros, verses, choruses, and bridges—based on user's text, image, or video prompts, marking a shift from a simple generator to a creative tool. While competitors like Suno and Udio have offered similar features since early 2025, Google's entry into this competitive space is notable due to its vast distribution reach via the Gemini ecosystem. The model supports multiple languages, includes vocals and instruments, and automatically adds SynthID watermarks for AI content identification. Access is tiered for paid Gemini users (AI Plus, Pro, Ultra), with daily generation limits, while free users remain on the 30-second Lyria 3. It is also available for developers via API and for enterprises through Vertex AI. However, copyright concerns regarding training data remain unresolved, similar to ongoing industry disputes.

Google DeepMind launched Lyria 3 Pro on March 25. Just six weeks after the release of the previous version, Lyria 3, this upgrade focuses on one core improvement: extending the generation duration from 30 seconds to 3 minutes, while enabling the model to truly understand the internal structure of a song.

This leap is not a minor iteration. Thirty seconds is sufficient for generating background sound effects but not enough to compose a song—there are no sections, no transitions, no climax. The newly added "structure-aware" capability in Lyria 3 Pro allows users to specify intros, verses, choruses, and bridges in their prompts, and the model arranges the transitions and dynamic changes accordingly. This marks a critical step for AI music tools evolving from "generators" to "creative tools."

Suno and Udio Have Been Doing This for a Year

Frankly, Suno and Udio already possessed this capability in early 2025, with both offering longer generation durations and more flexible structural control. Google catching up at this point signifies its true entry into competitive mode in the AI music race—backed by the distribution power of the Gemini ecosystem, Lyria 3 Pro will reach a much broader user base than any standalone music AI tool.

The simultaneous opening of Vertex AI is another signal: Google isn't just aiming for consumer tools but also plans to embed Lyria into enterprise workflows.

What It Can Do

Inputs support text, images, and videos, with the model automatically matching the music style to the emotional content. Generated content includes vocals, lyrics, and instruments, covering multiple languages. All outputs automatically embed SynthID watermarks to indicate AI origin—a consistent practice by DeepMind for content溯源.

Who Can Use It and How

Gemini App paid users can now access it. Usage is tiered by plan: AI Plus allows about 10 songs per day, Pro about 20, and Ultra about 50. Free users remain on the 30-second version of Lyria 3.

Supported languages include English, Japanese, Korean, Hindi, Spanish, Portuguese, German, French, and others, limited to users aged 18 and above. Access path: Gemini App → Create Music → Choose "Thinking" or "Pro" mode.

Developers can integrate via Google AI Studio and Gemini API; Vertex AI is in public preview for enterprise-level on-demand generation scenarios. Integration has begun with Google Vids and its music production tool ProducerAI. Enterprise Workspace support is expected within days.

Copyright Issues Remain Unresolved

Google states that the use of training data follows relevant agreements with artists but has not disclosed specific sources or authorization scope. This sits against the same backdrop of copyright lawsuits faced by Suno and Udio—legal disputes over AI music training data are still unresolved in the industry, and Google's statement is more of a position declaration than a complete answer.

Lyria 3 Pro is currently being rolled out gradually to users, with possible delays in some regions.

Related Questions

QWhat is the key improvement of Google DeepMind's Lyria 3 Pro compared to its predecessor?

AThe key improvement is that Lyria 3 Pro extends the generation length from 30 seconds to 3 minutes and introduces a 'structure-aware' capability, allowing the model to understand and arrange the internal structure of a song, such as intros, verses, choruses, and bridges.

QHow does Lyria 3 Pro's 'structure-aware' capability benefit users?

AIt allows users to specify structural elements like intros, verses, choruses, and bridges in their prompts. The model then arranges the transitions and dynamic changes between these sections, making it a more powerful creative tool rather than just a generator.

QWhich competitors already had similar capabilities before Lyria 3 Pro, and what advantage does Google have?

ASuno and Udio had similar capabilities for generating longer, structurally controlled music since early 2025. Google's advantage lies in its distribution power through the Gemini ecosystem, which gives Lyria 3 Pro a much larger potential user base.

QWhat are the usage limits for Lyria 3 Pro based on Gemini subscription tiers?

AAI Plus users can generate about 10 songs per day, Pro users about 20 songs, and Ultra users about 50 songs. Free users are still restricted to the 30-second version of Lyria 3.

QHow does Google address copyright concerns related to Lyria 3 Pro's training data?

AGoogle states that the use of training data follows relevant agreements with artists, but it has not disclosed specific sources or the scope of authorization. This is part of an ongoing industry-wide legal debate about AI music training data, and Google's statement is more of a position declaration than a complete resolution.

Related Reads

From Code to Cognition: A Ten-Thousand-Word Guide to the Evolution of the Robot Brain

"From Code to Cognition: The Evolution of Robot Brains" The journey of robotic intelligence has shifted dramatically from manually coded systems to AI-driven brains. For decades, robots relied on layered software stacks—perception, state estimation, planning, control—each handcrafted. While predictable, they lacked adaptability. The 2010s saw deep learning revolutionize perception (e.g., object detection) and control (via reinforcement learning), but learned skills remained narrow. The arrival of Large Language Models (LLMs) marked a turning point. LLMs acted as high-level planners, interpreting natural language instructions and generating sequences of actions for traditional robotic systems to execute. However, true integration came with Visual-Language-Action (VLA) models, which fused vision, language, and motion prediction into a single network. Pioneered by models like RT-2 and open-source projects like OpenVLA, VLAs enable robots to reason and act directly from visual input and commands. The most advanced humanoid robots now employ a "dual-brain" architecture: a slow-thinking, large VLA (System 2) for reasoning and planning, and a fast-reacting, small network (System 1) for high-frequency motion control, sometimes with an even lower-level System 0 for balance. This split balances cognition with the physics of real-time movement. Computation is split between onboard hardware (e.g., NVIDIA Jetson) for safety-critical control loops and cloud/edge servers for non-critical tasks like learning and interfaces. A crucial driver is the open-source ecosystem—models like GR00T and OpenVLA allow startups to build upon pre-trained brains and fine-tune them with their own data, accelerating development. Despite progress, current systems struggle with recovery from errors, sample inefficiency, and long-horizon tasks. This has spurred the rise of **World Models**—neural networks that predict the consequences of actions. By simulating possible futures before acting (like NVIDIA Cosmos or Meta V-JEPA), robots can plan, recover, and generalize better. This represents the next frontier: shifting intelligence from learned reactions to an internal model of physics and cause-and-effect. The field is rapidly evolving. While not yet at its "ChatGPT moment," the convergence of cheaper hardware, scalable simulation, and world models points toward robots that are increasingly capable, adaptive, and useful. The question is shifting from "what can robots do?" to "what *should* they do?"

marsbit4m ago

From Code to Cognition: A Ten-Thousand-Word Guide to the Evolution of the Robot Brain

marsbit4m ago

AI Bubble Is Bursting

The AI Bubble is Bursting: A Necessary Purge on the Path to Ubiquitous Intelligence Market volatility has reignited debates about an AI bubble, with figures like Ray Dalio pointing to high valuations. However, this parallels the dot-com bubble, which, despite its crash, laid the physical infrastructure for today's internet era. The current AI investment frenzy, with tech giants planning trillions in infrastructure spending far outstripping current AI application revenues, appears similarly imbalanced. This 'bubble' is seen as an inevitable phase for a disruptive technology, paying the "innovation tax." Critically, AI inference costs have plummeted over 99.7% since 2023, making intelligence nearly free at the margin. This hasn't reduced spending but has instead unlocked massive new demand, as seen in enterprise AI cloud expenditure tripling. This follows the Jevons Paradox: efficiency gains lead to greater total consumption. The market is now entering a cleansing phase, weeding out speculative ventures lacking real moats. The deeper shift is a move from capital expenditure (CapEx) on hardware to value creation in operational expenditure (OpEx) through AI applications that solve real industry problems. While infrastructure valuations are high, rapid earnings growth from widespread AI adoption across sectors—from manufacturing and finance to law and healthcare—may digest these valuations over time. Ultimately, this creative destruction will leave behind robust infrastructure and optimized models, cheaply powering an AI-augmented future for all industries, much as the internet became indispensable after its own bubble burst. The core productive potential remains undiminished.

链捕手14m ago

AI Bubble Is Bursting

链捕手14m ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片