AI Company Synthesia Launches Digital Video Avatars: Here's How They Work

币界网 | Published 2024-08-09 | Last updated 2024-08-09

币界网 reports:

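Have you ever wanted a digital twin? Or dreaded re-recording a series of videos? Like something out of a science fiction plot, London-based synthetic media developer Synthesia has launched a service that lets customers create digital video representations of themselves for use on social media and in marketing campaigns.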

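Synthesia first announced personal avatars during a livestream on Wednesday. The company claims the avatars can be generated from a two-minute video recorded on a webcam or phone, although I found that creating them takes longer.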

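The company says its personal AI avatars can be used to produce employee training videos, product explainers, and sales and marketing materials, or to interact with customers. The benefit for businesses is that all of these videos can be created with the same AI avatar, without re-recording an actor or employee in an entirely new in-person session.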

There are some limitations, however, which I'll get into.

Preventing misuse

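Synthesia recognizes that the ability to create high-fidelity replicas of real people could be abused for malicious or deceptive purposes. Synthesia told Decrypt that it guards against its technology being used to create AI-generated deepfakes through three principles: consent, control, and collaboration.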

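"We will never create an AI avatar without a person's explicit consent," said Alexandru Voica, head of corporate affairs and policy at Synthesia. "Our platform gives users a secure environment, ensuring that their data is safe, that they stay in control of their avatars, and that misuse is minimized through content moderation at the point of creation."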

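According to Voica, before avatar generation begins, users are asked to consent to Synthesia Limited and its vendors collecting, using, storing, and disclosing their video and audio recordings in order to "verify personal avatar submissions."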

Voica said Synthesia also works with industry peers, policymakers, and others to develop best practices for the responsible use of AI.

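"Non-consensual deepfakes are the largest source of harmful content online," Voica said. "Because Synthesia avatars cannot be made without the explicit consent of the person they represent, we are not in the business of non-consensual deepfakes, which greatly limits the potential for our platform and personal avatars to be misused."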

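Asked whether there are personal avatars or general avatars that Synthesia does not allow, Voica said the company uses advanced technical filters and human content moderation to ensure Synthesia is not used to facilitate the creation of inappropriate or harmful content.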

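"When someone tries to make a video, that content goes through our content moderation workflow before it is generated," Voica told Decrypt. "If it is found to violate our policies, the video is never created. Repeated or severe violations can also lead to their account being disabled."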

Creating your avatar

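To get started, users must create a Synthesia account. Personal avatars are available on the Starter, Creator, and Enterprise tiers. On an annual subscription, Starter and Creator accounts cost $18 and $59 per month, respectively. Businesses seeking an Enterprise license must contact Synthesia for pricing.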

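The process ran more smoothly on my MacBook Pro than on my Windows 11 PC, both using the Brave browser. When you're ready, Synthesia asks whether you want to record directly from the platform or upload a video. I chose to record.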

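According to Synthesia, personal avatars are created using an advanced looping technique called auto alignment, which determines when the avatar is speaking and makes its body movements more responsive. Languages available for personal avatars include English, German, French, Spanish, Arabic, Croatian, Filipino, Greek, Hindi, Italian, Romanian, Russian, Turkish, and Ukrainian.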

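Before recording, Synthesia recommends using a quiet, well-lit environment, pausing between paragraphs, keeping your face uncovered, using natural body language, and staying positive and smiling.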

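Based on my experience with the tool, I would add a couple of suggestions. Make sure the camera is not too close, so that you appear smaller in the frame. And if you decide to upload a video rather than use the built-in recorder, use a microphone for better audio quality.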

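Synthesia gives you a script to read, which takes about three minutes to recite. According to the avatar generator, the process requires between one and five minutes of audio and video.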

For something that could have a significant impact on a company's or content creator's brand, the process is relatively simple.

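While uploading, recording, and reading the script took less than five minutes, Synthesia says actually generating a personal avatar can take up to 24 hours. My first video took about 10 hours to produce. After that, each follow-up video I generated from a new script took about five minutes.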

This is my Synthesia personal avatar. What do you think?

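The finished video can be resized to fit the requirements of the platform it is being uploaded to, whether that is YouTube, Instagram, or TikTok. I found that generating a new video from a new script with the personal avatar takes about five minutes.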

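However, you cannot change your background, clothing, or appearance without generating a new personal avatar, which, as noted above, can take up to a day.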

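While personal avatars are very impressive, seeing an animated version of yourself is strange, and it is an example of the "uncanny valley." The uncanny valley is the point at which a lifelike representation of a human comes almost too close to the real thing, which makes its appearance more unsettling.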

Still, these video avatars are impressive, and viewers who do not look closely might take them for the real thing.

Edited by Ryan Ozawa.
