人工智能公司Synthesia推出数字视频化身——这是它们的工作原理

币界网2024-08-09 tarihinde yayınlandı2024-08-09 tarihinde güncellendi

币界网报道:

你曾经想要一个数字双胞胎吗?或者害怕重新录制一系列视频?就像科幻故事情节一样,总部位于伦敦的合成媒体开发公司Synthesia推出了一项服务,让客户能够创建自己的数字视频表示,他们可以在社交媒体和营销活动中使用。

Synthesia在周三的直播中首次宣布推出个人化身。Synthesia声称,这些化身可以通过网络摄像头或手机上的两分钟视频生成,尽管我发现创建它们需要更长的时间。

该公司表示,其个人AI化身可以用来制作员工培训视频、产品解释、销售和营销材料,或与客户互动。对企业的好处是,这些不同的视频可以使用相同的人工智能化身创建,而无需在全新的面对面会话中重新录制演员或员工。

然而,也有一些局限性,我将深入探讨。

防止误用

联觉认识到,创建真人高保真复制品的能力可能会被滥用,用于恶意或欺骗性目的。Synthesia告诉Decrypt,它通过以下三个原则来防止其技术被用于创建人工智能生成的deepfakes:同意、控制和协作。

Synthesia公司事务和政策主管Alexandru Voica表示:“未经他人明确同意,我们绝不会创建人工智能化身。”。“我们的平台为用户提供了一个安全的环境,确保他们的数据是安全的,他们可以控制自己的化身,并通过在创建时进行内容审核来最大限度地减少滥用。”

就Voica而言,在化身生成开始之前,用户被要求同意Synthesia Limited及其供应商收集、使用、存储和披露他们的视频和音频记录,以“验证个人化身提交”

Voica表示,Synthesia还与行业同行、政策制定者和其他人合作,为负责任地使用人工智能制定最佳实践。

沃伊卡说:“未经同意的深度伪造是网上有害内容的最大来源。”。“因为Synthesia化身不能在没有他们所代表的人的明确同意的情况下制作,所以我们不从事未经同意的deepfakes业务,这大大限制了我们平台和个人化身被滥用的可能性。”

当被问及是否有Synthesia不允许的个人化身或一般化身时,Voica表示,该公司使用先进的技术过滤器和人类内容审核来确保Synthesia不会被用来促进创建不恰当或有害的内容。

Voica告诉Decrypt:“当有人试图制作视频时,该内容将在生成之前通过我们的内容审核工作流程。”。“如果发现违反了我们的政策,视频永远不会被创建。屡次违规或严重违规也可能导致他们的帐户被禁用。”

创建您的头像

要开始使用,用户必须创建Synthesia帐户。个人化身可在“初学者”、“创建者”和“企业”级别上使用。Starter和Creator帐户的年度订阅费用分别为每月18美元至59美元。寻求企业许可的企业必须联系Synthesia以确定定价。

这个过程在我的MacBook Pro上比在我的Windows 11 PC上运行得更好,两者都使用Brave浏览器。当你准备好了,Synthesia会询问你是想直接从平台录制还是上传视频。我选择了录音。

根据Synthesia的说法,个人化身是使用一种称为自动对齐的高级循环技术创建的,该技术可以确定化身何时说话,并使身体动作更加灵敏。个人化身可用的语言包括英语、德语、法语、西班牙语、阿拉伯语、克罗地亚语、菲律宾语、希腊语、印地语、意大利语、罗马尼亚语、俄语、土耳其语和乌克兰语。

在录制之前,Synthesia建议使用安静、光线充足的环境,在段落之间停顿,不要遮住脸,使用自然的肢体语言,保持积极和微笑。

根据我使用该工具的经验,其他建议是确保相机不要太近,这样你在画面中看起来会更小。如果您决定上传视频而不是使用录音机,请使用麦克风以提高音频质量。

Synthesia会给你一个脚本来阅读,背诵大约需要三分钟。根据化身生成器的说法,完成这个过程需要一到五分钟的音频和视频。

对于可能对公司或内容创作者的品牌产生重大影响的事情来说,这个过程相对简单。

虽然上传、录制和阅读剧本的过程不到五分钟,但Synthesia表示,实际生成个人头像可能需要24小时。我的第一个视频花了大约10个小时才制作出来。之后,我提供的后续提供新脚本的视频大约需要五分钟。

这是我在Synthesia的个人头像。你怎么认为?

完成的视频的大小可以更改,以适应其上传到的平台的需求,无论是YouTube、Instagram还是TikTok。我发现,使用个人头像的新脚本生成新视频大约需要五分钟。

然而,如果不生成新的个人头像,你就无法更改你的背景、服装或外表,如上所述,这可能需要一天的时间。

虽然个人化身非常令人印象深刻,但看到自己的动画版本很奇怪,这是“恐怖谷”的一个例子。恐怖谷是指人类逼真的表现几乎太接近真实事物的地方,使其外观更加令人不安。

尽管如此,这些视频化身仍然令人印象深刻,如果观众不仔细检查,它们可能会被视为真实的。

由Ryan Ozawa编辑。

İlgili Okumalar

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

NVIDIA is set to launch its new Vera AI data center CPU in China as early as August, with high pricing. While this move offers a new option, it highlights China's continued dependence on foreign-controlled Arm architecture. In response, the Chinese semiconductor industry is increasingly turning to RISC-V as a strategic alternative for achieving high-performance computing autonomy. The article explores the concept of the "impossible triangle" in CPU development—balancing prosperity, control, and autonomy—and posits that RISC-V's open-source, modular nature offers a unique path to achieving all three. While RISC-V is already dominant in embedded systems, the focus is now shifting to data centers and AI workloads. China has become a global hotspot for RISC-V development, driven by AI-driven compute demand, supply chain concerns from export controls, cost benefits of open-source, and strong policy support. Multiple Chinese companies have reportedly crossed the key performance threshold of 15 SPECint per GHz, a benchmark for entering the high-performance CPU club. Progress extends beyond single-core benchmarks. Companies are developing complete computing subsystems, including commercial-grade coherent network-on-chip (NoC) technology and server processors with up to 40 cores that strictly adhere to the RVA23 standard to ensure software compatibility. Real-world applications are emerging in areas like video transcoding and edge AI. However, significant challenges remain. The RISC-V ecosystem faces fragmentation, immature toolchains and verification processes, and gaps in single-core performance and energy efficiency compared to mature x86 and Arm architectures. The formidable software moat, epitomized by NVIDIA's CUDA, is a long-term hurdle. In conclusion, while RISC-V cannot immediately replace offerings like NVIDIA's Vera, it represents a viable long-term path for China to develop a self-sufficient, high-performance CPU ecosystem. The journey is acknowledged to be long and arduous, requiring sustained effort to overcome technical and ecosystem challenges.

marsbit13 dk önce

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

marsbit13 dk önce

My Coding Betting Dashboard is Profiting, but Polymarket is Truly Not a Good Place for 'Arbitrage'

The author built a custom monitoring dashboard for Polymarket, a prediction market platform, and tested it with $1,600, achieving over 30% returns. However, the core argument is that Polymarket is not a good venue for traditional arbitrage. The dashboard has two main sections: a "Portfolio Dashboard" for tracking active positions with key metrics like total capital, P&L, and a risk-control module using a tier system (T1, T2, T3), and an "Opportunity Watchlist" for monitoring markets. The article details a critical structural trap in binary markets: a bet with a high perceived probability of success still carries a 100% loss risk if wrong. The author's T1/T2/T3 system is designed to manage this by limiting position sizes based on conviction and time horizon, emphasizing that high confidence should not equal high concentration. A key insight is the danger of "pseudo-diversification"—betting on different markets driven by the same underlying variable. The author concludes that Polymarket offers few true low-risk, arbitrage opportunities. It is instead a high-risk environment where wins can create a false sense of mastery, leading to large losses. The platform is better viewed as a training ground for honing judgment through disciplined, framework-driven betting rather than a reliable income source. The tools help transform intuition into structured, rule-based decisions to mitigate the risk of catastrophic errors.

marsbit3 saat önce

My Coding Betting Dashboard is Profiting, but Polymarket is Truly Not a Good Place for 'Arbitrage'

marsbit3 saat önce

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

**"WeChat AI Card" Practical Test Guide: Has the Era of AI Shopping Arrived?** WeChat has officially launched the "AI Exclusive Card," a feature integrated into its Workbuddy AI assistant. This card is designed to handle payments for AI-initiated purchases. Our hands-on test reveals it's not yet a tool for fully autonomous AI shopping, but rather a controlled payment layer for AI agents. The AI Card functions as an isolated sub-wallet within WeChat Pay. Users must bind the card and transfer funds into it from their main wallet. Crucially, every transaction requires explicit user confirmation via smartphone scan; AI cannot spend autonomously. Currently accessible through the Workbuddy agent, the card targets specific digital consumption scenarios: purchasing paid content (reports, data), calling paid APIs/tools, and subscribing to services. Its design prioritizes security and control by separating funds and mandating approval for each payment. We tested a real-world scenario: ordering bubble tea via Workbuddy using a "Meituan Life Assistant" skill. The process encountered multiple hurdles: high "skill" usage costs (exceeding daily free credits), and most importantly, while a payment was successfully initiated, the AI purchased an incorrect product (a mismatched group-buy coupon instead of the desired drink). This highlights the current limitation: the **AI Card only solves the payment step**. The broader challenge lies in the **AI agent's execution chain**—accurately understanding intent, navigating third-party platforms, selecting the right product, and ensuring proper fulfillment. The payment succeeded, but the purchase failed to meet the user's need. In conclusion, the WeChat AI Exclusive Card is a cautious, early-step experiment in AI commerce. It provides a secure, user-controlled payment method for agent interactions but is not yet capable of reliable, end-to-end complex purchases. For now, it's best used for low-value, low-risk digital services with careful user verification at each step. The vision of AI handling complete shopping tasks remains a work in progress.

marsbit5 saat önce

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

marsbit5 saat önce

Deconstructing Notion's Growth: From a Note-taking Tool to 100 Million Users—How Notion Built a Triple Growth Flywheel Through Product, Templates, and Community

Notion's growth from a niche note-taking tool to a platform with 100 million users is powered by three interconnected flywheels: Product-Led Growth (PLG), a Template Economy, and Community-Driven Growth. First, Notion's PLG strategy relies on a highly flexible, "plastic" product that users can adapt to countless personal and team workflows. Its freemium model lowers the barrier to entry, while features like page sharing and collaboration drive organic, usage-based viral growth as users naturally invite others. Second, the Template Economy solves the "blank page" problem. Templates, created by both Notion and its community, transform abstract product capabilities into concrete, copyable solutions for specific scenarios (e.g., project management, content calendars). This dramatically lowers activation costs for new users and fuels SEO-driven discovery. Third, a vibrant Community acts as a distributed growth engine. Users and official Ambassadors create tutorials, share use cases, and host local events. This community not only educates users but also fosters a sense of identity around pursuing "better ways of working," strengthening loyalty and enabling global, low-cost expansion. Together, these flywheels create a self-reinforcing ecosystem: a great product attracts users who create templates and community content, which in turn attracts more users and deepens engagement. This system allowed Notion to scale from individuals to teams and enterprises through a bottom-up adoption path. Looking ahead, AI integration promises to accelerate these flywheels further by making templates smarter and the platform a potential AI-native work operating system. Ultimately, Notion's defensible advantage is not just its features, but this deeply entrenched network of user assets, creators, and community trust.

marsbit5 saat önce

Deconstructing Notion's Growth: From a Note-taking Tool to 100 Million Users—How Notion Built a Triple Growth Flywheel Through Product, Templates, and Community

marsbit5 saat önce

İşlemler

Spot
Futures
活动图片