Technology Trends

Explores the latest innovations, protocol upgrades, cross-chain solutions, and security mechanisms in the blockchain space. It provides a developer-focused perspective to analyze emerging technological trends and potential breakthroughs.

DeepSeek No Longer Wants to Focus Only on Large Models

DeepSeek, a leading Chinese AI company, has released its new model series DeepSeek-V4, featuring two versions: the high-performance V4-Pro with 1.6 trillion parameters and the cost-efficient V4-Flash. Both support 1 million token context windows and use Mixture-of-Experts (MoE) architecture to improve efficiency. The company continues its strategy of offering competitive pricing, with input tokens priced as low as ¥0.2 per million tokens. A key revelation is DeepSeek’s explicit link between future price reductions and the mass availability of Huawei’s Ascend 950 AI chips in the second half of the year. This signals a strategic shift from relying solely on algorithmic and engineering optimizations to integrating domestic computing power into its core cost structure. DeepSeek has adapted its inference system to run efficiently on both NVIDIA GPUs and Huawei NPUs, potentially challenging NVIDIA's CUDA ecosystem dominance. Concurrently, DeepSeek is reportedly seeking significant external investment, with a pre-money valuation of around ¥300 billion. This move highlights growing pressures in scaling compute infrastructure, retaining top talent—amid recent departures of key researchers—and accelerating commercialization efforts. The company has also updated its consumer app with tiered model access, indicating a stronger product focus. The V4 release underscores that China's AI competition is evolving beyond pure model capability into a broader contest involving compute supply chains, engineering systems, financing, and talent strategy.

marsbit04/25 01:45

DeepSeek No Longer Wants to Focus Only on Large Models

marsbit04/25 01:45

OpenAI Goes Left, DeepSeek Goes Right

On April 24, 2026, DeepSeek released V4, a Chinese large language model offering a free "million-token context window," enabling it to process vast amounts of data like entire books or years of corporate documents in one go. In contrast, OpenAI’s GPT-5.5, released around the same time, is more powerful but significantly more expensive, charging up to $180 per million output tokens. DeepSeek’s strategy represents a shift from a pure AI research firm to a heavy-infrastructure player, building data centers in Inner Mongolia’s Ulanqab to bypass U.S. chip export restrictions. This move, supported by Huawei’s Ascend chips and China’s cheap green electricity, highlights a fundamental divergence in AI development models: U.S. firms focus on high-cost, high-margin services, while Chinese players like DeepSeek prioritize accessibility and affordability. Facing intense talent poaching from tech giants, DeepSeek is seeking a $44 billion valuation funding round to retain researchers and scale infrastructure. Meanwhile, Chinese manufacturers are compressing AI models to run on smartphones, making AI accessible offline and across the Global South. Through open-source models and localized solutions, Chinese AI is empowering non-English speakers and low-income users, driving a form of "digital equality." While Silicon Valley builds walled gardens, DeepSeek and others are turning AI into a public utility—like tap water—flowing freely to those previously left behind.

marsbit04/24 07:33

OpenAI Goes Left, DeepSeek Goes Right

marsbit04/24 07:33

Anthropic Starts Poaching Scientists? $27K Weekly Onsite Stipend to Fix Claude's Expert-Level Errors

Anthropic has launched a new STEM Fellow program, offering $3,800 per week for a three-month, in-person residency in San Francisco. The role targets experts from science, technology, engineering, and mathematics (STEM) fields—machine learning experience is helpful but not required. Instead, Anthropic values scientific judgment and a willingness to learn quickly. Fellows will work with Claude models and internal tools under the guidance of an Anthropic researcher. Example projects include a materials scientist identifying errors in Claude’s reasoning or a climate scientist integrating atmospheric modeling software with Claude. The goal is to have experts "tell Claude where it's wrong" and improve its scientific capabilities. This initiative is part of Anthropic’s broader strategy to strengthen its scientific ecosystem, following earlier programs like the AI Safety Fellows and AI for Science programs. The company acknowledges that current AI models, while powerful, still produce high-confidence errors and lack end-to-end research autonomy. The program aims to embed domain expertise directly into model development, turning scientists into "high-level reviewers" for AI. Anthropic CEO Dario Amodei has previously emphasized AI’s potential to accelerate scientific breakthroughs, particularly in biology and healthcare. The company believes that the next phase of AI competition will depend not on scaling parameters, but on integrating human expertise to refine model accuracy and reliability.

marsbit04/22 07:44

Anthropic Starts Poaching Scientists? $27K Weekly Onsite Stipend to Fix Claude's Expert-Level Errors

marsbit04/22 07:44

Three Frameworks for Ordinary People to Achieve AI Capability Leap: Say Goodbye to the Dilemma of 'Repeating Inputs Every Day'

Summary: This article outlines three frameworks for maximizing AI efficiency, moving beyond basic prompt usage. 1. **Three-Layer Evolution**: Users progress from (1) **Prompt** (one-off instructions, reset each session), to (2) **Project** (context-aware within a specific project), to (3) **Skill** (permanent, auto-applied knowledge). Most users stagnate at the first layer, repeating the same instructions daily with no cumulative improvement. Skills transform the AI from a chat tool into a personalized work system. 2. **Transaction vs. Compound Interest Mindset**: Using prompts is a linear transaction—effort and output are 1:1, and stopping resets progress. Investing time in building Skills is compound interest; a small initial time investment pays continuous dividends, as each Skill permanently elevates the AI's baseline performance. 3. **Thin Harness, Fat Skills**: The system architecture should prioritize thick, well-defined Skills (90% of the value—containing processes, standards, and domain knowledge) and a thin "harness" (the minimal technical environment). Avoid over-engineering the toolchain while neglecting the AI's actual knowledge. Skills are permanent assets that automatically improve with model updates. The key takeaway: Identify tasks you repeat, encode them into Skills (using tools like Claude's Skill Creator), and shift focus from daily prompting to building a compounding, self-improving AI system.

marsbit04/22 06:43

Three Frameworks for Ordinary People to Achieve AI Capability Leap: Say Goodbye to the Dilemma of 'Repeating Inputs Every Day'

marsbit04/22 06:43

活动图片