Technology Trends

Explores the latest innovations, protocol upgrades, cross-chain solutions, and security mechanisms in the blockchain space. It provides a developer-focused perspective to analyze emerging technological trends and potential breakthroughs.

Can Humans Control AI? Anthropic Conducted an Experiment Using Qwen

Can Humans Control Superintelligent AI? Anthropic’s Experiment with Qwen Models Anthropic conducted an experiment to explore whether humans can supervise AI systems smarter than themselves—a core challenge in AI safety known as scalable oversight. The study simulated a “weak human overseer” using a small model (Qwen1.5-0.5B-Chat) and a “strong AI” using a more powerful model (Qwen3-4B-Base). The goal was to see if the strong model could learn effectively despite imperfect supervision. The key metric was Performance Gap Recovered (PGR). A PGR of 1 means the strong model reached its full potential, while 0 means it was limited by the weak supervisor. Initially, human researchers achieved a PGR of 0.23 after a week of work. Then, nine AI agents (Automated Alignment Researchers, or AARs) based on Claude Opus took over. In five days, they improved PGR to 0.97 through iterative experimentation—proposing ideas, coding, training, and analyzing results. The findings suggest that, in well-defined and automatically scorable tasks, AI can help overcome the supervision gap. However, the methods didn’t generalize perfectly to unseen tasks, and applying them to a production model like Claude Sonnet didn’t yield significant improvements. The study highlights that while AI can automate parts of alignment research, human oversight remains essential to prevent “gaming” of evaluation systems and to handle more complex, real-world problems. Anthropic chose Qwen models for their open-source nature, performance, scalability, and reproducibility—key for rigorous and repeatable experiments. The research demonstrates progress toward automated alignment tools but also underscores that AI supervision remains a nuanced, human-AI collaborative effort.

marsbit04/15 09:28

Can Humans Control AI? Anthropic Conducted an Experiment Using Qwen

marsbit04/15 09:28

Only Work 2 Hours a Day? This Google Engineer Uses Claude to Automate 80% of His Work

A Google engineer with 11 years of experience automated 80% of his work using Claude Code and a simple .NET application, reducing his daily work from 8 hours to just 2–3 hours while generating $28,000 in monthly passive income. The key to this transformation lies in three core elements: First, using a structured CLAUDE.md file based on Andrej Karpathy’s principles—Think Before Coding, Simplicity First, Surgical Changes, and Goal-Driven Execution—reduces Claude’s rule violations from 40% to just 3%. Second, the "Everything Claude Code" system acts as a full AI engineering team, with 27 pre-built agents for planning, reviewing, and executing tasks across multiple AI platforms. Third, a hidden token consumption issue in Claude Code v2.1.100 was identified, where 20,000 extra tokens were silently added, diluting instructions and reducing output quality. A quick fix using npx downgrades the version to avoid this. The automated system enables code generation, testing, and review to run autonomously in 15-minute cycles. The engineer now only reviews output, saving 5–6 hours daily. The setup takes less than 20 minutes, and the return on time investment is significant—potentially saving $10,000–$12,000 monthly for those valuing their time at $100/hour. The article emphasizes that managing AI systems, not just using them, is the new critical skill, enabling a shift from doing work to overseeing automated processes.

marsbit04/15 04:10

Only Work 2 Hours a Day? This Google Engineer Uses Claude to Automate 80% of His Work

marsbit04/15 04:10

Tsinghua's Prediction 2 Years Ago Is Becoming Global Consensus: Meta and Two Other Major AI Institutions Have Reached the Same Conclusion

Summary: In a remarkable validation of Chinese AI research, Meta and METR have independently reached conclusions that align perfectly with the "Density Law" proposed by a Tsinghua University and FaceWall Intelligent team two years ago. Published in Nature Machine Intelligence in late 2025, the law states that the computational power required to achieve a specific level of AI performance halves every 3.5 months. This convergence was starkly evident in April 2026. METR reported that AI capabilities are doubling every 88.6 days, while Meta's new model, Muse Spark, demonstrated it could match the performance of a model from the previous year using less than one-tenth of the training compute. When plotted, the growth curves from all three sources—using different metrics (parameters, compute, task length)—show an almost identical exponential slope. The findings have profound implications: AI inference costs are collapsing faster than anticipated, powerful edge-computing AI is becoming rapidly feasible, and the industry's strategy of simply scaling model size is becoming economically inefficient. The Chinese team, which has been building its "MiniCPM" model series based on this law since 2024, is seen as having a significant two-year lead in practical engineering experience, marking a rare instance where Chinese researchers pioneered a fundamental predictive trend in AI.

marsbit04/13 12:14

Tsinghua's Prediction 2 Years Ago Is Becoming Global Consensus: Meta and Two Other Major AI Institutions Have Reached the Same Conclusion

marsbit04/13 12:14

Thin Harness, Fat Skills: The True Source of 100x AI Productivity

The article "Thin Harness, Fat Skills: The True Source of 100x AI Productivity" argues that the key to massive productivity gains in AI is not more advanced models, but a superior system architecture. This framework, "fat skills + thin harness," decouples intelligence from execution. Core components are defined: 1. **Skill Files:** Reusable markdown documents that teach a model *how* to perform a process, acting like parameterized function calls. 2. **Harness:** A thin runtime layer that manages the model's execution loop, context, and security, staying minimal and fast. 3. **Resolver:** A context router that loads the correct documentation or skill at the right time, preventing context window pollution. 4. **Latent vs. Deterministic:** A strict separation between tasks requiring AI judgment (latent space) and those needing predictable, repeatable results (deterministic). 5. **Diarization:** The critical process where the model reads all materials on a topic and synthesizes a structured, one-page summary, capturing nuanced intelligence. The architecture prioritizes pushing intelligence into reusable skills and execution into deterministic tools, with a thin harness in between. This allows the system to learn and improve over time, as demonstrated by a YC system that matches startup founders. Skills like `/enrich-founder` and `/match` perform complex analysis and matching that pure embedding searches cannot. A learning loop allows skills to rewrite themselves based on feedback, creating a compound improvement effect without code changes. The conclusion is that 10x to 1000x efficiency gains come from this disciplined system design, not just smarter models. Skills represent permanent upgrades that automatically improve with each new model release.

marsbit04/13 04:19

Thin Harness, Fat Skills: The True Source of 100x AI Productivity

marsbit04/13 04:19

活动图片