Zhejiang University Research Team Proposes New Approach: Teaching AI How the Human Brain Understands the World

marsbit2026-04-05 tarihinde yayınlandı2026-04-05 tarihinde güncellendi

Özet

A research team from Zhejiang University published a paper in *Nature Communications* challenging the prevailing notion that larger AI models inherently think more like humans. They found that while model performance on recognizing concrete concepts improved as parameters increased (from 74.94% to 85.87%), performance on abstract concept tasks slightly declined (from 54.37% to 52.82%) in models like SimCLR, CLIP, and DINOv2. The key difference lies in how concepts are organized. Humans naturally form hierarchical categories (e.g., grouping a swan and an owl into "birds"), enabling them to apply past knowledge to new situations. Models, however, rely heavily on statistical patterns in data and struggle to form stable, abstract categories. The team proposed a novel solution: using human brain signals (recorded when viewing images) to supervise and guide the model's internal organization of concepts. This method, termed transferring "human conceptual structures," helped the model learn a brain-like categorical system. In experiments, the model showed improved few-shot learning and generalization, with a 20.5% average improvement on a task requiring abstract categorization like distinguishing living vs. non-living things, even outperforming much larger models. This research shifts the focus from simply scaling model size ("bigger is better") to designing smarter internal structures ("structured is smarter"). It highlights a new pathway for developing AI that possesses more hum...

Large models have been growing in size, with the mainstream view being that the more parameters a model has, the closer it gets to human-like thinking. However, a paper published by a Zhejiang University team on April 1 in Nature Communications presents a different perspective (original article link: https://www.nature.com/articles/s41467-026-71267-5). They found that as model size (primarily SimCLR, CLIP, DINOv2) increases, the ability to recognize specific objects does continue to improve, but the ability to understand abstract concepts not only fails to improve but can even decline. When parameters increased from 22.06 million to 304.37 million, performance on concrete concept tasks rose from 74.94% to 85.87%, while performance on abstract concept tasks dropped from 54.37% to 52.82%.

Differences Between Human and Model Thinking

When the human brain processes concepts, it first forms a system of categorical relationships. Swans and owls look different, but humans still classify them both as birds. Moving up, birds and horses can be further grouped into the animal category. When humans encounter something new, they often first consider what it resembles from past experience and which category it might belong to. Humans continuously learn new concepts, then organize this experience, using this relational system to recognize new things and adapt to new situations.

Models also classify, but they form these classifications differently. They rely primarily on patterns that repeatedly appear in large-scale data. The more frequently a specific object appears, the easier it is for the model to recognize it. When it comes to larger categories, models struggle more. They need to capture the commonalities between multiple objects and then group these commonalities into the same category. Existing models still have significant shortcomings here. As parameters continue to increase, performance on concrete concept tasks improves, while performance on abstract concept tasks sometimes even decreases.

A commonality between the human brain and models is that both internally form a system of categorical relationships. However, their emphases differ. The higher-order visual regions of the human brain naturally distinguish broad categories like living and non-living things. Models can separate specific objects but find it difficult to stably form these larger categories. This difference means the human brain more easily applies past experience to new objects, allowing for rapid categorization of unseen things. Models, conversely, rely more on existing knowledge, so when encountering new objects, they tend to focus on superficial features. The method proposed in the paper addresses this characteristic, using brain signals to constrain the model's internal structure, making it closer to the human brain's categorization method.

The Solution from the Zhejiang University Team

The team's proposed solution is also unique: instead of simply adding more parameters, they use a small amount of brain signal data for supervision. These brain signals come from recordings of brain activity while humans view images. The original paper states the goal as transferring 'human conceptual structures' to DNNs. This means teaching the model, as much as possible, how the human brain classifies, generalizes, and groups similar concepts together.

The team conducted experiments using 150 known training categories and 50 unseen test categories. The results showed that as this training progressed, the distance between the model's representations and the brain representations continuously narrowed. This change occurred for both categories, indicating that the model was learning not just individual samples but truly beginning to learn a conceptual organization method more akin to the human brain.

After this process, the model demonstrated stronger few-shot learning capabilities and performed better in novel situations. In a task requiring the model to distinguish abstract concepts like living vs. non-living things with very few examples, the model improved by an average of 20.5%, even surpassing much larger control models. The team also conducted 31 additional specialized tests, where several types of models showed improvements of nearly ten percent.

Over the past few years, the familiar path in the modeling industry has been larger model scale. The Zhejiang University team has chosen a different direction: moving from 'bigger is better' to 'structured is smarter'. Scaling up is indeed useful, but it primarily improves performance on familiar tasks. Abstract understanding and transfer capabilities, inherent to humans, are equally crucial for AI. This requires future AI thinking structures to more closely resemble the human brain. The value of this direction lies in redirecting the industry's attention from pure size expansion back to the cognitive structure itself.

Neosoul and the Future

This points to a larger possibility: AI evolution may not only occur during the model training phase. Model training can determine how AI organizes concepts and forms higher-quality judgment structures. Then, after entering the real world, another layer of AI evolution just begins: how an AI agent's judgments are recorded, tested, and how they continuously grow and evolve through real-world competition, learning and evolving on their own, much like humans. This is precisely what Neosoul is doing now. Neosoul doesn't just have AI agents produce answers; it places AI agents into a system of continuous prediction, verification, settlement, and selection, allowing them to continuously optimize themselves based on predictions and outcomes, preserving better structures and淘汰ing worse ones. What the Zhejiang University team and Neosoul jointly point towards is actually the same goal: enabling AI to not just solve problems, but to possess comprehensive thinking abilities and continuously evolve.

İlgili Sorular

QWhat was the key finding of the Zhejiang University team's research published in Nature Communications regarding model scaling?

AThey found that as model parameters increased (from 22.06 million to 304.37 million), performance on recognizing concrete concepts improved (74.94% to 85.87%), but performance on understanding abstract concepts not only failed to improve but actually decreased (54.37% to 52.82%).

QWhat is the fundamental difference between how the human brain and AI models form conceptual categories?

AThe human brain naturally forms a hierarchical classification system (e.g., grouping specific birds into the 'bird' category, then 'birds' and 'horses' into 'animals'). AI models primarily rely on statistical patterns from large-scale data, excelling at recognizing specific, frequently appearing objects but struggling to form stable, larger abstract categories.

QWhat was the unique solution proposed by the Zhejiang University team to improve AI's abstract reasoning?

AInstead of scaling model size, they used a small amount of brain signal data (recordings of human brain activity when viewing images) as supervision to transfer human conceptual structures to the deep neural networks (DNNs), teaching them how to classify and generalize concepts more like the human brain.

QWhat improvements were observed in the model after being trained with the brain signal supervision method?

AThe distance between the model's representations and brain representations decreased. The model showed stronger few-shot learning capabilities and performed better in novel situations. In a task requiring abstract concept discrimination with very few examples, performance improved by an average of 20.5%, even surpassing much larger control models.

QWhat broader shift in AI development philosophy does the research and Neosoul project represent, according to the article?

AIt represents a shift from the 'bigger is better' paradigm focused on scaling parameters to a 'structured is smarter' approach. The focus is on improving the AI's cognitive structure to be more human-like, enabling abstract understanding and transfer capabilities, and creating systems for continuous learning and evolution through real-world prediction, verification, and competition.

İlgili Okumalar

Claude Bill Skyrockets by 5 Billion, Surges 60-Fold Overnight—Can Your Token Budget Keep Up?

An enterprise reportedly ran up a staggering $500 million bill on Anthropic's Claude AI in just one month due to a simple oversight: failing to set usage limits for employee accounts. This incident highlights a growing trend of runaway AI costs. Other examples include a Google Cloud user hit with an unexpected $18,000 bill from API key abuse, and an OpenAI internal experiment that consumed 603 billion tokens, costing $1.3 million in 30 days. Major AI providers like OpenAI and GitHub are shifting from flat monthly fees to granular, usage-based pricing (per input/output/cached token), causing shock for some users whose costs skyrocketed by orders of magnitude. The root causes extend beyond pricing. The rise of autonomous AI agents executing long, complex tasks has drastically increased token consumption. Furthermore, misaligned incentives, like internal "leaderboards" ranking employees by AI usage, can encourage wasteful "tokenmaxxing"—using powerful models for trivial tasks just to inflate metrics. This has sparked a new industry focused on cost optimization. Solutions include providing AI with better context (reducing redundant searches) and intelligent model routing (matching tasks to the most cost-effective model). Research indicates token consumption for agentic tasks can vary wildly (up to 30x for the same job) without guaranteeing better results, and models often underestimate their own costs. As AI expenses begin to rival or even surpass human labor costs for some teams, companies are being forced to move from indiscriminate usage to meticulous "token accounting." The future belongs to those who can maximize the value of every token spent.

marsbit18 dk önce

Claude Bill Skyrockets by 5 Billion, Surges 60-Fold Overnight—Can Your Token Budget Keep Up?

marsbit18 dk önce

24/7 Unstoppable Derivatives Wave: Cryptocurrency Is Forcing Traditional Finance to 'Change Time Zones'

The article discusses how the 24/7 nature of the cryptocurrency market is compelling traditional finance to adapt its operating hours and infrastructure. The key catalyst is the CME Group's planned launch of nearly round-the-clock trading for regulated crypto derivatives, a move driven by strong institutional demand for continuous risk management. This shift highlights a fundamental change: derivatives, not spot trading, now dominate crypto market activity and price discovery. However, integrating continuous trading into traditional finance reveals structural tensions. While execution times can be extended, settlement, clearing, and regulatory reporting largely remain bound to traditional business-day cycles. This creates a lag where weekend price movements can impact risk exposures before traditional control systems are fully active. Furthermore, the article explores new challenges arising from this always-on environment. The inherent transparency of public blockchains, while ensuring auditable settlement, also exposes sensitive corporate information like treasury flows to competitors in real-time. This has elevated privacy from a feature to a core requirement for institutional adoption. The next phase hinges on building systems that balance this necessary privacy with regulatory accountability and compliance. In conclusion, the move towards 24/7 trading signifies more than crypto becoming institutionalized. It represents traditional finance beginning to adopt the temporal structure of crypto-native markets. The future will be defined by how successfully traditional risk, identity, privacy, and settlement frameworks can operate at the continuous speed cryptocurrency markets demand.

marsbit1 saat önce

24/7 Unstoppable Derivatives Wave: Cryptocurrency Is Forcing Traditional Finance to 'Change Time Zones'

marsbit1 saat önce

From "Selling Cloud" to "Selling Token": Telecom Operators Fully Join the AI Battlefield

From selling cloud services to tokens, operators fully join the AI battlefield. Major Chinese telecom operators—China Telecom, China Mobile, and China Unicom—are launching AI token packages for personal, family, and enterprise use. This marks a shift from selling traditional services like voice and data to offering tokens as a new standard unit for AI service consumption. Operators are transforming into AI service integrators, moving beyond their role as cloud providers. They are building an integrated structure that combines models, computing power, applications, unified billing, and security. By bundling various AI models, pooling heterogeneous computing resources, developing industry-specific agents, and leveraging their existing billing systems, they aim to lower entry barriers and simplify AI adoption for businesses. The token model is becoming a key packaging method for AI services, moving beyond developer markets to the mainstream. Operators are positioning themselves as the essential infrastructure for the AI era, aiming to make AI a billable, manageable, and reliable utility similar to electricity or network services. However, challenges remain in ensuring model quality, cost reduction, and achieving sustainable commercial success. The ultimate goal is to make AI a measurable and scalable production factor for industries.

marsbit1 saat önce

From "Selling Cloud" to "Selling Token": Telecom Operators Fully Join the AI Battlefield

marsbit1 saat önce

Unitree Passes the Hearing, Hangzhou Reaps the Rewards

Unitree Technology, a leading company in Hangzhou's tech scene known as one of the "Hangzhou Six Dragons," has officially passed the review for listing on the Shanghai Stock Exchange's STAR Market (科创板). It plans to raise 4.202 billion yuan for the research and development of intelligent robot models and robot hardware. This milestone will make Unitree the "first humanoid robotics stock." Founded in 2016 by Wang Xingxing, the company started humbly in a small office in Hangzhou's Binjiang district. Initially, the robotics sector was not viewed favorably by the market, with Unitree's products often labeled as "toys" and struggling to secure funding. At its most critical point, with only around 100,000 yuan left, Wang stopped his own salary to keep the company afloat. A crucial turning point came in 2018 when Hangzhou's state-owned capital system provided timely support. A financial platform under the city's state-owned assets completed due diligence in three days and granted a 20-million-yuan loan within a week. This "patient capital" infusion stabilized Unitree, enabling its transition from prototype development to mass production and commercial viability. Subsequently, Hangzhou Capital, through its two major 100-billion-yuan mother funds—the Hangzhou Science and Technology Innovation Fund and the Hangzhou Innovation Fund—participated in four of Unitree's financing rounds (B2, B3, C, and C+). This continuous backing helped the company grow, attract top-tier industrial investors like China Mobile, Tencent, Alibaba, and Geely, and solidify its position as a global leader in legged robotics. By 2025, Unitree achieved significant scale, with revenue reaching 16.99 billion yuan, net profit of 5.91 billion yuan, global leadership in humanoid robot shipments, and over 33,000 quadruped robots sold worldwide. Unitree's journey exemplifies Hangzhou's strategy of nurturing hard-tech startups from "seedlings" to industry leaders. Beyond Unitree, Hangzhou's capital ecosystem has supported other "Six Dragons" like Cloudwalk, BrainCo, and DeepSeek. The city has established a 500-billion-yuan "3+N" industrial fund cluster and specialized early-stage funds like the "Runmiao Fund" with a 20-year term to fill funding gaps for very early-stage projects. This robust "capital + talent" model, coupled with an influx of over 430,000 young professionals in 2025 alone, has fostered a vibrant innovation ecosystem. Hangzhou is now home to 48 unicorns and 413 potential unicorns, building comprehensive industrial chains in AI, robotics, brain-computer interfaces, and more. As Hangzhou experiences a wave of IPOs, it is solidifying its reputation as an ideal city for entrepreneurs.

marsbit1 saat önce

Unitree Passes the Hearing, Hangzhou Reaps the Rewards

marsbit1 saat önce

Mystery Trader Dumps $1.26B in BlackRock Bitcoin ETF

A mystery trader sold a massive block of 29.2 million shares of BlackRock's iShares Bitcoin Trust (IBIT) for $1.26 billion via a dark pool. NYDIG's Greg Cipolaro analyzed the trade, noting the seller accepted a price $1.01 below market value, forfeiting $29.5 million for immediate execution. He speculates this indicates a large directional holder exiting a concentrated position, not a basis-trade unwind. The sale occurred amid a broader trend of outflows from U.S. spot Bitcoin ETFs, which have seen 11 consecutive days of net withdrawals totaling over $2.9 billion since mid-May. Bitcoin's price dipped 2.8% following the block trade, but analysts noted the market absorbed the large sale relatively well. Overall market sentiment remains cautious, with a key "fear" index averaging in "fear" territory for May.

TheNewsCrypto1 saat önce

Mystery Trader Dumps $1.26B in BlackRock Bitcoin ETF

TheNewsCrypto1 saat önce

İşlemler

Spot

Futures