Zhejiang University Research Team Proposes New Approach: Teaching AI How the Human Brain Understands the World

marsbitОпубліковано о 2026-04-05Востаннє оновлено о 2026-04-05

Анотація

A research team from Zhejiang University published a paper in *Nature Communications* challenging the prevailing notion that larger AI models inherently think more like humans. They found that while model performance on recognizing concrete concepts improved as parameters increased (from 74.94% to 85.87%), performance on abstract concept tasks slightly declined (from 54.37% to 52.82%) in models like SimCLR, CLIP, and DINOv2. The key difference lies in how concepts are organized. Humans naturally form hierarchical categories (e.g., grouping a swan and an owl into "birds"), enabling them to apply past knowledge to new situations. Models, however, rely heavily on statistical patterns in data and struggle to form stable, abstract categories. The team proposed a novel solution: using human brain signals (recorded when viewing images) to supervise and guide the model's internal organization of concepts. This method, termed transferring "human conceptual structures," helped the model learn a brain-like categorical system. In experiments, the model showed improved few-shot learning and generalization, with a 20.5% average improvement on a task requiring abstract categorization like distinguishing living vs. non-living things, even outperforming much larger models. This research shifts the focus from simply scaling model size ("bigger is better") to designing smarter internal structures ("structured is smarter"). It highlights a new pathway for developing AI that possesses more hum...

Large models have been growing in size, with the mainstream view being that the more parameters a model has, the closer it gets to human-like thinking. However, a paper published by a Zhejiang University team on April 1 in Nature Communications presents a different perspective (original article link: https://www.nature.com/articles/s41467-026-71267-5). They found that as model size (primarily SimCLR, CLIP, DINOv2) increases, the ability to recognize specific objects does continue to improve, but the ability to understand abstract concepts not only fails to improve but can even decline. When parameters increased from 22.06 million to 304.37 million, performance on concrete concept tasks rose from 74.94% to 85.87%, while performance on abstract concept tasks dropped from 54.37% to 52.82%.

Differences Between Human and Model Thinking

When the human brain processes concepts, it first forms a system of categorical relationships. Swans and owls look different, but humans still classify them both as birds. Moving up, birds and horses can be further grouped into the animal category. When humans encounter something new, they often first consider what it resembles from past experience and which category it might belong to. Humans continuously learn new concepts, then organize this experience, using this relational system to recognize new things and adapt to new situations.

Models also classify, but they form these classifications differently. They rely primarily on patterns that repeatedly appear in large-scale data. The more frequently a specific object appears, the easier it is for the model to recognize it. When it comes to larger categories, models struggle more. They need to capture the commonalities between multiple objects and then group these commonalities into the same category. Existing models still have significant shortcomings here. As parameters continue to increase, performance on concrete concept tasks improves, while performance on abstract concept tasks sometimes even decreases.

A commonality between the human brain and models is that both internally form a system of categorical relationships. However, their emphases differ. The higher-order visual regions of the human brain naturally distinguish broad categories like living and non-living things. Models can separate specific objects but find it difficult to stably form these larger categories. This difference means the human brain more easily applies past experience to new objects, allowing for rapid categorization of unseen things. Models, conversely, rely more on existing knowledge, so when encountering new objects, they tend to focus on superficial features. The method proposed in the paper addresses this characteristic, using brain signals to constrain the model's internal structure, making it closer to the human brain's categorization method.

The Solution from the Zhejiang University Team

The team's proposed solution is also unique: instead of simply adding more parameters, they use a small amount of brain signal data for supervision. These brain signals come from recordings of brain activity while humans view images. The original paper states the goal as transferring 'human conceptual structures' to DNNs. This means teaching the model, as much as possible, how the human brain classifies, generalizes, and groups similar concepts together.

The team conducted experiments using 150 known training categories and 50 unseen test categories. The results showed that as this training progressed, the distance between the model's representations and the brain representations continuously narrowed. This change occurred for both categories, indicating that the model was learning not just individual samples but truly beginning to learn a conceptual organization method more akin to the human brain.

After this process, the model demonstrated stronger few-shot learning capabilities and performed better in novel situations. In a task requiring the model to distinguish abstract concepts like living vs. non-living things with very few examples, the model improved by an average of 20.5%, even surpassing much larger control models. The team also conducted 31 additional specialized tests, where several types of models showed improvements of nearly ten percent.

Over the past few years, the familiar path in the modeling industry has been larger model scale. The Zhejiang University team has chosen a different direction: moving from 'bigger is better' to 'structured is smarter'. Scaling up is indeed useful, but it primarily improves performance on familiar tasks. Abstract understanding and transfer capabilities, inherent to humans, are equally crucial for AI. This requires future AI thinking structures to more closely resemble the human brain. The value of this direction lies in redirecting the industry's attention from pure size expansion back to the cognitive structure itself.

Neosoul and the Future

This points to a larger possibility: AI evolution may not only occur during the model training phase. Model training can determine how AI organizes concepts and forms higher-quality judgment structures. Then, after entering the real world, another layer of AI evolution just begins: how an AI agent's judgments are recorded, tested, and how they continuously grow and evolve through real-world competition, learning and evolving on their own, much like humans. This is precisely what Neosoul is doing now. Neosoul doesn't just have AI agents produce answers; it places AI agents into a system of continuous prediction, verification, settlement, and selection, allowing them to continuously optimize themselves based on predictions and outcomes, preserving better structures and淘汰ing worse ones. What the Zhejiang University team and Neosoul jointly point towards is actually the same goal: enabling AI to not just solve problems, but to possess comprehensive thinking abilities and continuously evolve.

Пов'язані питання

QWhat was the key finding of the Zhejiang University team's research published in Nature Communications regarding model scaling?

AThey found that as model parameters increased (from 22.06 million to 304.37 million), performance on recognizing concrete concepts improved (74.94% to 85.87%), but performance on understanding abstract concepts not only failed to improve but actually decreased (54.37% to 52.82%).

QWhat is the fundamental difference between how the human brain and AI models form conceptual categories?

AThe human brain naturally forms a hierarchical classification system (e.g., grouping specific birds into the 'bird' category, then 'birds' and 'horses' into 'animals'). AI models primarily rely on statistical patterns from large-scale data, excelling at recognizing specific, frequently appearing objects but struggling to form stable, larger abstract categories.

QWhat was the unique solution proposed by the Zhejiang University team to improve AI's abstract reasoning?

AInstead of scaling model size, they used a small amount of brain signal data (recordings of human brain activity when viewing images) as supervision to transfer human conceptual structures to the deep neural networks (DNNs), teaching them how to classify and generalize concepts more like the human brain.

QWhat improvements were observed in the model after being trained with the brain signal supervision method?

AThe distance between the model's representations and brain representations decreased. The model showed stronger few-shot learning capabilities and performed better in novel situations. In a task requiring abstract concept discrimination with very few examples, performance improved by an average of 20.5%, even surpassing much larger control models.

QWhat broader shift in AI development philosophy does the research and Neosoul project represent, according to the article?

AIt represents a shift from the 'bigger is better' paradigm focused on scaling parameters to a 'structured is smarter' approach. The focus is on improving the AI's cognitive structure to be more human-like, enabling abstract understanding and transfer capabilities, and creating systems for continuous learning and evolution through real-world prediction, verification, and competition.

Пов'язані матеріали

OpenAI's Largest Internal Wealth Creation: 600 People Cash Out a Total of $6.6 Billion, 75 Take Home the Maximum $30 Million Each

A Wall Street Journal report reveals OpenAI's unprecedented pre-IPO wealth creation. In a single employee stock sale last October, over 600 current and former employees sold shares, collectively cashing out approximately $6.6 billion. Due to high investor demand, the company tripled the individual sale cap to $30 million, with about 75 employees selling the maximum amount. This event represents the largest such transaction in tech industry history for a private company. OpenAI's valuation was $500 billion for this tender offer. Employees with over two years of tenure were eligible, allowing many post-ChatGPT hires their first liquidity event. The company's stock has reportedly grown over 100-fold in seven years. Following a restructuring, employees collectively hold about 26% of OpenAI. The scale of executive wealth is also staggering. In court testimony related to Elon Musk's lawsuit, President and co-founder Greg Brockman confirmed his OpenAI stake is worth around $30 billion. Analysis indicates about 165 current and former employees hold a combined ~$164.9 billion in equity, averaging nearly $1 billion per person in paper wealth. OpenAI's per-employee stock-based compensation is estimated to be 34 times the average of major tech firms before their IPOs. OpenAI continues its rapid ascent, closing a $122 billion funding round at an $852 billion valuation in March. With monthly revenue hitting $2 billion, over 900 million weekly ChatGPT users, and plans for a potential trillion-dollar IPO in late 2026, this wealth-creation engine shows no signs of stopping.

链捕手7 хв тому

OpenAI's Largest Internal Wealth Creation: 600 People Cash Out a Total of $6.6 Billion, 75 Take Home the Maximum $30 Million Each

链捕手7 хв тому

An In-Depth Analysis of the Reasons Behind the Recent Surge in the Memory Storage Industry

"Understanding the Recent Boom in the Memory Industry" summarizes the key structural shift in the memory sector driven by AI. Traditionally, the memory industry (DRAM, NAND) has been highly cyclical, with prices driven by inventory cycles and consumer electronics demand, which can be deferred. However, AI has fundamentally changed the demand structure. AI workloads create continuous, non-deferrable demand for high-bandwidth memory (HBM) due to GPU memory bottlenecks. This high-margin HBM production is prioritized by major manufacturers like Samsung, SK hynix, and Micron, structurally reducing the supply of traditional DRAM and NAND. Consequently, the market is transitioning from a spot-based model to a contract-based allocation system. HBM supply is reportedly booked through 2026, leading to spot shortages and sharp price increases for other memory types. Delivery lead times have also extended dramatically. Pricing power is shifting from the open market to those with long-term contracts. Primary beneficiaries are HBM-leading suppliers and large hyperscalers (Microsoft, AWS, Google) that can lock in future supply. Key risks to this "AI supercycle" include a potential slowdown in AI capital expenditure, disruptive new memory architectures, or a return to aggressive industry capacity expansion, which could reintroduce cyclicality.

marsbit9 хв тому

An In-Depth Analysis of the Reasons Behind the Recent Surge in the Memory Storage Industry

marsbit9 хв тому

BlackRock Expands Tokenized Finance Push With New Treasury Fund Filings

BlackRock, the world's largest asset manager, has intensified its push into tokenized finance by filing with the SEC to create a new fund. Named the BlackRock Daily Reinvestment Stablecoin Reserve Vehicle, the proposed fund would invest in cash, U.S. Treasuries, and repurchase agreements. It would issue "OnChain Shares" using a permissioned system linked to public blockchains, with Securitize Transfer Agent LLC maintaining ownership records. In a separate move, BlackRock also filed to establish an onchain share class for its existing BlackRock Select Treasury Based Liquidity Fund. For this, the transfer agent BNY Mellon outlined plans to use Ethereum's ERC-20 standards, combining blockchain data with offchain identification. These filings represent a significant expansion into tokenization, the process of creating digital tokens for traditional assets like funds. Advocates believe this technology can enhance transparency, enable 24/7 trading, and speed up settlement times.

TheNewsCrypto29 хв тому

BlackRock Expands Tokenized Finance Push With New Treasury Fund Filings

TheNewsCrypto29 хв тому

Understanding CPO (Co-Packaged Optics) in One Article: Why Nvidia Is Willing to Spend $3.2 Billion on a Fiber?

NVIDIA and Corning announced a multi-year strategic partnership on May 6, 2026, with NVIDIA committing up to $3.2 billion to support Corning's U.S. expansion. This investment will triple Corning's manufacturing plants and significantly boost its optical fiber and communications production capacity. The core driver behind this massive investment is the fundamental shift from copper to optical interconnect technology within AI data centers. As GPU clusters scale, copper wires face critical limitations: severe signal attenuation over distance, high energy consumption for signal integrity, and excessive heat generation. Optical fiber, transmitting light instead of electrical signals, solves these issues with minimal loss, near-light speed, and lower power needs. The article outlines a three-stage evolution of data center interconnect: 1. **Traditional Copper Interconnects:** The mainstream solution of the 2010s, now being phased out due to scaling bottlenecks. 2. **Pluggable Optical Modules:** The current mainstream, where modules convert electrical signals to light externally. This process still introduces energy loss and latency. 3. **CPO (Co-Packaged Optics):** The next-generation technology where the optical engine is integrated directly with the GPU chip package. This drastically reduces the electrical signal travel distance to mere millimeters, slashing power consumption and latency while boosting data density. NVIDIA CEO Jensen Huang has identified CPO as an essential core technology for AI infrastructure. NVIDIA's investment signifies a strategic shift from being a buyer to actively controlling its supply chain for critical components. With demand for specialized optical fiber far outstripping supply—evidenced by soaring prices—securing long-term manufacturing capacity has become a competitive necessity. While Corning's expansion may pressure some suppliers, a projected global fiber supply gap of 5-15% over the next few years creates a significant opportunity window, particularly for Chinese manufacturers competitive in optical preforms, chips, and modules. Ultimately, NVIDIA's move is not about chasing a trend but an engineering imperative. The transition to light-based interconnects like CPO is driven by the physical limits of copper, marking a definitive step in the ongoing AI computing revolution.

marsbit31 хв тому

Understanding CPO (Co-Packaged Optics) in One Article: Why Nvidia Is Willing to Spend $3.2 Billion on a Fiber?

marsbit31 хв тому

KOL's Perspective: Why Is SOL Set to Rise from This Point?

**Summary: Why SOL is Positioned for Growth at This Level** The article argues that SOL is poised for an upward move from its current price point, citing several key factors. Primarily, SOL has just broken out of a 4-month consolidation phase. This breakout signals a return of risk appetite to the broader crypto market, as SOL is seen as a key indicator of overall crypto health. The token's ownership has reportedly shifted from short-term traders and tourists to long-term accumulators, leading to low volume. Any meaningful increase in trading activity could thus trigger significant upward momentum. Fundamental strengths include strong institutional adoption, integration with DeFi and RWAs (Real-World Assets), and the potential benefits from the Clarity Act. Despite its high volatility—having dropped 70% from its all-time high but still up 12x from its bear market low—SOL is highlighted as one of the few tokens from the last cycle to reach new highs. It boasts a robust ecosystem of applications, users, and protocols. Future catalysts include the expected influx of AI developers following the Miami Accelerate conference, which focused on AI on Solana. Furthermore, Solana is positioned as the premier chain for memecoin activity, a trend expected to continue and drive network usage and fees. The article concludes that recent price action reflects a healthy transfer to long-term holders, setting the stage for growth.

marsbit1 год тому

KOL's Perspective: Why Is SOL Set to Rise from This Point?

marsbit1 год тому

Торгівля

Спот

Ф'ючерси