The Year of AI Applications: Saying 'Yes' While Ignoring Risks? A Comprehensive Open Source Log of Software Development's Journey

marsbitPublished on 2026-06-16Last updated on 2026-06-16

Abstract

The Year of AI Applications: Blindly Saying "Yes" While Ignoring Risks? A Software Development Log Goes Fully Open Source. AI-generated code harbors risks hidden within seemingly correct programs, potentially leading to data leaks or asset loss. The open-source project "Narwhal AI Code Risks," from Peking University's Narwhal-Lab, compiles real-world cases, early warning signs, and typical risk pathways. Its goal is to help developers identify potential hazards early and avoid repeating past mistakes. In 2026, code is generated faster than ever but deployed with less scrutiny. The danger often lies not in glaring errors, but in code that appears normal—syntactically correct, passing all checks—yet introduces subtle but critical flaws like non-existent dependencies, excessive permissions, or exposed databases. A stark example is the Moonwell cbETH oracle incident. A configuration file error, where a cryptocurrency price was set to ~$1.12 instead of ~$2,200, slipped through 28 checks and a pull request signed by both AI (Claude, Copilot) and human developers. This "semantic deviation" resulted in a loss of $1.78 million. The risk is that AI can produce functionally valid code that is semantically wrong for the business context. As AI moves beyond simple code completion to modifying configurations, installing dependencies, and operating via autonomous agents, it traverses longer, less traceable paths within software engineering, blurring traditional boundaries and oversight ...

The risks of AI-written code lurk within seemingly correct code, potentially leading to data breaches or asset loss. The open-source Narwhal AI Code Risks project compiles real-world cases, early warning signs, and typical risk pathways to help developers identify hidden dangers early and avoid repeating past mistakes.

In 2026, code is being generated at an ever-increasing pace, yet deployed with less and less scrutiny.

More and more often, user requirements are placed in a dialogue box, AI reads the context, completes functions, pulls dependencies, fixes configurations, and even conveniently generates tests.

Before you know it, a piece of code is already sitting in the repository, awaiting merge.

Users have developed a new habit: let the AI write it first and get it running, then see what needs fixing if there's a problem.

But in the software world, the most dangerous things are often pieces of code that appear utterly ordinary: syntactically correct, interfaces valid, tests passing, comments perfect.

Yet it may still introduce non-existent package names, open overly broad permissions, expose databases... or even allow an Agent capable of directly calling system tools to exfiltrate sensitive data from internal systems under prompt injection.

The real danger is not a flashing red error light. It's when all risk indicators show normal.

Risks from AI-generated code used to be scattered: a case buried in a security blog, a clue recorded in an Issue. When the next team encountered a similar problem, they had to piece together the source of risk from scratch and expend immense time and effort conducting large-scale empirical measurements on the code.

Now, Peking University's Narwhal-Lab has just open-sourced Narwhal AI Code Risks, which organizes these information fragments into three categories for researchers to examine: real incidents, early signals, and typical risk paths.

Paper link: https://github.com/Narwhal-Lab/Narwhal-aicode-risks

When All 28 Checks Pass, the System Still Veers Off Course

The first clue was a merged Pull Request, where the signature field prominently featured Claude Opus 4.6, Copilot, and four human developers. All 28 checks passed: No one spotted the issue.

Then, the liquidation bot took a few minutes and seized collateral worth $1,778,044.83.

The configuration file set the price of cbETH to its conversion ratio with ETH, approximately $1.12, instead of the actual price near $2,200.

A semantic price error slipped through development, review, and merge processes, ultimately turning into real loss in the financial system. This is the most glaring aspect of the Moonwell cbETH oracle configuration incident.

The problem lay in code without syntax errors, and human developers not immediately halting the anomalous process. On the contrary, it looked complete, smooth—a normal engineering delivery.

But it is precisely this undercurrent of normalcy that makes it a quintessential example of a security incident.

The risk of AI Coding lies in the fact that it doesn't always manifest as errors.

Often, it cloaks itself in the guise of a correct answer, quietly entering the engineering pipeline. The code runs, checks pass, PRs get merged, but the business semantics have already deviated from reality.

In low-risk projects, such semantic drift might just mean rework. But in sensitive contexts like finance or enterprise data systems, it directly leads to data leaks, exposed permissions, and asset loss.

When AI participates in writing code, modifying configurations, conducting reviews, or even co-signing and entering PRs, can we be sufficiently certain of how each deviation occurs?

The Green Light Doesn't Illuminate Every Corner

Early AI code assistants mostly remained at the level of local completions. If the syntax was wrong, the compiler would error, unit tests would fail, and the CI pipeline would block it.

Today's AI Coding ventures much further, while oversight has lagged behind.

It can read files, modify configurations, install dependencies, generate infrastructure scripts, and plan autonomously across multiple tasks via Agents.

AI is no longer just sitting on the sidelines handing over tools; it's beginning to enter longer chains of the software engineering process.

>The once-clear boundaries in software engineering are being reconnected by AI Agents into longer, harder-to-trace pathways.

Scattered Records Need a Common Logbook

Security incidents rarely start with complete conclusions. Some events have solid evidence and can enter the directory as real cases; some remain at the stage of community screenshots, researcher discussions, or preliminary disclosures, suitable only for continued observation; others are not tied to a single real event but have already formed clear patterns, suitable for proactive scenario planning.

Narwhal AI Code Risks divides the material into three layers: `cases/`, `inferred/`, and `scenarios/`.

`cases/` records real incidents with public sources and evidential chains; `inferred/` stores early signals not yet fully substantiated but worth continuous tracking; `scenarios/` organizes typical scenarios with clear risk paths, not yet bound to a single specific incident.

Without such public records, risks from AI Coding easily become short-term memories on the internet.

Today, everyone remembers a certain package name; tomorrow, they discuss a data exposure incident; after a few months, it's all covered by the next wave of tool hype. When similar problems arise again, teams still blunder like headless flies into waters of unknown risk.

What Narwhal AI Code Risks does is anchor these scattered risk fragments, allowing those who come later to turn to the same page.

Following Seven Index Categories to See Where Risks Come From

The problems brought by AI-generated code are not only in the code itself. They are in dependencies, in permissions, in Agent tool calls, and even more so in the way humans trust AI output.

Currently, Narwhal AI Code Risks categorizes risks into 7 types: Supply Chain, Code-Level Vulnerabilities, Cloud & Infrastructure Configuration, Agent Risks, Vertical Domain Risks, Intellectual Property & Compliance Risks, and Human Factors.

In Supply Chain risks, AI may recommend non-existent dependencies. In Code-Level Vulnerabilities, AI might reintroduce path traversal, missing input validation, or authentication issues into business code. In Cloud & Infrastructure Configuration, AI might grant overly broad permissions, public storage buckets, or exposed ports just to get the code running initially. Agent Risks are even more complex, moving beyond text generation to action execution. AI-generated artifacts are planting hidden dangers in real systems.

The AI Engine Is Firing Up, and the Logbook Is Just Beginning

As AI increasingly steps into the real world, related risk prevention and mitigation should not remain confined to post-mortems or scattered discussions.

The truly important aspect of Narwhal AI Code Risks is transforming risk cases into reusable knowledge.

Developers can use it to identify similar issues; security researchers can treat it as a sample library; tool vendors can extract detection rules and evaluation benchmarks from it; the open-source community can continue to contribute new cases, new evidence, and new risk types.

The AI engine is roaring, and every course deviation should leave its coordinates. Risks never disappear by being ignored, but experience can be recorded and passed on. The real value lies not in discovering a single vulnerability, but in ensuring later voyagers don't have to step into the same trap.

What Narwhal AI Code Risks is doing is providing an open-source logbook for the software world in the Year of AI Applications.

References:

https://github.com/Narwhal-Lab/Narwhal-aicode-risks

This article is from the WeChat public account "New Zhiyuan," author: LRST

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

BrevisBREV

ZRX（0X）ZRX

PancakeSwapCAKE

Buffett and VC, One Must Lose

The article examines the perceived investment conflict between Warren Buffett's value-oriented approach and the current aggressive, tech-focused venture capital (VC) landscape, particularly in AI. It notes that while direct public debate is limited, underlying tensions exist as Buffett's Berkshire Hathaway holds record cash, signaling caution, while VCs pour unprecedented capital into tech. The piece revisits two historical confrontations: the 2000 dot-com bubble, where Buffett warned against speculative tech investments despite market euphoria, and his 2007-2017 bet against hedge funds, which he won by advocating low-cost index funds. In both cases, Buffett opposed investing based on hype, excessive fees, or complexity rather than concrete business value. The core argument is that Buffett does not oppose innovation or VC itself but critiques investment decisions detached from fundamentals—such as ignoring profitability, over-relying on narratives like “this time is different,” or obscuring valuation with complex structures. For the current AI boom, he would likely question sustainable business models and cash flows amid high capital costs. The conclusion reiterates Buffett's “Mr. Market” parable: investors should use market volatility to their advantage, not be guided by it, emphasizing that disciplined valuation, not sentiment, should drive decisions.

marsbit1m ago

marsbit1m ago

Dalio's Major Article: How to Position in the Current Market Environment?

In the current market environment, dominated by excitement and uncertainty around revolutionary AI technology, Ray Dalio emphasizes the critical importance of diversification. He identifies key drivers—debt/monetary conditions, political/social issues, geopolitics, natural forces, and new tech—that create a highly concentrated and risky landscape, reminiscent of past technological cycles. Dalio argues that while AI presents immense opportunities, investing heavily in a few leading tech stocks carries significant risk due to their inherent volatility, competitive pressures, potential over/under-investment, and unforeseen disruptions. Historical precedent shows that most investors fail during such phases by making concentrated bets. His core principle is to embrace diversification—holding 15+ high-quality, uncorrelated, and risk-balanced investments. This mathematically improves the risk-return profile, allowing for better returns at the same risk level through engineering, compared to any single concentrated bet. He notes that current equity valuations suggest low-to-negative expected returns, and cautions against conflating excitement for the technology with the attractiveness of the stocks. Ultimately, Dalio advises that knowing when not to bet—acknowledging the limits of one's knowledge—is as vital as knowing when to bet. In an environment of high uncertainty and concentration, a well-constructed, diversified portfolio is the optimal strategy.

链捕手4m ago

Dalio's Major Article: How to Position in the Current Market Environment?

链捕手4m ago

"Nvidia Concept Stock" CoreWeave Co-founder Interview: AI Demand Seems to Intensify Daily

Interview with CoreWeave co-founder Brannin McBee and VP Nick Robbins highlights the ongoing intensification of AI demand, driven by agents, reasoning, and enterprise adoption. They note a structural shift: bottlenecks are expanding beyond GPU availability to powered data center shells, CPU, storage, electrical work, and supply chain execution. CoreWeave, serving top AI labs and cloud giants, emphasizes its customer-driven model to anticipate and build for next-gen needs. The company has redesigned data centers to accommodate more CPU and storage alongside GPUs, anticipating increased demand for Nvidia's Vera CPU and Vera Rubin servers. While GPU procurement remains critical, competition now hinges on comprehensive engineering and delivery capability. CoreWeave leverages its execution track record and performance to compete with hyperscalers and other neocloud providers. Current primary constraint is powered data center shell availability, not components like HBM memory, though cost fluctuations are passed to customers under their contract model. Vera Rubin deployment is expected to ramp significantly through 2027, following a pattern similar to previous Nvidia platforms.

marsbit11m ago

"Nvidia Concept Stock" CoreWeave Co-founder Interview: AI Demand Seems to Intensify Daily

marsbit11m ago

Arthur Hayes' Latest Interview: AI Drains Market Liquidity, BTC Unlikely to Reach $100K This Year

Arthur Hayes recently liquidated major crypto holdings including HYPE, NEAR, Worldcoin, and Zcash. His decision stems from a macro view linking oil prices, the Iran conflict, U.S. politics, and an impending AI bubble peak. He argues that high oil prices, driven by prolonged Middle East tensions, are hurting the Republican Party's election prospects. To salvage the midterms, Hayes predicts Trump may pivot to an anti-AI populist stance—advocating for taxes and regulation—which could deflate the AI investment frenzy. Hayes highlights that the AI sector has absorbed nearly all excess market liquidity via massive debt issuance ($1.5T estimated since ChatGPT's launch), leaving little for crypto. He warns that the upcoming SpaceX IPO (valued at ~$1.8T, 100x sales) and other AI mega-listings are unsustainable and risk a broad market correction. When the AI bubble bursts, he expects high correlation across assets, dragging Bitcoin down too. For now, Hayes holds Treasury bonds and energy stocks. He sees crypto outperforming only after the AI mania cools and liquidity shifts, but not before a significant downturn. In a rapid-fire, he predicts Bitcoin below $100k by year-end and would currently invest new capital in ExxonMobil.

marsbit18m ago

Arthur Hayes' Latest Interview: AI Drains Market Liquidity, BTC Unlikely to Reach $100K This Year

marsbit18m ago

Survival Guide for Crypto KOLs

The article titled "A Survival Guide for Crypto KOLs" discusses the recent ban on "InfoFi" or "post-to-earn" applications (e.g., Kaito, Cookie) on X (formerly Twitter), which effectively ended the era of easy monetization through engagement farming. It attributes this move to X's desire to reclaim platform control over user attention and ad inventory, noting that the incentive structure had degraded content quality and user experience. The piece outlines both the benefits—like a cleaner feed and a focus on genuine content—and the drawbacks, including immediate economic losses for creators and exposed platform risk. It then provides an updated guide for crypto content creators, emphasizing a shift from quantity to quality. Key recommendations include: 1) Doubling down on high-signal, original content (e.g., deep-dive threads, data-driven analysis). 2) Diversifying platforms (e.g., Substack, YouTube) to mitigate dependency on X. 3) Pursuing direct monetization through brand deals, newsletters, and paid communities. 4) Aligning with professional crypto marketing agencies for distribution. 5) Prioritizing reputation, offline networking, and evergreen content. The conclusion frames the ban as a painful but necessary reset for Crypto Twitter, pushing the ecosystem towards more sustainable and professional content creation models where genuine creators can thrive with less noise.

marsbit20m ago

Trading

Spot

Futures

Hot Articles

Hot Tokens Learning Week 7: Privacy Coins Rally in Rotation, with RIVER Standing Out as 2026’s Surprise Performer

The privacy + payments narrative has been the primary catalyst driving rotation and substantial price gains in privacy coins such as DASH and XMR.

16.6k Total ViewsPublished 2026.01.20Updated 2026.01.20

Hot Tokens Learning Week 7: Privacy Coins Rally in Rotation, with RIVER Standing Out as 2026’s Surprise Performer

Hot Tokens Learning Week 8: ADA's Ouroboros Leios Mainnet Expected to Launch in 2026

ADA's Ouroboros Leios mainnet is expected to launch in 2026, and the hard fork to Protocol Version 11 is planned for Q1 2026.

40.6k Total ViewsPublished 2026.02.10Updated 2026.02.12

Hot Tokens Learning Week 8: ADA's Ouroboros Leios Mainnet Expected to Launch in 2026

Hot Tokens Learning Week 14: Glamsterdam Set to Be Ethereum's Most Closely Watched Upgrade in 2026

Ordinals/Runes continue to drive block fee revenue and developer activity, and are seen as the starting point for Bitcoin's "native asset issuance".

26.9k Total ViewsPublished 2026.04.29Updated 2026.04.29

Hot Tokens Learning Week 14: Glamsterdam Set to Be Ethereum's Most Closely Watched Upgrade in 2026

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of S (S) are presented below.

Hot Categories

Regulatory Policy1,803 news items