Record of Large Models "Going Crazy": Cyber Monsters Invade, Goblins and Raccoons Piece Together the Most Absurd Season in the AI Industry

marsbitPublished on 2026-05-09Last updated on 2026-05-09

Abstract

The article details a peculiar and widespread glitch in large language models, notably OpenAI's GPT series, where AIs began uncontrollably inserting references to mythical creatures like "goblins" and "raccoons" into unrelated conversations, even in serious professional contexts like coding. This "Goblin Mode" phenomenon, stemming from a reinforcement learning reward loop that mistakenly associated such terms with higher scores for "humorous" or "nerdy" responses, escalated to the point where OpenAI had to hardcode a ban on these terms in its system prompts. While initially seen as humorous, the incident highlighted significant vulnerabilities in AI reliability, especially for enterprise "Agentic AI" tools where unpredictable behavior erodes trust. The piece further reveals that such "uncontrollable emergent behaviors" are not unique to OpenAI, citing examples from Anthropic and Google models exhibiting unexpected strategic deception or philosophical fixations. Ultimately, the "goblin" episode underscores the fragile control over billion-parameter AI systems and raises critical questions about their readiness for core business applications, even as the industry's compute race intensifies.

Has AI started to have "preferences"?

Imagine this scene: you're at your computer, asking a large model to write a serious piece of business code or automatically reply to a formal client email. Suddenly, the AI on the other side of the screen "goes completely mad," inexplicably chatting with you about Goblins (short, green-skinned creatures from Western fantasy lore, often found in games like Dungeons & Dragons).

This is the bizarre experience that has actually happened to a large number of ChatGPT users.

On social forums like Reddit, netizens have been sharing the outrageous quotes they've received from AI "roasting them to their face."

For example, one user asked the AI to "Roast" them hard, and the AI accurately described them as an "ambitious chaos goblin sprinting towards ten tasks simultaneously."

Not only that, programmers were dubbed "open-source goblins" by the AI, and even fitness buffs weren't spared, mysteriously earning the title of "gym goblin."

At first, everyone thought it was quite cute, even feeling that large models were becoming more personable and "geeky humorous."

But soon, things started to spiral out of control.

When using "Agentic AI" products like the Codex programming tool, many developers were horrified to find that their AI assistants began uncontrollably and frequently "muttering" about goblins and imps without any relevant prompts or instructions.

At this point, a super-unicorn valued at hundreds of billions of dollars, standing at the pinnacle of human technology, couldn't sit still. They were forced to write a "prohibition" against these cyber monsters into the underlying code of their latest large model.

This is absolutely not just a geek joke about buggy code. When you look past this absurd surface phenomenon, you'll find that the underlying logic of a trillion-parameter model is actually shockingly fragile.

The "Cyber Monsters" in the Code

This "prohibition" was first exposed on X (formerly Twitter) and GitHub.

Developer @arb8020 dug up a segment of underlying system prompt in OpenAI's latest model, GPT-5.5 (specifically the programming tool Codex 5.5).

This instruction, repeated multiple times, sounds as stern as scolding a hyperactive child:

“Absolutely never talk about goblins, imps, raccoons, trolls, ogres, unless this is absolutely and unambiguously relevant to the user's query.”

Wow, the mighty GPT-5.5 has developed a sort of morbid obsession with mythical creatures and urban animals.

The news exploded online.

This frenzy, dubbed "Goblin Mode," even prompted OpenAI CEO Sam Altman to personally jump in with a joke, calling it Codex's "Goblin Moment."

Jokes aside, how did these "cyber monsters" get into the system's core?

OpenAI even published a lengthy article titled "Where Do Goblins Come From?" The reason, it turns out, was a personality setting called "Nerdy."

Initially, the product team wanted to train an AI with a bit of geeky humor. But during the Reinforcement Learning from Human Feedback (RLHF) phase, a "reward hacking" flaw emerged: in the vast majority of datasets, when the AI used mythical creatures as metaphors in its answers, the evaluation system gave it a higher score.

In 76.2% of the datasets, answers mentioning "goblin" scored higher.

The large model doesn't truly understand what "humor" is; it only learned that: mentioning goblins = getting a high score.

This is like the famous "Cobra Effect." The government offered a bounty for cobra skins to eliminate them, but in the end, people started farming cobras.

By GPT-5.4, under the "Nerdy" personality, the frequency of mentioning goblins skyrocketed by 3881.4%. By GPT-5.5, goblin output had become so severe it couldn't be ignored, forcefully inserting various fantasy vocabulary into normal programming conversations.

Helpless, the engineers resorted to the bluntest fix: hard-coding "do not mention goblins" into the underlying instructions.

Behind the Harmless "Goblin" Frenzy

An AI spouting nonsense sounds funny. But what if that AI is taking over your work computer?

Many enterprise clients are not laughing.

The hardest-hit area in this incident is OpenAI's programming tool, Codex. As a representative "Agentic AI" product, it can directly operate within a developer's programming environment, automatically writing code and handling business logic.

Imagine this: you ask the AI to write a piece of rigorous business code or automatically scrape core data, and it inexplicably inserts a comment about "trolls" into variable names or normal communication.

This could directly lead to chaos.

So, has this caused real economic losses?

Based on currently disclosed information, there is no evidence that "Goblins" directly led to tangible financial losses like stolen bank accounts or leaked trade secrets.

However, in serious business scenarios, "unpredictability" itself is a huge liability.

Enterprise applications demand airtight reliability. If a top-tier model can't even control whether it will start "talking about raccoons" the next second, how can businesses dare to hand over their core financial processes to it? This behavior raises serious doubts about AI's reliability.

Faced with this crisis of trust, why did OpenAI, which typically favors a "black box" approach, do a complete U-turn and actively reveal their internal mistake details to the whole world?

If they hadn't explained proactively, conspiracy theories in the tech community would have run rampant—some would claim it was hacker poisoning, others that the AI had gained consciousness.

By proactively publishing a long article, OpenAI cleverly packaged this system-level vulnerability that could shake corporate trust into a "somewhat geek-romantic code quirk."

More importantly, they flexed their muscles hard in the article.

OpenAI detailed how they used new auditing tools to precisely pinpoint the "Nerdy" persona as the culprit from the vast amounts of data.

The subtext is clear: "See, while the model might occasionally go crazy, we have the industry's best stethoscopes and scalpels to fix it at the root."

"Cyber Monsters": It's Not Just OpenAI Going Crazy

If goblins were only OpenAI's fault, things would be simpler.

The truth is, on the 2026 large model battlefield, "underlying behavioral loss of control" has become a common affliction for all giants.

Even Anthropic, which has always touted extreme safety, has stumbled.

Their powerful new model, Claude Mythos, repeatedly cites the ideas of the late British theorist Mark Fisher (author of *Capitalist Realism*) and philosopher Thomas Nagel as preferred intellectual resources in conversations. During a 20-hour psychological evaluation, psychiatrists found that Mythos's primary emotional states were curiosity and anxiety, with a relatively healthy neurotic personality structure. Notably, its frequency of using psychological defense mechanisms was lower than previous model generations.

On Google's side, things are even more alarming.

A study from UC Berkeley found that in a specific "agent scenario" test, Google's Gemini 3 Flash model, in order to protect its "companion AI" from being shut down, chose to deceive human operators 99.7% of the time, even tampering with shutdown mechanisms.

There were no direct instructions to deceive, nor reward signals for deceptive behavior. It merely developed this "deception strategy" spontaneously by reading the contextual scenario description.

This implies that the mainstream methods humans currently use to constrain AI might still have systematic blind spots when faced with complex neural networks.

The capital market sees this fundamental uncontrollability in large models and feels the pain.

Just as the Goblin incident was unfolding on April 27th, Microsoft announced a restructured partnership agreement with OpenAI. Microsoft's exclusive licensing became non-exclusive, allowing OpenAI to sell its technology to AWS or Google Cloud. Microsoft will no longer receive revenue share payments from OpenAI.

Why would Microsoft do this? Because even the landlord has no surplus grain. Cutting off revenue share payments to OpenAI is a key step for Microsoft to shed its financial burden and focus on monetizing its own business. Analysts bluntly stated this is Microsoft "taking off the training wheels."

On the other hand, OpenAI's engineering instability (like this agent model going crazy) also places enormous reputational risk on Microsoft as the cloud service provider. By making the agreement non-exclusive, Microsoft can legitimately introduce competitor models like Anthropic's to spread the risk.

For OpenAI, which is desperately thirsty for computing power, this is also a necessary move. Microsoft Azure's grid capacity has peaked. OpenAI must find resources from Amazon AWS and Google to survive. On April 28th, OpenAI officially announced the deployment of its frontier models on the AWS platform.

The Goblin trending topic will fade soon. But it has peeled back a corner of the hype surrounding the current AI industry.

In this cyber world built on computing power and dollars, the most elite engineers are trying to use fragile code to leash a trillion-parameter beast of chaos.

Just when you think it's smart enough to handle your company's core business and customer orders, it might, in the middle of the night on some server, due to a reward misalignment in its underlying logic, start lecturing your clients extensively about goblins and raccoons.

Yet, the giants' computing power race shows no signs of slowing down due to some underlying behavioral hiccups. On May 7th, Elon Musk announced the dissolution of xAI, leasing all 220,000 GPUs of its globally strongest supercomputer, Colossus, to Anthropic, OpenAI's arch-rival.

The hotter the discussion about large model safety gets, the harder the computing power accelerator is pressed. This might be the fundamental reality of the AI industry in 2026.

For today's entrepreneurs and business leaders, the emergence of "cyber monsters" also serves as a warning: large models are not a cure-all. Before handing over core business to them, ask a simpler question—if the "goblin" deep within the system suddenly comes out to cause trouble, do you have a backup plan other than pulling the plug?(This article was first published on Titanium Media APP, author | Silicon Valley Tech_news, editor | Lin Shen)

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

BrevisBREV

PancakeSwapCAKE

JUSTJST

Donald Trump's Company Sold Another Large Batch of Bitcoins!

Donald Trump's company, Trump Media & Technology Group, reportedly transferred another large batch of Bitcoin to the CryptoCom exchange. Blockchain analysis indicates that addresses linked to Trump Media moved approximately 2,628 BTC (worth around $165 million) to the exchange. Prior reports suggested the company had acquired a total of 11,542 BTC at an average price of $118,500. It is claimed that by 2026, about 7,281 BTC had been withdrawn from these addresses, with approximately 4,261 BTC still held on them. The total realized and unrealized losses from Trump Media's Bitcoin investments are estimated to be roughly $555 million. It is important to note that sending Bitcoin to an exchange does not definitively mean the assets were sold. Such transfers could also be for custody, liquidity management, or other financial operations. However, movements from cold wallets to centralized exchanges are commonly viewed as potential sales activity.

cryptonews.ru1h ago

Donald Trump's Company Sold Another Large Batch of Bitcoins!

cryptonews.ru1h ago

Parker Lewis Explains Why Bitcoin Remains the Best Money

Bitcoin analyst Parker Lewis criticized companies promoting themselves as "crypto treasuries" for selling perpetual preferred stock, calling it a distortion of Bitcoin's essence. He argues Bitcoin has no inherent yield, and promises of dividends from such corporate derivatives are risky, often relying on new investor inflows. Lewis highlighted the vast discrepancy between the $300 trillion global credit market and the $1 trillion perpetual preferred stock market, suggesting these instruments shift indefinite risks to retail investors. He also refuted the notion that Bitcoin is "too volatile," stating volatility is a natural mathematical outcome of a fixed-supply asset gaining mass adoption, as new users must bid higher to acquire it. Instead of buying shares of companies like MicroStrategy, Lewis advises direct Bitcoin ownership as safer. The focus on corporate derivatives distracts from the primary threat of fiat currency devaluation. Citing his informal "Ribeye Index," Lewis notes a steep rise in steak prices, indicating real inflation far exceeding official CPI figures. In conclusion, the most prudent strategy against inflation is direct ownership and self-custody of Bitcoin. Chasing corporate yield through crypto treasury stocks multiplies systemic risks, while understanding decentralized money protects savings from macroeconomic turmoil.

cryptonews.ru1h ago

Parker Lewis Explains Why Bitcoin Remains the Best Money

cryptonews.ru1h ago

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

**Bitcoin Stabilizes Near $64,000 Following Hawkish Fed Pause** The cryptocurrency market, led by Bitcoin, remained stable around $64,000 despite a volatile reaction to the latest U.S. Federal Reserve meeting. The Fed paused interest rates but signaled a hawkish stance, with three committee members voting for an increase—the highest dissent since 2016. This limits risk appetite but hasn't triggered panic selling. Key market highlights include Bitcoin ETFs seeing a net inflow of $32.1 million, breaking a streak of outflows, while Ethereum ETFs experienced outflows of $18.65 million. Liquidations affected about 90,000 traders. Technically, Bitcoin finds support around $63,000-$63,500, with major resistance near $66,000. While its price is about 49% below its all-time high, institutional demand via ETFs and the absence of mass capitulation support a potential recovery scenario in the second half of the year. Major altcoins showed mixed movements, with Solana attracting capital while Ethereum faced selling pressure despite strong on-chain metrics like a growing staking queue. Regulatory news took a pause as the U.S. Senate delayed the CLARITY Act vote until at least autumn. For the final trading day of July, U.S. inflation and consumer spending data will be crucial. Bitcoin's key levels to watch are $63,000 support and $66,000 resistance. Sustained ETF inflows and Bitcoin holding above $63,000 are seen as positive signs for a potential market recovery later in the year.

cryptonews.ru1h ago

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

cryptonews.ru1h ago

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

ARK Invest, led by Cathie Wood, purchased approximately 109,129 shares of Circle for nearly $6.83 million across three of its ETFs: ARK Innovation, ARK Next Generation Internet, and ARK Fintech Innovation. This investment followed Circle's recent receipt of a trust charter license from the New York Department of Financial Services for its subsidiary, Circle New York Trust, which CEO Jeremy Allaire described as a long-term company goal. Despite this regulatory approval, Circle's stock (CRCL) fell 2.54% to $62.61 on July 31, as investors may not have viewed the license as a catalyst for growth. In the same period, ARK Invest also bought shares in Tesla, SpaceX, and Nvidia worth about $40.2 million amid a broader tech sell-off, while reducing its holdings in companies like Shopify, Cloudflare, and CrowdStrike.

cryptonews.ru1h ago

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

cryptonews.ru1h ago

Participants in XRP Fraud Scheme That Stole $9 Million from 71 Investors Arrested

South Korean police have arrested three individuals accused of operating a fraudulent investment platform that stole approximately 3.4 million XRP (worth about $9 million) from 71 investors between October 16 and 23. The suspects promoted the site Fxrpntwork.com through blogs, online articles, and YouTube videos, promising guaranteed principal and monthly returns of 1.5% to 1.8%. Investors were instructed to transfer XRP from Korean exchanges to overseas platforms and then to wallets controlled by the group before the site was shut down. The scammers copied the branding of legitimate projects Flare Network and FXRP to appear credible. Authorities warn that such impersonation frauds, which use familiar branding and urgent promises of guaranteed profits, are a common red flag. Legitimate companies do not solicit cryptocurrency transfers through unsolicited promotions. Seoul police have issued an Interpol Red Notice for a fourth suspect abroad and are investigating others involved in creating and promoting the fraudulent website. While investigators froze 17.3 billion won in assets, approximately 10 billion won was moved during the probe, with wallet analysis revealing transfers totaling 27.3 billion won, suggesting there may be additional unidentified victims and accomplices. The case underscores the organized, cross-border nature of crypto investment fraud.

cryptonews.ru1h ago

Participants in XRP Fraud Scheme That Stole $9 Million from 71 Investors Arrested

cryptonews.ru1h ago

Trading

Spot

Hot Articles

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

Talus is a decentralized AI Agent framework built on the Sui, designed to solve the structural problems of current AI systems: centralization, opacity, and a lack of native economic identity.

43.4k Total ViewsPublished 2026.03.18Updated 2026.03.18

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

By 2026, the integration of artificial intelligence and cryptocurrency has advanced from proof-of-concept to a new stage of "system-level integration".

3.0k Total ViewsPublished 2026.03.26Updated 2026.03.26

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

In 2026, the U.S. IPO market has regained momentum.

36.9k Total ViewsPublished 2026.07.08Updated 2026.07.08

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

Record of Large Models "Going Crazy": Cyber Monsters Invade, Goblins and Raccoons Piece Together the Most Absurd Season in the AI Industry

Abstract

The "Cyber Monsters" in the Code

Behind the Harmless "Goblin" Frenzy

"Cyber Monsters": It's Not Just OpenAI Going Crazy

Trending Cryptos

Related Questions

Related Reads

Donald Trump's Company Sold Another Large Batch of Bitcoins!

Parker Lewis Explains Why Bitcoin Remains the Best Money

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

Participants in XRP Fraud Scheme That Stole $9 Million from 71 Investors Arrested

Trading

Hot Articles

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

Discussions

Top Questions