From "Manual Rules" to "AI Mind Reading": X's New Algorithm Reshapes the Information Flow, More Accurate and More Dangerous

Bitpush | Published on 2026-01-20 | Last updated on 2026-01-20

Abstract

Elon Musk's X (formerly Twitter) has transitioned from a recommendation system based on "manually stacked rules and heuristic algorithms" to one that relies entirely on a large AI model to predict user preferences. The new algorithm, which powers the "For You" timeline, mixes content from accounts a user follows with posts from across the platform that the AI believes the user will like. The process begins by building a user profile based on historical interactions (likes, retweets, dwell time) and user features (following list, preferences). The system then gathers candidate posts from two sources: the user's direct network ("Thunder") and a broader network of potentially interesting content from strangers ("Phoenix"). After data hydration and an initial filtering step to remove duplicates, old posts, or content from blocked authors, the core scoring process begins. A Transformer model (Phoenix Grok) predicts the probability of a user taking various positive actions (like, retweet, reply, click) or negative ones (block, mute, report) on each post. A final score is calculated by weighting these probabilities. An Author Diversity Scorer is then applied to reduce the visibility of multiple posts from the same author in a single batch. The highest-scoring posts undergo a final filter to remove policy-violating content and duplicates from the same thread before being sorted into the user's feed. The shift represents a move from "telling the machine what to do" to "letting the machine learn ...

Written by: KarenZ, Foresight News

Original title: Plain Language Breakdown of X's New Recommendation Algorithm: From "Data Fishing" to "Scoring"


Has Musk changed Twitter's recommendation system from "manually stacking rules and mostly heuristic algorithms" to "relying purely on large AI models to guess what you like"?

On January 20, Twitter (X) officially disclosed the new recommendation algorithm, which is the logic behind the "For You" timeline on the Twitter homepage.

Simply put, the current algorithm mixes "content posted by people you follow" with "content from the entire network that might suit your taste," then ranks it by how appealing it is likely to be to you, judged from your previous actions on X such as likes and comments. After two rounds of filtering, the result is the recommended feed you see.

Below is the core logic translated into plain language:

Building a Profile

The system first collects the user's contextual information to build a "profile" for subsequent recommendations:

  • User behavior sequence: Historical interaction records (likes, retweets, dwell time, etc.).

  • User features: Follow list, personal preference settings, etc.
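
As a rough illustration of what such a profile might hold, here is a minimal sketch. The class and field names (`Interaction`, `UserContext`, `recent_actions`) are hypothetical and chosen for this example only; they are not taken from X's published code.

```python
from dataclasses import dataclass, field


@dataclass
class Interaction:
    """One historical action on a post (hypothetical structure)."""
    post_id: str
    action: str                 # e.g. "like", "retweet", "dwell"
    dwell_seconds: float = 0.0  # only meaningful for "dwell" events


@dataclass
class UserContext:
    """The 'profile' handed to later stages: features plus behavior sequence."""
    user_id: str
    following: set[str] = field(default_factory=set)
    muted_keywords: set[str] = field(default_factory=set)
    history: list[Interaction] = field(default_factory=list)

    def recent_actions(self, limit: int = 3) -> list[str]:
        # Most recent action names, oldest first within the window.
        return [item.action for item in self.history[-limit:]]


ctx = UserContext(
    user_id="u1",
    following={"alice", "bob"},
    history=[Interaction("p1", "like"), Interaction("p2", "dwell", 12.5)],
)
print(ctx.recent_actions())
```

The point of the sketch is only that the behavior sequence and the static features travel together as one context object.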

Where does the content come from?

Every time you refresh the "For You" timeline, the algorithm fetches content from the following two sources:

  • Inner Circle (Thunder): Tweets from people you follow.

  • Outer Circle (Phoenix): Posts from people you don't follow that the AI, based on your taste, fishes out of the vast sea of content because you might be interested in them.

These two piles of content are mixed together to form the candidate tweets.
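
The mixing step can be pictured with a short sketch. It assumes each post is a plain dictionary and that the two retrieval results arrive as lists; the function name `gather_candidates` and the `source` tag are illustrative assumptions, not names from X's code.

```python
def gather_candidates(followed_posts, discovered_posts):
    """Combine in-network and out-of-network posts into one candidate pool.

    Each post is copied and tagged with where it came from, so later
    stages can still tell the two sources apart.
    """
    candidates = [{**post, "source": "thunder"} for post in followed_posts]
    candidates += [{**post, "source": "phoenix"} for post in discovered_posts]
    return candidates


pool = gather_candidates(
    followed_posts=[{"id": "1", "author": "alice"}],
    discovered_posts=[{"id": "2", "author": "stranger"}],
)
print([(p["id"], p["source"]) for p in pool])
```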

Data Completion and Preliminary Filtering

After fishing up thousands of posts, the system pulls the complete metadata of each post (author information, media files, core text). This process is called Hydration. It then performs a quick cleaning round, eliminating duplicate content, old posts, the user's own posts, content from blocked authors, and content containing muted keywords.

This step is to save computing resources and prevent invalid content from entering the core scoring phase.
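
A minimal sketch of this cleaning round follows. It assumes hydrated posts are dictionaries with `id`, `author`, `text`, and `created_at` keys, and the two-day age cutoff is an invented placeholder; the article does not say what "old" means in practice.

```python
from datetime import datetime, timedelta, timezone


def prefilter(posts, user_id, blocked_authors, muted_keywords, now,
              max_age=timedelta(days=2)):  # cutoff is an assumption
    """Drop duplicates, stale posts, self-posts, blocked authors, muted words."""
    seen_ids = set()
    kept = []
    for post in posts:
        text = post["text"].lower()
        if post["id"] in seen_ids:
            continue  # duplicate
        if now - post["created_at"] > max_age:
            continue  # too old
        if post["author"] == user_id:
            continue  # the user's own post
        if post["author"] in blocked_authors:
            continue  # blocked author
        if any(word.lower() in text for word in muted_keywords):
            continue  # contains a muted keyword
        seen_ids.add(post["id"])
        kept.append(post)
    return kept


now = datetime(2026, 1, 20, tzinfo=timezone.utc)
sample = [
    {"id": "1", "author": "alice", "text": "Hello", "created_at": now},
    {"id": "1", "author": "alice", "text": "Hello", "created_at": now},
    {"id": "2", "author": "bob", "text": "Free GIVEAWAY", "created_at": now},
]
print([p["id"] for p in prefilter(sample, "me", set(), {"giveaway"}, now)])
```

Every check here is cheap compared with running a model, which is why it makes sense to apply them before scoring.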

How is scoring done?

This is the most crucial part. The Transformer model based on Phoenix Grok scrutinizes each remaining candidate post after filtering and calculates the probability of you performing various actions on it. It's a game of adding and subtracting points:

Plus points (Positive feedback): The AI thinks you are likely to like, retweet, reply, click on the image, or click to view the profile.

Minus points (Negative feedback): The AI thinks you are likely to block the author, mute, or flag the post.

Final Score = (Like probability × weight) + (Reply probability × weight) – (Block probability × weight)...
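
The formula above is a weighted sum, which can be sketched as follows. The weight values are invented for illustration; the article does not disclose the real ones. Negative actions simply carry negative weights.

```python
# Hypothetical weights: positive actions add, negative actions subtract.
WEIGHTS = {
    "like": 1.0,
    "reply": 2.0,
    "retweet": 1.5,
    "block": -5.0,
    "mute": -3.0,
    "report": -8.0,
}


def final_score(probabilities, weights=WEIGHTS):
    """Sum of (predicted probability x weight) over the known actions."""
    return sum(
        weights[action] * prob
        for action, prob in probabilities.items()
        if action in weights  # ignore actions without a weight
    )


predicted = {"like": 0.5, "reply": 0.1, "block": 0.02}
print(round(final_score(predicted), 4))
```

With these made-up numbers, a 2% chance of a block costs as much as a 10% chance of a like earns, which shows how strongly negative signals can be made to count.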

It is worth noting that in the new recommendation algorithm, the Author Diversity Scorer usually intervenes after the AI calculates the final score. When it detects multiple pieces of content from the same author in a batch of candidate posts, this tool automatically "downgrades" the score of that author's subsequent posts, making the authors you see more diverse.

Finally, sort by score and pick the batch of posts with the highest scores.
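
One way to picture the downgrade is a per-author decay: each additional post by the same author in a batch has its score multiplied by a shrinking factor. This is a sketch of that idea only; the decay value and the function name `apply_author_diversity` are assumptions, and the real scorer may work differently.

```python
def apply_author_diversity(scored_posts, decay=0.5):
    """Down-weight an author's 2nd, 3rd, ... post within one batch.

    Assumes non-negative scores; the decay factor is a made-up value.
    """
    seen_count = {}
    adjusted = []
    # Walk from best to worst so an author's top post keeps its full score.
    for post in sorted(scored_posts, key=lambda p: p["score"], reverse=True):
        n = seen_count.get(post["author"], 0)
        adjusted.append({**post, "score": post["score"] * decay ** n})
        seen_count[post["author"]] = n + 1
    return sorted(adjusted, key=lambda p: p["score"], reverse=True)


batch = [
    {"id": "a", "author": "alice", "score": 1.0},
    {"id": "b", "author": "alice", "score": 0.9},
    {"id": "c", "author": "bob", "score": 0.6},
]
print([p["id"] for p in apply_author_diversity(batch)])
```

In this example the second post by the same author drops below the other author's post even though its raw score was higher.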

Secondary Filtering

The system re-checks the top-scoring posts, filters out violations (such as spam, violent content), deduplicates multiple branches of the same thread, and finally arranges them in order from highest to lowest score, becoming the information flow you see.
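
A minimal sketch of this last pass, assuming each post already carries a `violation` flag from some upstream check and a `thread_id`. Both field names, and the choice to keep the highest-scoring branch of each thread, are assumptions made for this example.

```python
def final_selection(posts, limit=3):
    """Drop flagged posts, keep one post per thread, sort high to low."""
    best_per_thread = {}
    for post in posts:
        if post.get("violation"):
            continue  # flagged as spam, violent content, etc.
        current = best_per_thread.get(post["thread_id"])
        if current is None or post["score"] > current["score"]:
            best_per_thread[post["thread_id"]] = post
    ranked = sorted(best_per_thread.values(),
                    key=lambda p: p["score"], reverse=True)
    return ranked[:limit]


top = final_selection([
    {"id": "1", "thread_id": "t1", "score": 0.9},
    {"id": "2", "thread_id": "t1", "score": 0.7},
    {"id": "3", "thread_id": "t2", "score": 0.8, "violation": True},
    {"id": "4", "thread_id": "t3", "score": 0.5},
])
print([p["id"] for p in top])
```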

Summary

X has removed all manually designed features and most heuristic algorithms from the recommendation system. The core advancement of the new algorithm lies in "letting the AI autonomously learn user preferences," achieving a leap from "telling the machine what to do" to "letting the machine learn how to do it itself."

First, recommendations are more accurate, and "multi-dimensional prediction" fits real needs better. The new algorithm relies on the Grok large model to predict various user behaviors—not only calculating "whether you will like/retweet" but also calculating "whether you will click the link to view," "how long you will stay," "whether you will follow the author," and even predicting "whether you will report/block." This refined judgment allows the recommended content to fit users' subconscious needs with unprecedented precision.

Second, the mechanism is relatively fairer and can, to some extent, break the curse of "big account monopoly," giving new and small accounts more opportunities. The old "heuristic algorithm" had a fatal problem: big accounts, relying on historically high interaction volumes, could get high exposure no matter what they posted, while new accounts, even with high-quality content, were buried due to "lack of data accumulation." The candidate isolation mechanism scores each post independently, regardless of "whether other content in the same batch is a hit." At the same time, the Author Diversity Scorer limits how much one author can flood a single batch with follow-up posts.

For X the company, this is a cost-cutting, efficiency-boosting measure: it replaces manpower with computing power and uses AI to improve retention. For users, we are dealing with a "super brain" that constantly tries to read our minds. The more it understands us, the more we rely on it. But precisely because it understands us too well, we will sink deeper into the "information cocoon" woven by the algorithm and become more easily targeted by emotionally charged content.


Original link: https://www.bitpush.news/articles/7604412

Related Questions

Q: What is the core change in X's new recommendation algorithm compared to the old system?

A: The core change is shifting from 'manually designed rules and mostly heuristic algorithms' to a system that 'relies purely on AI large models to guess user preferences', allowing the AI to autonomously learn user preferences.

Q: From which two sources does the new algorithm gather candidate content for a user's 'For You' timeline?

A: It gathers content from the 'Thunder' circle (posts from people the user follows) and the 'Phoenix' circle (posts from accounts the user doesn't follow but that the AI predicts they might be interested in).

Q: What is the purpose of the 'Author Diversity Scorer' in the new algorithm?

A: The Author Diversity Scorer detects when multiple posts from the same author are in a batch of candidate content and automatically lowers the score of that author's subsequent posts to ensure the user sees a more diverse range of authors.

Q: How does the AI model determine the final score for a piece of content?

A: A Transformer model calculates the probability of the user performing various actions on the content. It adds points for predicted positive feedback (like, retweet, reply) and subtracts points for predicted negative feedback (block, mute, report), with each action weighted. The final score is the sum of these weighted probabilities.

Q: What are two main potential consequences for users mentioned in the article regarding the new algorithm?

A: The consequences are: 1) More accurate and personalized content that better fits the user's subconscious needs. 2) A deeper entrapment in an 'information cocoon' and a higher likelihood of being precisely targeted by emotional content because the algorithm understands them so well.
