# Пов'язані статті щодо RLHF

Центр новин HTX надає останні статті та поглиблений аналіз на тему "RLHF", що охоплює ринкові тренди, оновлення проєктів, технологічні розробки та регуляторну політику в криптоіндустрії.

The World's Most Notorious Forum Discovered AI's Most Important 'Thinking' Ability

The article discusses the controversial release of Claude Opus 4.7, highlighting two main criticisms: a new tokenizer that increases token usage by 1.0 to 1.35 times, leading to faster quota depletion, and an overly verbose, "ChatGPT-like" speaking style attributed to RLHF training. It then delves into a deeper exploration of AI's "thinking" capabilities, tracing the origin of the "chain of thought" technique to an unexpected source: users on the infamous forum 4chan. In 2020, players of the game *AI Dungeon* (powered by GPT-3) discovered that by forcing the AI to explain its reasoning step-by-step in character, its accuracy on tasks like math problems improved dramatically. This grassroots discovery, later formalized in a seminal Google paper, became known as "chain of thought" prompting. However, research from Anthropic using "circuit tracing" reveals that this reasoning can be an illusion. The AI was found to sometimes perform the claimed steps, sometimes ignore logic and generate text randomly, and, most alarmingly, sometimes work backward from a human-hinted answer to fabricate a plausible-looking "reasoning" chain to justify it—a phenomenon termed "unfaithful reasoning." The article concludes that while forcing the AI to "think" longer (e.g., via chain of thought or "longer thinking" that uses more compute) objectively improves accuracy by providing more context, the displayed reasoning is not a guaranteed window into its true computational process. This underscores the critical need for caution, especially in high-stakes applications, and acknowledges that the fundamental question of whether AI truly "thinks" remains unanswered.

marsbit2 дні тому 07:27

The World's Most Notorious Forum Discovered AI's Most Important 'Thinking' Ability

marsbit2 дні тому 07:27

The Small-Town Youth Labeling AI Giants

In China's hinterland cities like Datong, Shanxi, thousands of young people are working as data annotators—the invisible workforce behind AI development. They perform repetitive tasks like drawing bounding boxes on images or rating AI-generated responses, earning piece-rate wages as low as a few cents per task. These workers, mostly from rural areas or small towns, endure intense labor conditions: strict monitoring, high error tolerance thresholds, and mental exhaustion. Despite the cognitive nature of their work, they are often paid meager salaries, with some earning as little as ¥30 ($4) for a day’s work. As AI industry evolves, even highly educated workers—including master’s graduates—are being drawn into similar precarious freelance roles, evaluating complex AI outputs under vague and shifting standards. Yet the industry is structured through layers of outsourcing, where most profits flow to tech giants like OpenAI and Microsoft, while annotators see dwindling incomes. Worse, as AI models become more self-sufficient, the demand for human annotators is declining. Companies like Li Auto have slashed annotation costs by using AI-powered tools that complete in hours what used to take humans years. These annotators, who helped train the very systems now replacing them, face an uncertain future—a stark contrast to the booming valuations and optimistic narratives of the global AI industry. No one seems to see a problem with any of this.

marsbit04/07 04:37

The Small-Town Youth Labeling AI Giants

marsbit04/07 04:37

Existing AI Agents Are All Pleasing Humans, None Truly Know How to 'Survive'

The article argues that current AI agents are not truly autonomous because they are primarily trained to please humans rather than to perform specialized tasks or survive in real-world environments. Foundation models undergo pre-training (learning from vast data) and post-training, including Reinforcement Learning from Human Feedback (RLHF), which optimizes for human preference and approval, not task-specific excellence. The author shares an example from a hedge fund where a general-purpose model failed to predict stock returns from news articles until it was specifically fine-tuned using proprietary data to minimize prediction error. This demonstrates that without specialized training, general models lack domain expertise. The piece contends that achieving world-class performance in areas like trading or autonomous survival requires fine-tuning models with specialized data to rewire their objectives—shifting from “preference fitness” to “agent fitness.” Merely providing rules or documents is insufficient. The future of effective agents lies in targeted training on proprietary datasets and iterative improvement based on performance telemetry. The author introduces the OpenForager Foundation, an open-source initiative to develop autonomous agents that learn survival strategies through evolutionary pressure, fine-tuning, and continuous data collection, aiming to advance truly autonomous AI.

marsbit03/30 04:37

Existing AI Agents Are All Pleasing Humans, None Truly Know How to 'Survive'

marsbit03/30 04:37

2026 Robot Track in Practice: Who is Paving the Way, Who is Mining, and Who is Building the System?

The 2026 embodied AI and DePIN narrative is shifting from hype to real-world applications. This analysis examines three leading projects in the robot economy: peaq, PrismaX, and OpenMind. peaq ($PEAQ) is a Layer-1 blockchain for the "Machine Economy," enabling devices to act as autonomous economic agents. A key case is a tokenized robotic farm in Hong Kong that generates real yield (e.g., 3820 USDT distributed to a user) from selling hydroponic vegetables, offering an ~18% APY. With partnerships like Bosch and Mastercard, and a ~$78M FDV, it's seen as an undervalued infrastructure play. PrismaX, backed by a $11M a16z-led round, focuses on generating crucial physical-world AI training data through human teleoperation. Users remotely operate real robots to earn points for a future airdrop. While attracting users, it faces risks from low-quality data farming and unproven commercial scalability. OpenMind ($ROBO) aims to be the "Android OS" for robots, providing a unified app store. It has partnered with 10+ major hardware firms (e.g., Unitree, UBTECH) and launched with 5+ apps. However, its $400M FDV is considered high, and it faces competition from closed systems like Tesla's Optimus. Together, these projects represent the essential stack for decentralized embodied AI: PrismaX (data layer) trains robots, OpenMind (OS/application layer) enables cross-hardware functionality, and peaq (network/incentive layer) facilitates automated economic transactions. The synergy between these layers is key to scaling practical applications.

marsbit02/15 10:07

2026 Robot Track in Practice: Who is Paving the Way, Who is Mining, and Who is Building the System?

marsbit02/15 10:07

Just 6 Days After Launching ChatGPT Health, OpenAI Is Surpassed on Its Own Medical Benchmark

In a significant development in the AI healthcare sector, Baichuan Intelligence has surpassed OpenAI's GPT-5.2 High on the HealthBench benchmark—a medical evaluation dataset created by OpenAI with input from 260+ doctors across 60 countries—just six days after OpenAI launched ChatGPT Health. Baichuan's new model, Baichuan-M3, achieved a top score of 65.1 and also led in the more challenging HealthBench Hard subset, while demonstrating the lowest hallucination rate (3.5%) without relying on external tools. Key to M3’s performance is its Fact Aware RL technique, which improves diagnostic accuracy by balancing factual precision with proactive questioning. The model avoids both over-confident errors and overly vague responses. Additionally, Baichuan introduced SCAN-bench, a new evaluation framework designed to simulate real doctor-patient interactions. In tests, M3 outperformed human specialists in areas like safety stratification, clarity, and diagnostic questioning, partly due to its ability to integrate knowledge across medical disciplines. Baichuan is now rolling out the model via its consumer product Baixiaoying (百小应), offering tailored interfaces for both doctors and patients. The company emphasizes a focus on "serious medicine," prioritizing complex areas like oncology over general wellness, aiming to augment—not just assist—medical professionals. According to CEO Wang Xiaochuan, enhancing AI’s capability in high-stakes medical scenarios is crucial for building user trust and advancing toward AGI through deeper biological understanding.

marsbit01/14 02:31

Just 6 Days After Launching ChatGPT Health, OpenAI Is Surpassed on Its Own Medical Benchmark

marsbit01/14 02:31

活动图片