GitHub Announces Default Use of Copilot User Data for AI Model Training Starting April 24

marsbitОпубликовано 2026-03-26Обновлено 2026-03-26

Введение

GitHub has announced an update to its repository policy, effective April 24, 2026, allowing the use of user interaction data to train its AI models. The data collection will include users of Copilot Free, Pro, and Pro+, covering model inputs and outputs, code snippets, contextual information, repository structures, and chat logs. According to GitHub’s Chief Product Officer Mario Rodriguez, the move aims to enhance the accuracy and security of the model’suggestions, with internal Microsoft tests already showing improved acceptance rates. The policy follows an opt-out model, meaning affected users must manually disable data sharing in their privacy settings, sparking debate within the developer community over data ownership and the definition of private repositories. Copilot Business, Enterprise, and educational users are currently exempt due to contractual terms. GitHub defended the change as consistent with industry practices adopted by companies like Anthropic, JetBrains, and Microsoft. However, the inclusion of private repository code in training sets challenges conventional notions of privacy. This shift reflects a broader industry trend where leading AI providers are turning to user interaction data as high-quality public code resources diminish. It signals GitHub’s continued transition from an open-source platform to a closed-loop AI training ecosystem and highlights growing tensions between data compliance and AI model advancement.

GitHub recently announced an update to its repository policy effective April 24, 2026, planning to utilize user interaction data to train its AI models. This data collection covers Copilot Free, Pro, and Pro+ users, specifically including model inputs and outputs, code snippets, contextual information, repository structures, and chat interaction logs.

GitHub's Chief Product Officer, Mario Rodriguez, stated that the introduction of interaction data aims to improve the accuracy and security of the model's code suggestions, noting that pre-testing with Microsoft's internal data has significantly increased suggestion acceptance rates. Notably, the policy adopts an "opt-in by default" mechanism, requiring affected users to manually disable the relevant option in their privacy settings to opt out, which has sparked widespread discussion in the developer community regarding the definition of private repositories and data ownership.

Currently, Copilot Business, Enterprise users bound by contract terms, and educational users are temporarily unaffected by this change. GitHub emphasized in its statement that this move aligns with industry practices commonly adopted by major players like Anthropic, JetBrains, and Microsoft. However, incorporating private repository code into training datasets essentially challenges the traditional boundaries of "private" concepts, even though GitHub claims its purpose is to optimize development workflows.

From an industry perspective, as high-quality public code data becomes increasingly scarce, leading AI vendors are accelerating their shift toward mining "deep data" such as private interaction data to seek performance gains in models. This policy shift not only marks GitHub's further tilt from an open-source hosting platform toward a closed-loop AI training ecosystem but also signals that the AI developer tools sector is entering a new stage of博弈 between data compliance and model evolution.

Связанные с этим вопросы

QWhat is the main change GitHub announced regarding Copilot and user data?

AGitHub announced that starting April 24, 2026, it will update its repository policy to use user interaction data from Copilot Free, Pro, and Pro+ users to train its AI models.

QWhich groups of users are exempt from this new data usage policy?

ACopilot Business, Enterprise users, and educational users are currently not affected by this change due to contractual terms.

QWhat reason did GitHub's Chief Product Officer give for collecting this data?

AMario Rodriguez stated that introducing interaction data aims to improve the model's code suggestion accuracy and security, noting that internal testing at Microsoft has already significantly increased suggestion acceptance rates.

QHow can users opt out of having their data used for training?

AThe policy uses an 'opt-out' mechanism, meaning affected users must manually go into their privacy settings to disable the relevant option to exclude their data.

QWhat broader industry trend does this policy change reflect according to the article?

AIt reflects a trend where top AI vendors are turning to 'deep data' like private interaction data to seek model performance gains as high-quality public code data becomes scarce, signaling a new phase of balancing data compliance with model evolution in AI developer tools.

Похожее

Single-Day Plunge of 30%, Arthur Hayes Suddenly Liquidates: Why Did ZEC Get Exploded by Security Issues?

On June 5th, Zcash founder Zooko Wilcox disclosed a critical soundness vulnerability in the project's latest Orchard privacy pool. This flaw, found in the elliptic curve multiplication constraints, could allow an attacker to create unlimited counterfeit ZEC within the shielded pool, with transactions appearing valid. The vulnerability was discovered in late May by security researcher Taylor Hornby, who utilized Anthropic's new Opus 4.8 AI model for a targeted audit. The Zcash ecosystem had already performed an emergency network upgrade to patch the issue. However, the detailed disclosure triggered severe market panic, causing ZEC's price to plummet over 30% in a single day. Notably, prominent investor Arthur Hayes announced he had sold his entire ZEC position following the news. The incident starkly challenges the "technological trust" narrative central to privacy coins. Despite years of top-tier cryptographic audits, the bug persisted until uncovered with advanced AI-assisted research. This highlights the growing gap between theoretical perfection and practical implementation in privacy technology. The event serves as a industry-wide warning: in an AI-driven security landscape, the assumption that "undiscovered equals safe" is obsolete. It underscores the urgent need for continuous, proactive security practices combining AI audits, formal verification, and rapid response mechanisms.

foresightnews_api44 мин. назад

Single-Day Plunge of 30%, Arthur Hayes Suddenly Liquidates: Why Did ZEC Get Exploded by Security Issues?

foresightnews_api44 мин. назад

Breaking the Curse of DeFi Cascading Liquidations, Vitalik Proposes a New Solution

**Vitalik Buterin Proposes New DeFi Design to Eliminate Forced Liquidations** Ethereum co-founder Vitalik Buterin has published a proposal for a new decentralized finance (DeFi) architecture aimed at removing the automatic liquidation mechanisms prevalent in current lending protocols. The core idea involves creating synthetic assets using options as building blocks, fundamentally avoiding the抵押借贷结构 that triggers forced sell-offs. The proposal responds to a recurring flaw in DeFi: during sharp market downturns, mass自动清算 of under-collateralized positions can exacerbate price declines, creating systemic selling pressure and market instability, as evidenced by recent crypto market volatility. Buterin's model would split an asset like 1 ETH into two option-like derivatives, P and N, pegged to a price index with a set strike price and expiration. At expiry, an oracle determines the settlement price to allocate the underlying ETH between P and N holders. This design eliminates the "cliff" of instant liquidation. Instead, a position's value would gradually drift from its target peg if not actively rebalanced by the user, transferring the rebalancing decision from the protocol to the user or automated tools. A key advantage is the reduced reliance on high-frequency, real-time oracle price feeds, which are vulnerable to manipulation and errors in current systems. The delayed settlement in the options model allows for more robust, fault-tolerant oracle designs. However, significant challenges remain for practical adoption. High transaction costs (slippage) from frequent rebalancing on automated market makers (AMMs) could erode user funds. The model may not be suitable for stablecoins requiring a strict 1:1 dollar peg, as it inherently allows for value drift. Success would depend on developing new liquidity provisioning models and deep markets for these synthetic assets. The proposal represents a fundamental rethinking of DeFi risk management, challenging the industry to explore alternatives to被动集中平仓 rather than merely optimizing existing liquidation processes. It remains a theoretical framework awaiting implementation and testing by development teams.

foresightnews_api47 мин. назад

Breaking the Curse of DeFi Cascading Liquidations, Vitalik Proposes a New Solution

foresightnews_api47 мин. назад

Bitcoin's Decline Marks the Transformation of Crypto

Title: The Decline of Bitcoin Marks the Transformation of Crypto While Bitcoin's price recently fell below $70,000, down approximately 45% from its peak, the broader crypto industry is not following it into decline. Instead, crypto is maturing and evolving beyond its dependence on Bitcoin's price movements. Two of Bitcoin's core functions are being usurped. First, AI has captured its role as the primary speculative asset. AI, with its tangible revenue, explosive demand, and massive capital inflows ($700-830 billion in 2024), is siphoning off the speculative "hot money" that once drove Bitcoin. It also contributes to a sustained high-interest-rate environment, further tightening liquidity for assets like Bitcoin. Second, dollar-pegged stablecoins like USDC and USDT have replaced Bitcoin as the crypto market's foundational currency and primary on/off-ramp. Most trading pairs and on-chain transactions are now settled in stablecoins, severing the historical link where all capital inflows had to pass through Bitcoin first. This decoupling allows projects to thrive based on their own fundamentals rather than Bitcoin's price. Examples include Hyperliquid, an on-chain derivatives exchange with annual revenues of $8-13 billion, and prediction market platform Polymarket, valued at $200 billion with $3.65 billion in annual fees. These projects are evaluated on traditional metrics like revenue and user growth. New opportunities are emerging, particularly around privacy. Privacy coins like Zcash (ZEC) are seeing surging demand, while infrastructure like NEAR enables private, cross-chain asset transfers without requiring users to hold a specific token—privacy becomes a universal service layer. In this new paradigm, stablecoins are the universal cash, various project tokens represent equity, and privacy-enabled cross-chain coordination layers (like NEAR) act as the critical infrastructure connecting a fragmented, multi-chain ecosystem. Bitcoin is now just one asset among many. The era where the entire crypto market moved in lockstep with Bitcoin is over. The industry's health should now be judged by project fundamentals—real revenue, active users, and tokenomics that capture value—and the development of the underlying infrastructure enabling a mature, dollar-denominated crypto economy.

foresightnews_api50 мин. назад

Bitcoin's Decline Marks the Transformation of Crypto

foresightnews_api50 мин. назад

Lightspark CEO: In Ten Years, Bitcoin Will Be as Invisible as TCP/IP, Yet Power Trillions in Daily Transactions

A decade from now, Bitcoin will function like TCP/IP — invisible yet foundational, supporting trillions in daily transactions globally, according to Lightspark CEO David Marcus. In this future, a coffee shop in Lagos receives instant payment, a manufacturer in São Paulo settles an invoice with a supplier in Ho Chi Minh City, and a freelancer in Bangalore gets paid weekly from an Austin startup — all via Bitcoin's settlement layer, with none of the parties consciously interacting with it. This vision parallels the adoption of open protocols: first driven by necessity where existing systems fail, then scaling rapidly as tools mature and economic benefits become clear. The structural shift begins with wallets. Modern non-custodial wallets, like Spark, allow users to hold dollars, local currency, and Bitcoin in a single address, seamlessly switching between them. This eliminates friction and revolutionizes global custody, moving significant deposits to user-controlled keys not by ideology, but by superior utility. As a result, Bitcoin becomes the default savings layer for billions, as its fixed supply and appreciating value make it a rational choice for savers holding it alongside stablecoins in their everyday wallets. Businesses follow a similar path, from small companies in emerging markets to multinational corporations, holding Bitcoin alongside operational stablecoins. The latest trend is direct Bitcoin transactions for commerce. When both parties hold Bitcoin, transacting in it becomes the simplest option — no conversions, no intermediary currency. This starts in niche areas like high-value B2B settlements but grows as infrastructure makes sending Bitcoin as easy as stablecoins. An accelerating force is AI agents. By 2036, AI agents conducting commerce on behalf of individuals and firms will increasingly choose Bitcoin for settlement. Optimizing for speed, finality, and minimal counterparty risk across jurisdictions, they find Bitcoin's global, neutral, and programmable network ideal for netting and settling obligations. Thus, Bitcoin is becoming the native currency for machine commerce, just as it has become a native savings asset for humans. The global monetary system is being rebuilt from the protocol layer: open infrastructure, default self-custody, Bitcoin settling everything underneath, with stablecoins as the interface. Most users won't think about Bitcoin when they transact — and they won't need to.

foresightnews_api54 мин. назад

Lightspark CEO: In Ten Years, Bitcoin Will Be as Invisible as TCP/IP, Yet Power Trillions in Daily Transactions

foresightnews_api54 мин. назад

Торговля

Спот
Фьючерсы
活动图片