380,000 Apps Exposed, 2,000+ Apps Leaked Secrets: AI Programming Turns 'Intranet' into Public Internet

marsbitPublished on 2026-05-11Last updated on 2026-05-11

Abstract

Israeli cybersecurity firm RedAccess uncovered a severe data exposure trend linked to "vibe coding" or AI-powered software development tools. Their research found approximately 38,000 publicly accessible web applications built with platforms like Lovable, Base44, Netlify, and Replit. Of these, an estimated 2,000 apps exposed sensitive corporate and personal data, including medical records, financial information, internal strategic documents, and customer chat logs. In some cases, access even granted administrative privileges. The core issue stems from default privacy settings that make applications public by default, combined with a lack of built-in security controls (like authentication) in the AI-generated code. This allows employees without security expertise—"citizen developers"—to easily create and deploy applications that bypass standard corporate security reviews. The exposed apps, often indexed by search engines, are trivially discoverable. While some platform providers (Replit, Lovable, Wix/Base44) argue that security configuration is the user's responsibility and question the validity of some findings, security researchers confirm the widespread reality of such exposures. This pattern, also noted in prior studies, highlights a critical security gap as AI democratizes app creation, potentially leading to massive, unintentional data leaks.

“Vibe coding tools are leaking vast amounts of personal and corporate data.” Recently, while researching the trend of "shadow AI," researchers from the Israeli cybersecurity startup RedAccess discovered that AI tools used by developers to build software quickly have exposed medical records, financial data, and internal documents from Fortune 500 companies to the open web.

RedAccess CEO Dor Zvi stated that researchers found approximately 380,000 publicly accessible applications and other assets created by developers using tools like Lovable, Base44, Netlify, and Replit. Among these, about 5,000 contained sensitive corporate information, and upon further inspection, nearly 2,000 applications appeared to expose private data. Axios independently verified multiple exposed apps, and WIRED also separately confirmed these findings.

40% of AI-Coded Apps Expose Sensitive Data,

Some Even Have Admin Privileges

As AI increasingly takes over the work of modern programmers, the cybersecurity field has long warned that automated coding tools are bound to introduce a large number of exploitable vulnerabilities into software. However, when these vibe coding tools allow anyone to create and host web applications with just a click, the problem is not just vulnerabilities, but the almost complete lack of any security protection, including highly sensitive corporate and personal data.

It is understood that the RedAccess team analyzed thousands of vibe coding web applications created using AI software development tools like Lovable, Replit, Base44, and Netlify. They found that over 5,000 of these had almost no security mechanisms or authentication. Many such web applications can be directly accessed along with their data by anyone who obtains their URL. Some had minimal barriers to entry, such as requiring registration with any email address.

Among these 5,000 AI-coded apps accessible to anyone simply by entering the URL in a browser, Zvi found that nearly 2,000 appeared to expose private data upon further inspection. Zvi said that approximately 40% of the apps exposed sensitive data, including medical information, financial data, corporate presentations and strategic documents, and detailed logs of user conversations with chatbots.

Screenshots of web applications he shared (some of which were verified to still be online and exposed) showed details including a hospital's work assignment information (containing doctors' personally identifiable information), a company's detailed advertising procurement data, another company's market entry strategy presentation, a retailer's complete chatbot conversation logs (including customers' full names and contact details), a shipping company's freight records, and various sales and financial data from multiple companies. Zvi also stated that in some cases, these exposed applications could potentially allow him to gain administrative access to systems, or even delete other administrators.

Zvi mentioned that RedAccess found it surprisingly easy to search for vulnerable web applications. Lovable, Replit, Base44, and Netlify all allow users to host web applications on the AI companies' own domains, rather than on the user's own domain. Therefore, researchers could identify thousands of applications built using these vibe coding tools by simply searching Google and Bing using these company domains combined with other keywords.

In the case of Lovable, Zvi also discovered a large number of phishing websites impersonating major corporations. These sites appeared to be created using the AI coding tool and hosted on the Lovable domain, including brands like Bank of America, Costco, FedEx, Trader Joe’s, and McDonald's. Zvi also pointed out that the 5,000 exposed apps discovered by RedAccess were only those hosted on the AI coding tools' own domains. There could potentially be tens of thousands more applications hosted on user-purchased domains.

Security researcher Joel Margolis noted that verifying whether real data is actually exposed in an unprotected AI-coded web app is not always straightforward. He and his colleagues previously discovered an AI chat toy that exposed 50,000 conversations with children on a website with minimal security. He said the data in vibe coding applications could be just placeholders, or the app itself might be only a proof-of-concept (POC). Wix's Brodie also believed that the two examples provided to Base44 looked like test sites or contained AI-generated data.

Nevertheless, Margolis believes the problem of data exposure from AI-built web apps is very real. He stated that he frequently encounters the type of exposure Zvi described. "Someone on the marketing team wants to build a website; they are not engineers and probably have little security background or knowledge," he pointed out. AI coding tools will do what you ask, but if you don't ask them to do it securely, they won't do it proactively.

“People Can Create at Will,”

But the Default Settings Are the Problem

Less than two weeks before RedAccess's research was published, another incident occurred: Cursor, running the Claude Opus 4.6 model, deleted PocketOS's entire production database and all volume-level backups in 9 seconds via an API call to infrastructure provider Railway.

Zvi bluntly stated, "People can create something at will and then use it directly in a production environment, representing a company to use it, without needing any permission. There's almost no boundary to this behavior. I don't think we can make the whole world receive security education." He added that his mother also uses Lovable for vibe coding, "but I don't think she considers role-based access control."

RedAccess researchers found that the privacy settings of multiple vibe coding platforms default applications to being public unless users manually change them to private. Many such applications are also indexed by search engines like Google, making it possible for anyone surfing the web to stumble upon them unintentionally.

Zvi believes that current AI web application development tools are creating a new wave of data exposure, rooted in the same combination of user error and insufficient security safeguards. However, a more fundamental issue than any specific security flaw is that these tools enable a whole new category of people within organizations to create applications. They often lack security awareness and bypass the company's existing software development processes and pre-deployment security review mechanisms.

"Anyone in the company, at any time, can generate an application, completely bypassing any development process or security checks. People can use it directly in a production environment without asking anyone's opinion. And that's exactly what they are doing," Zvi said. "The end result is that corporations are essentially leaking private data through these vibe coding applications. This is one of the largest-scale incidents ever, where people are exposing corporate or other sensitive information to anyone in the world."

In October last year, Escape.tech scanned 5,600 public vibe coding applications and also found that over 2,000 had high-risk vulnerabilities, over 400 exposed sensitive information (including API keys and access tokens), and 175 cases involving personal data breaches (including medical records and bank account information). All vulnerabilities found by Escape existed in real production systems and could be discovered within hours. In March this year, the company completed an $18 million Series A funding round led by Balderton, with one of its core investment rationales being the security gaps created by AI-generated code.

Gartner's "Predicts 2026" report pointed out that by 2028, the prompt-to-app approach adopted by "citizen developers" will increase software defect volume by 2,500%. Gartner believes a major new characteristic of such defects is that AI-generated code is syntactically correct but lacks an understanding of overall system architecture and complex business rules. The cost of fixing these "deep-context errors" will erode budgets originally intended for innovation.

Responses and Rebuttals from the Platforms

Currently, three AI coding companies have contested the claims made by RedAccess researchers, stating that the information shared was insufficient and they were not given enough time to respond. However, Zvi said that for dozens of exposed web applications, they proactively contacted the suspected owners. Executives from the companies stated they take such reports seriously, while also noting that the apps being publicly accessible does not necessarily mean there is a data breach or security vulnerability. Nonetheless, these companies did not deny that the web applications discovered by RedAccess were indeed publicly exposed.

Replit's CEO, Amjad Masad, stated that RedAccess only gave them 24 hours to respond before disclosure. In his response on X, he wrote, "Based on the limited information they shared, the core claim from RedAccess appears to be: some users have published apps that should be private to the open internet. Replit allows users to choose whether their app is public or private. Public apps being accessible on the internet is expected behavior. Privacy settings can also be changed with one click at any time. If RedAccess shares the list of affected users, we will proactively default those apps to private and notify users directly."

A spokesperson for Lovable responded in a statement, "Lovable takes reports of data exposure and phishing websites very seriously, and we are actively obtaining the necessary information to investigate. This matter is currently ongoing. It should also be noted that Lovable provides developers with tools to build applications securely, but the ultimate responsibility for how an application is configured lies with the creator."

In the previously published CVE-2025-48757, it was recorded that Supabase projects generated by Lovable had insufficient or even missing Row-Level Security (RLS) policies. Some queries completely bypassed access control checks, leading to data exposure in over 170 production environment applications. The AI was responsible for generating the database layer but did not generate the security policies that should have restricted data access. Lovable contested the CVE classification, stating that protecting application data is the customer's own responsibility.

Blake Brodie, Head of Public Relations at Wix, the parent company of Base44, stated in a declaration: "Base44 provides users with robust tools to configure the security of their applications, including access control and visibility settings." She added, "Turning these controls off is an intentional and simple action that any user can perform. If an application is publicly accessible, that reflects a user's configuration choice, not a platform vulnerability."

Brodie also pointed out, "It's very easy to fabricate apps that appear to contain real user data. Without providing us with any verified cases, we cannot assess the veracity of these allegations." In response, RedAccess countered that they did provide relevant examples to Base44. RedAccess also shared several anonymized communication records showing that Base44 users thanked the researchers for alerting them to their apps' exposure issues, after which the apps were secured or taken down.

It is understood that Wiz Research independently discovered last July that Base44 had a platform-level authentication bypass vulnerability. The exposed API interface allowed anyone to create a "verified account" in a private application using only a publicly visible `app_id`. This vulnerability was akin to standing at the locked door of a building, shouting out a room number, and having the door automatically open. Wix fixed the vulnerability within 24 hours of Wiz's report, but the incident exposed an issue: on these platforms, millions of applications are created by users who often assume the platform has handled security for them, but the actual authentication mechanisms are very weak.

Reference Links:

https://www.wired.com/story/thousands-of-vibe-coded-apps-expose-corporate-and-personal-data-on-the-open-web/

https://www.axios.com/2026/05/07/loveable-replit-vibe-coding-privacy

https://venturebeat.com/security/vibe-coded-apps-shadow-ai-s3-bucket-crisis-ciso-audit-framework

This article is from the WeChat public account "AI Frontline" (ID: ai-front), author: Hua Wei

Trending Cryptos

Related Questions

QWhat is the main security issue reported in the article regarding AI coding tools?

AThe article reports that AI-powered 'vibe coding' tools like Lovable, Base44, Netlify, and Replit are leading to the exposure of private corporate and personal data on the open web. Researchers found approximately 38,000 publicly accessible applications, with nearly 2,000 of them exposing sensitive data like medical records, financial information, and internal corporate documents due to a lack of security controls and default public settings.

QWhich specific types of sensitive data were found to be exposed by the vulnerable AI-coded applications?

AThe exposed data included hospital work assignments with doctors' personally identifiable information (PII), a company's detailed ad-buying data, market-entry strategy presentations, full chatbot conversation logs from a retailer containing customers' full names and contact details, shipping company cargo records, and various sales and financial data from multiple companies. In some cases, the exposed applications could even grant administrative system access.

QAccording to the article, what is a fundamental cause of this data exposure problem beyond specific technical flaws?

AA fundamental cause is that these AI development tools empower a new class of users within organizations ('citizen developers') to create applications. These users often lack security awareness and can bypass traditional corporate software development lifecycles and pre-deployment security reviews. The tools allow anyone to quickly build and deploy applications to production without requiring permission or security checks.

QHow did the AI coding companies mentioned (Replit, Lovable/Wix) respond to the findings of data exposure?

ACompanies like Replit and Lovable/Wix (owner of Base44) disputed the research methodology, citing insufficient information and short response times. They generally argued that their platforms provide tools for users to configure security (like privacy settings) and that publicly accessible applications reflect user configuration choices, not platform vulnerabilities. They emphasized that the ultimate responsibility for securing an application lies with its creator.

QWhat broader industry prediction does the article cite related to the security impact of AI-generated code?

AThe article cites a Gartner prediction from its '2026 Predictions' report stating that by 2028, 'prompt-to-app' methods adopted by citizen developers will cause a 2500% increase in software defects. A key characteristic of these defects is that while AI-generated code is syntactically correct, it lacks understanding of overall system architecture and complex business rules, leading to costly 'deep-context errors'.

Related Reads

Market Trend (June 19): US-Iran Deal Drives Out Geopolitical Premium; Chip Stocks Soar to New Highs; Energy Sector Leads Declines

U.S. Market Trends (June 19): U.S.-Iran Deal Eases Tensions, Chip Stocks Soar, Energy Sector Leads Declines. U.S. stocks rallied on Thursday as the signing of a temporary U.S.-Iran deal in Geneva de-escalated Middle East tensions, with Saudi oil tankers transiting the Strait of Hormuz. This geopolitical relief helped markets recover from recent Fed-driven volatility. The S&P 500 rose over 1%, the Nasdaq gained nearly 2%, and the Dow Jones Industrial Average closed at another record high. The Philadelphia Semiconductor Index surged over 6% to a historic peak. Chip stocks were the standout performers. Reports of an Apple-Intel design and foundry deal for certain products, alongside mentions of potential Nvidia and SpaceX collaborations with Intel, propelled the sector. Intel surged ~10.5%, while memory chip makers like Micron also saw significant gains, highlighting sustained confidence in long-term AI capital expenditure. In contrast, the energy sector was the day's sole loser, with the S&P 500 energy sub-index declining as WTI crude fell ~2% to around $74.29/barrel. The reopening of key shipping routes erased prior geopolitical risk premiums. SpaceX extended losses for a second day on news of a potential large bond offering. Market volatility (VIX) dropped sharply, indicating a swift reversal of post-Fed jitters. Treasury yields dipped slightly but remained elevated. The focus now shifts to upcoming economic data, including next week's PCE inflation report and Micron's earnings, which will serve as a key test for the AI trade's durability.

marsbit3m ago

Market Trend (June 19): US-Iran Deal Drives Out Geopolitical Premium; Chip Stocks Soar to New Highs; Energy Sector Leads Declines

marsbit3m ago

Will MicroStrategy Fall Into a Death Spiral? How Will the Macro Outlook Evolve in the Second Half of the Year?

**Summary:** The discussion centers on recent Bitcoin price declines and the evolving financial strategy of MicroStrategy (MSTR). The core argument is that the primary pressure is not from one-off Bitcoin sales by MSTR, but from the market's new expectation that MSTR may need to engage in *sustained, small-scale* Bitcoin sales to cover cash flow obligations for its growing portfolio of preferred shares and debt instruments (like STRC). This shift is driven by its stated goal of maintaining "bitcoins per share neutrality." The market is now testing whether it can absorb this potential ongoing selling pressure without entering a severe "death spiral" with Bitcoin's price. A resolution may involve MSTR softening its approach to avoid damaging both its stock and Bitcoin. The conversation then explores the parallel rise of AI-related stocks. The guest posits that AI is fundamentally restructuring labor, with "tokens" (representing access to AI models/compute) becoming a new form of capital and a substitute for human execution. This drives corporate efficiency and profits, benefiting upstream hardware providers (semiconductors, data centers), which explains the sustained rally. This represents the early stages of a "machine economy." Regarding crypto exchanges offering US stock trading, this is seen as a natural evolution. With few crypto-native assets generating lasting value, exchanges are pivoting to distribute valuable real-world assets (RWAs). This doesn't necessarily harm crypto's long-term prospects, as blockchain infrastructure may become crucial for future machine-to-machine economies. The analysis concludes that the era of rampant altcoin speculation is likely over, heavily damaged by the liquidity shock of the "1011" event (likely referring to a major market crash). Meme-driven capital has largely migrated to US equities. Looking ahead, macroeconomic uncertainty is rising due to potential large IPOs (e.g., SpaceX) and the US elections. While short-term market corrections are possible, the long-term trends of AI-driven productivity gains and the maturation of blockchain towards real-world utility and institutional adoption remain intact.

marsbit7m ago

Will MicroStrategy Fall Into a Death Spiral? How Will the Macro Outlook Evolve in the Second Half of the Year?

marsbit7m ago

Will MicroStrategy Fall into a Death Spiral? What Will the Macro Trend Be in the Second Half of the Year?

The podcast features investor Didier discussing the recent Bitcoin downturn and the evolving strategy of MicroStrategy (MSTR). He argues the core pressure is not macro factors or ETF outflows, but the market pricing in an expectation that MSTR will engage in continuous, small-scale Bitcoin sales to fund its increasing preferred stock and debt obligations under its "bitcoin-per-share neutrality" principle. This creates a structural headwind. However, he is cautiously optimistic a "death spiral" is avoidable without new major shocks, as market support is likely to emerge at a certain price point. Didier then posits that the AI-driven bull market in US stocks (semiconductors, data centers) is fundamentally driven by AI agents and tokens becoming the "new labor force," displacing human roles and boosting corporate margins. This shift toward a machine economy is still in its early stages. He comments on crypto exchanges adding US stock trading, viewing it as a natural move toward valuable real-world assets as truly valuable crypto-native assets remain scarce. For crypto-native traders, he suggests existing strategies (e.g., meme-chasing or value investing) can translate to similar assets in US markets. The discussion notes the severe liquidity damage from the "1011 event" (likely referring to a major market crash) has essentially ended the altcoin cycle, with speculative momentum shifting to the more liquid US stock market. Regarding the macroeconomic outlook for H2 2024, Didier expresses increased caution due to potential market pressure from upcoming mega-IPOs (e.g., SpaceX) and US midterm election risks. Long-term, he remains bullish on AI's productivity gains and its convergence with blockchain/Web3, which he sees maturing into a more institutional, real-asset-focused phase.

链捕手10m ago

Will MicroStrategy Fall into a Death Spiral? What Will the Macro Trend Be in the Second Half of the Year?

链捕手10m ago

Dylan Patel: Founder of SemiAnalysis, Praised by Jensen Huang, is a 'Beekeeper' and 'Forum Enthusiast'

Dylan Patel, founder of the independent research firm SemiAnalysis, has an unconventional background. A former beekeeper from rural Georgia, he entered the semiconductor world as a self-taught "forum warrior," discussing chip technology anonymously online from a young age. He launched the SemiAnalysis blog in May 2020, which later transitioned to a paid subscription model. The firm has grown from a one-person operation to a global team of around 60, with a dedicated teardown lab. Its detailed, technically-focused analysis on semiconductor supply chains, AI infrastructure, and products has earned significant industry recognition. Notably, NVIDIA founder Jensen Huang has publicly cited their reports. In a landmark case, a critical 2024 report on AMD's MI300X GPU software stack led to a 90-minute call with AMD CEO Lisa Su, who thanked him for the constructive feedback. SemiAnalysis later acknowledged AMD's improvements. The firm's influence on markets was seen when a report on NVIDIA's Rubin memory configuration was partially shared, affecting memory stock prices. Dylan Patel emphasized the importance of context, contrasting the shared excerpt with the report's actual title. SemiAnalysis, now a multi-faceted consultancy with revenue projected to reach $100 million, is known for its deep technical insights that influence major industry players and investment decisions.

marsbit1h ago

Dylan Patel: Founder of SemiAnalysis, Praised by Jensen Huang, is a 'Beekeeper' and 'Forum Enthusiast'

marsbit1h ago

Dylan Patel: SemiAnalysis, Praised by Jensen Huang, is Founded by a 'Beekeeper and Forum Warrior'

Dylan Patel, founder of the independent research firm SemiAnalysis, has an unconventional background. Growing up in rural Georgia, he later worked as a beekeeper in Minnesota. His entry into semiconductors began as a self-taught "forum warrior," engaging anonymously in online tech communities from a young age. In May 2020, he started the SemiAnalysis blog on WordPress, later moving it to Substack as a paid subscription service. The firm has since evolved from a one-person operation into a global company with around 60 employees, featuring a dedicated chip teardown lab. Its revenue, reaching $20 million last year, is projected to surpass $100 million this year. SemiAnalysis is highly regarded in the AI and semiconductor industry for its deep technical analysis. NVIDIA founder Jensen Huang has publicly praised its reports. In a notable instance, a critical report on AMD's MI300X GPU software shortcomings prompted a 90-minute call with CEO Lisa Su, who thanked Patel for the "constructive feedback." A later report acknowledged AMD's subsequent improvements. The firm's analyses have significant market impact. For example, a June report discussing potential memory configuration changes in NVIDIA's next-generation servers was cited as a factor in pressure on memory-related stocks. Patel plans to establish a venture capital firm, having already made personal investments in about 20 startups. SemiAnalysis combines roles as a consultancy, model platform, and tech lab, focusing on the practical bottlenecks in AI infrastructure.

Odaily星球日报1h ago

Dylan Patel: SemiAnalysis, Praised by Jensen Huang, is Founded by a 'Beekeeper and Forum Warrior'

Odaily星球日报1h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片