The Image Generation Model That's Hotter Than Nano Banana Has Leaked, Screenshots Are No Longer Evidence | Includes Prompts

marsbitXuất bản vào 2026-04-19Cập nhật gần nhất vào 2026-04-19

Tóm tắt

A new AI image generation model, widely referred to as "GPT Image 2," has been leaked and is demonstrating significant advancements over predecessors like DALL-E 3 and even Google's Nano Banana Pro. It excels in four key areas: text rendering, prompt adherence, photorealism, and world knowledge. The model can generate highly accurate text in multiple languages, including complex Chinese characters, making it capable of producing convincing fake documents, UI screenshots, and product labels. This capability also raises concerns about the reliability of using screenshots as evidence. The model is currently in A/B testing, with a full release expected around May 2026 when DALL-E services are officially retired. It is accessible for testing on the LM Arena platform. The article includes several prompt templates optimized for the model, such as generating realistic app screenshots, product photos with detailed labels, and street scenes with accurate signage. This advancement is reshaping creative workflows but also accelerating the displacement of some traditional design roles.

Is your impression of text-to-image still stuck on Nano Banana?

But kid, times have changed again.

@johnAGI168 https://x.com/johnAGI168/status/2044781168151724067

@0115hippo https://x.com/0115hippo/status/2044722124611539160

In early April, three anonymous image models, codenamed maskingtape-alpha, packingtape-alpha, and gaffertape-alpha, appeared on the LM Arena evaluation platform. They disappeared a few hours later.

OpenAI has not officially announced this model yet, but based on the metadata returned by the API and user-side testing records, it has already gained a widely accepted name: GPT Image 2.

Screenshots Can No Longer Be Used as Evidence

Over the past few years, one of the most obvious weaknesses of AI image generation models has been text within images. In the DALL-E 3 era, if you asked it to write "Hello" in an image, it might output "Hellp" or even "Hl10", with letters tilting drunkenly. GPT Image 1 improved a lot, handling simple English labels. By GPT Image 1.5, its accuracy in rendering English text was close to 95%, but it still had significant flaws with non-Latin scripts like Chinese, Japanese, and Korean.

But the leaked sample images from GPT Image 2 have changed this impression.

@MrLarus https://x.com/MrLarus/status/2044824800909054181

@akokoi1 https://x.com/akokoi1/status/2044789531615056175

The text in the images is exactly what it should be. Chinese characters are clear, with accurate glyphs and complete strokes. Someone tested generating an ID card-style image, where the name, address, and ID number were all rendered correctly, with neat formatting, looking at first glance like a photo of a real document.

This is good news. The improvement in text rendering means generating infographics, posters, product packaging, and complex charts becomes more reliable.

But there's always another side to the coin. A model that can generate photo-realistic ID-style images and precisely render UI screenshots naturally makes "screenshots can be used as evidence" increasingly questionable.

By comparison, this is also a core difference between the GPT Image series and other models. Midjourney still has no progress in text rendering, and the Stable Diffusion series also has this old problem. According to the leaked Arena test results, GPT Image 2 surpassed Midjourney in four dimensions: text rendering, instruction following, photorealism, and world knowledge. Midjourney's advantages are mainly retained in artistic style and aesthetic control.

Does It Really Know What the World Looks Like?

A tester asked the model to generate a hypothetical GPT-8 product pricing page. The resulting image had a layout that was indeed in the style of the OpenAI website, with button placement and font choices resembling those from a real interface, and the hierarchical logic of the price table was correct.

GPT Image 2 can generate images extremely similar to real software interfaces, including browser windows, mobile app interfaces, and data visualization charts, with a level of fidelity unmatched by the previous generation.

@johnAGI168 https://x.com/johnAGI168/status/2044781168151724067

@levelsio https://x.com/levelsio/status/2040333489476681758

This will lead to some very interesting practical uses. When designers are creating product prototypes, they don't need to open Figma first and draw a bunch of wireframes; they can directly describe the desired interface in text, and the output is a reference image that can be used for team discussions. When creating investor decks, they can show a "product screenshot" without waiting for an engineer to write code. When writing documentation, example interface images for illustration can be generated directly, without having to think about where to find screenshots for a blank page.

@marmaduke091 https://x.com/marmaduke091/status/2040338311873515597

Image Generation Is No Longer Just "Image Generation"

OpenAI has already announced that DALL-E 2 and DALL-E 3 will officially cease service on May 12, 2026. Azure OpenAI's DALL-E 3 was retired early in February.

DALL-E was the first place many people encountered AI image generation, from those blurry early works to today, in just a few short years.

Meanwhile, Google, which had just established its industry position with Nano Banana Pro in early 2026, might feel the pressure. Early test reports indicate that GPT Image 2 simultaneously surpasses Nano Banana Pro in three dimensions: realism, text rendering, and world knowledge. This kind of triple win is not common.

For creators, the feeling is complex. Illustrators, graphic designers, and photographers are not facing this topic for the first time. Since the release of GPT Image 1, the number of freelance graphic design positions has decreased by about 18%. AI has indeed replaced the decision to "hire someone to do this" in certain scenarios, but it is also creating new ways of working, allowing one person to do more.

The evolution speed of image generation models no longer leaves much time for adaptation. It was only a few months from GPT Image 1's launch to version 1.5. And from 1.5 to 2, it's only been about half a year. Each generation solves the core shortcomings of the previous one while opening up new possibilities.

GPT Image 2 is currently still in the A/B testing phase, with some ChatGPT users randomly gaining access. The official release window is widely predicted to be around May, coinciding with the retirement of DALL-E. If you want to experience it early, you can currently try your luck on the LM Arena evaluation platform.

Test Address: https://arena.ai

Based on community feedback and the known strengths of this model, the following prompt templates can maximize your chances of success:

UI/Screenshot Prompt: A photorealistic screenshot of a mobile banking app, clearly showing transaction history with dates, amounts, and merchant names legible. iPhone 16 screen, natural hand holding the phone, coffee shop background.

Product Label Prompt: A photographic product photo of a craft beer bottle, with clear label details showing the brewery name "Oakridge Brewing Co.", alcohol content 6.8%, a mountain logo, and an ingredient list. Studio lighting, white background.

Signage Prompt: A street scene photo of a Tokyo alley at night, showing multiple neon signs in both Japanese and English, including a ramen shop sign reading "Ichiban Ramen — Est. 1987", a karaoke bar sign, and various glowing advertisements. Wet, reflective pavement with light reflections.

Interface/World Knowledge Prompt: A photorealistic YouTube video screenshot showing a video titled "How to Assemble a Computer in 2026" with 2.3 million views, featuring realistic comments, sidebar video recommendations, and channel info. Desktop browser view.

Widescreen Trigger Prompt: A cinematic widescreen photo of an IKEA store exterior at dusk, showing the glowing IKEA sign, a parking lot with realistic cars, and shoppers entering and leaving. Golden hour lighting, 16:9 format.

Unattributed image sources and references: https://miraflow.ai/blog/how-to-use-duct-tape-ai-model-arena-gpt-image-2-guide

This article is from the WeChat public account "APPSO", author: Discovering Tomorrow's Products

Tiền kỹ thuật số thịnh hành

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

Câu hỏi Liên quan

QWhat is the name of the leaked image generation model mentioned in the article, and what is its significance?

AThe leaked model is referred to as GPT Image 2. Its significance lies in its dramatic improvement in text rendering accuracy, especially for non-Latin scripts like Chinese, and its ability to generate highly realistic images, including convincing UI screenshots and document-style images, which challenges the reliability of screenshots as evidence.

QHow does GPT Image 2's performance compare to other models like Midjourney and Google's Nano Banana Pro?

AAccording to the article, GPT Image 2 outperforms Midjourney in text rendering, prompt following, photorealism, and world knowledge, with Midjourney retaining an advantage mainly in artistic style and aesthetic control. It also reportedly surpasses Google's Nano Banana Pro in realism, text rendering, and world knowledge.

QWhat are some of the potential practical applications of GPT Image 2's capabilities?

APotential applications include generating product prototypes and UI mockups for designers, creating realistic 'screenshots' for investor decks without coding, producing example interface images for documentation, and generating accurate product labels, packaging, and information graphics.

QWhat major change is OpenAI making to its image generation services in relation to this new model?

AOpenAI has announced that DALL-E 2 and DALL-E 3 will officially stop service on May 12, 2026, with Azure's DALL-E 3 having already been retired in February. This suggests a transition to the new GPT Image model series.

QWhere can users currently try to access or test the GPT Image 2 model, and what is a recommended strategy for getting good results?

AThe model is currently in A/B testing, with some ChatGPT users randomly gaining access. Users can also try their luck on the LM Arena评测平台 (arena.ai). The article recommends using specific, detailed prompt templates focused on UI/screenshots, product labels, signage, interface/world knowledge, and widescreen formats to maximize success.

Nội dung Liên quan

TIN TỨC MỚI NHẤT: Michael Saylor ra tuyên bố về vụ bán Bitcoin hôm nay! 'Tôi đã nói tôi sẽ không bao giờ bán Bitcoin cá nhân của mình'

**MicroStrategy Bán 1.638 Bitcoin, Giám đốc Michael Saylor Tái Khẳng Định: "Tôi Không Bao Giờ Bán Bitcoin Cá Nhân"** Ngày hôm nay, MicroStrategy - công ty do Michael Saylor đứng đầu - đã bán 1.638 Bitcoin với giá trung bình $63,957, thu về khoảng $105 triệu. Sau giao dịch, tổng tài sản Bitcoin của công ty giảm xuống còn 842.138 BTC, trị giá khoảng $63.5 tỷ. Saylor giải thích rằng đây là động thái quản lý vốn của một công ty đại chúng, hoàn toàn tách biệt với chiến lược đầu tư cá nhân của ông. Số tiền thu được sẽ dùng để tài trợ cho việc phân phối cổ phiếu ưu đãi và mua lại cổ phiếu. Trong một tuyên bố rõ ràng, Saylor nhấn mạnh: "Khi tôi nói 'Đừng bao giờ bán Bitcoin', đó là lời khuyên giữa những nhà đầu tư cá nhân. Tôi chưa từng bán bất kỳ Bitcoin cá nhân nào. MicroStrategy là một công ty đại chúng, không phải ví tiền cá nhân của tôi." Ông khẳng định niềm tin cốt lõi của công ty vào Bitcoin vẫn không thay đổi.

cryptonews.ru11 phút trước

TIN TỨC MỚI NHẤT: Michael Saylor ra tuyên bố về vụ bán Bitcoin hôm nay! 'Tôi đã nói tôi sẽ không bao giờ bán Bitcoin cá nhân của mình'

cryptonews.ru11 phút trước

Công ty phân tích tiết lộ ngưỡng quan trọng cho Bitcoin (BTC): "Đây có thể là dấu chấm hết cho thị trường gấu!" Dưới đây là chi tiết

Một công ty nghiên cứu tiền điện tử đã tiết lộ một ngưỡng quan trọng đối với Bitcoin (BTC) mà có thể chấm dứt thị trường giá xuống. Marcus Thielen, người sáng lập 10x Research, cho biết việc đóng cửa giá hàng tháng trên mức 63.000 USD sẽ là tín hiệu quan trọng xác nhận đáy của thị trường gấu. Tuy nhiên, với việc giá tháng 7 đóng cửa dưới ngưỡng này, ông nhận định thị trường vẫn chưa có xác nhận cuối cùng. Thielen chỉ ra xu hướng ngắn hạn vẫn là giảm giá, khi BTC hiện giao dịch dưới cả đường trung bình động 7 ngày và 30 ngày, đồng thời giảm 3,2% trong tuần qua. Ông cũng cảnh báo về các rủi ro vĩ mô, như việc Cục Dự trữ Liên bang Mỹ (Fed) có thể tăng lãi suất trở lại vào tháng 9 nếu lợi tức trái phiếu kho bạc Mỹ kỳ hạn 10 năm tiếp tục tăng, gây áp lực lên các tài sản rủi ro như Bitcoin. Ngoài ra, áp lực bán có thể gia tăng từ các công ty khai thác Bitcoin đang chuyển hướng sang lĩnh vực trí tuệ nhân tạo (AI) - những đơn vị nắm giữ khoảng 100.000 BTC, cũng như từ các công ty tổ chức quản lý Bitcoin khác. Theo Thielen, để Bitcoin quay trở lại xu hướng tăng, nó cần phải giữ vững các mức hỗ trợ kỹ thuật quan trọng và có điều kiện kinh tế vĩ mô thuận lợi.

cryptonews.ru11 phút trước

Công ty phân tích tiết lộ ngưỡng quan trọng cho Bitcoin (BTC): "Đây có thể là dấu chấm hết cho thị trường gấu!" Dưới đây là chi tiết

cryptonews.ru11 phút trước

XDC Tech tích hợp nền tảng stablecoin Bridge để đưa thanh toán stablecoin on-chain vào thương mại AI tự trị

New York, ngày 3 tháng 8 năm 2026 – XDC Tech, bộ phận thể chế của XDC Network tại Mỹ, đã công bố tích hợp nền tảng stablecoin hàng đầu Bridge (một công ty của Stripe). Hợp tác này cung cấp cho các nhà phát triển trên XDC công cụ chuyển đổi tiền mặt/stablecoin, tài khoản ảo và lưu ký đa tiền tệ, loại bỏ nhu cầu tự xây dựng lớp tuân thủ. Trọng tâm là thanh toán: doanh nghiệp có thể nhận USD, EUR qua tài khoản ảo Bridge và được quyết toán bằng stablecoin trên XDC gần như thời gian thực, cắt giảm ngân hàng đại lý và thời gian xử lý nhiều ngày. Ứng dụng bao gồm tài chính thương mại và tài sản được mã hóa. Đây là bước nền tảng cho lộ trình của XDC trở thành lớp quyết toán cho nền kinh tế tác nhân AI (agentic economy), nơi các tác nhân AI tự trị giao dịch với tốc độ máy. Sự kết hợp mang lại 6 lợi thế chính: 1. **Thanh toán tốc độ máy** cho các giao dịch vi mô của AI. 2. **Truy cập tuân thủ** vào hệ thống tiền pháp định ở Mỹ, EU, Mỹ Latinh. 3. **Ví bản địa cho AI** thông qua tài khoản ảo Bridge. 4. **Lưu ký đa tiền tệ** cho thương mại xuyên biên giới. 5. **Hỗ trợ tài chính thương mại & tài sản mã hóa** do AI điều hành. 6. **Tuân thủ sẵn có** (KYC/KYB) cho các khoản thanh toán tự trị. Tích hợp hiện đã sẵn sàng cho các nhà phát triển, với tài liệu có tại cổng nhà phát triển XDC.

cointelegraph24 phút trước

XDC Tech tích hợp nền tảng stablecoin Bridge để đưa thanh toán stablecoin on-chain vào thương mại AI tự trị

cointelegraph24 phút trước

BlackRock ra mắt quỹ thị trường tiền tệ được token hóa để làm tài sản dự trữ cho stablecoin

BlackRock, nhà quản lý tài sản lớn nhất thế giới, đã ra mắt hai sản phẩm quỹ thị trường tiền tệ được token hóa, được thiết kế để làm tài sản dự trữ cho stablecoin. Sản phẩm đầu tiên là một cổ phần được token hóa từ quỹ hiện có của họ trên Ethereum. Sản phẩm thứ hai là một quỹ mới, hỗ trợ nhiều blockchain và tự động tái đầu tư cổ tức hằng ngày. Cả hai quỹ đều được cấu trúc để đủ điều kiện làm tài sản dự trữ cho các nhà phát hành stablecoin thanh toán được chấp thuận tại Mỹ theo Đạo luật GENIUS. Việc ra mắt này mở rộng sự hiện diện của BlackRock trong thị trường trái phiếu kho bạc được token hóa, nơi quỹ BUIDL của họ hiện là lớn nhất.

cointelegraph51 phút trước

BlackRock ra mắt quỹ thị trường tiền tệ được token hóa để làm tài sản dự trữ cho stablecoin

cointelegraph51 phút trước

Một Kỷ Nguyên Mới Có Đang Bắt Đầu Cho Bitcoin? Chiến Lược Khởi Động Tháng Tám Với Các Đợt Bán Tháo Tiếp Theo: Công Bố Một Đợt Bán BTC Lớn Khác!

Công ty Strategy, nhà nắm giữ Bitcoin lớn nhất thế giới được giao dịch công khai, đã thông báo một đợt bán ra mới. Theo người sáng lập Michael Saylor, công ty đã bán 1.638 Bitcoin với giá trị khoảng 105 triệu USD từ ngày 27/7 đến 2/8, thu về 104,7 triệu USD ở mức giá trung bình 63.957 USD/BTC. Chiến lược bán BTC này nhằm mục đích tài trợ cho cổ tức ưu đãi và mua lại cổ phiếu STRC. Với đợt bán bắt đầu từ tháng 8, tổng lượng Bitcoin Strategy sở hữu đã giảm xuống còn 842.138 BTC. Đáng chú ý, công ty đã không mua BTC trong khoảng 6 tuần. Trong cùng kỳ, Strategy cũng huy động được 290,6 triệu USD thông qua việc phát hành cổ phiếu MSTR, một phần trong số đó (81,2 triệu USD) đã được dùng để mua lại STRC. Công ty cho biết dự trữ tiền mặt bằng USD của họ đã tăng lên khoảng 4 tỷ USD, giúp kéo dài thời gian có thể chi trả cổ tức bằng nguồn dự trữ này thêm 57 ngày, lên tổng cộng 2,3 năm.

cryptonews.ru1 giờ trước

Một Kỷ Nguyên Mới Có Đang Bắt Đầu Cho Bitcoin? Chiến Lược Khởi Động Tháng Tám Với Các Đợt Bán Tháo Tiếp Theo: Công Bố Một Đợt Bán BTC Lớn Khác!

cryptonews.ru1 giờ trước

Giao dịch

Giao ngay

Bài viết Nổi bật

Làm thế nào để Mua BANANA

Chào mừng bạn đến với HTX.com! Chúng tôi đã làm cho mua Banana Gun (BANANA) trở nên đơn giản và thuận tiện. Làm theo hướng dẫn từng bước của chúng tôi để bắt đầu hành trình tiền kỹ thuật số của bạn.Bước 1: Tạo Tài khoản HTX của BạnSử dụng email hoặc số điện thoại của bạn để đăng ký tài khoản miễn phí trên HTX. Trải nghiệm hành trình đăng ký không rắc rối và mở khóa tất cả tính năng. Nhận Tài khoản của tôiBước 2: Truy cập Mua Crypto và Chọn Phương thức Thanh toán của BạnThẻ Tín dụng/Ghi nợ: Sử dụng Visa hoặc Mastercard của bạn để mua Banana Gun (BANANA) ngay lập tức.Số dư: Sử dụng tiền từ số dư tài khoản HTX của bạn để giao dịch liền mạch.Bên thứ ba: Chúng tôi đã thêm những phương thức thanh toán phổ biến như Google Pay và Apple Pay để nâng cao sự tiện lợi.P2P: Giao dịch trực tiếp với người dùng khác trên HTX.Thị trường mua bán phi tập trung (OTC): Chúng tôi cung cấp những dịch vụ được thiết kế riêng và tỷ giá hối đoái cạnh tranh cho nhà giao dịch.Bước 3: Lưu trữ Banana Gun (BANANA) của BạnSau khi mua Banana Gun (BANANA), lưu trữ trong tài khoản HTX của bạn. Ngoài ra, bạn có thể gửi đi nơi khác qua chuyển khoản blockchain hoặc sử dụng để giao dịch những tiền kỹ thuật số khác.Bước 4: Giao dịch Banana Gun (BANANA)Giao dịch Banana Gun (BANANA) dễ dàng trên thị trường giao ngay của HTX. Chỉ cần truy cập vào tài khoản của bạn, chọn cặp giao dịch, thực hiện giao dịch và theo dõi trong thời gian thực. Chúng tôi cung cấp trải nghiệm thân thiện với người dùng cho cả người mới bắt đầu và người giao dịch dày dạn kinh nghiệm.

Tổng lượt xem 329Xuất bản vào 2024.12.11Cập nhật vào 2026.06.02

Thảo luận

Chào mừng đến với Cộng đồng HTX. Tại đây, bạn có thể được thông báo về những phát triển nền tảng mới nhất và có quyền truy cập vào thông tin chuyên sâu về thị trường. Ý kiến của người dùng về giá của BANANA (BANANA) được trình bày dưới đây.

The Image Generation Model That's Hotter Than Nano Banana Has Leaked, Screenshots Are No Longer Evidence | Includes Prompts

Tóm tắt

Screenshots Can No Longer Be Used as Evidence

Does It Really Know What the World Looks Like?

Image Generation Is No Longer Just "Image Generation"

Tiền kỹ thuật số thịnh hành

Câu hỏi Liên quan

Nội dung Liên quan

TIN TỨC MỚI NHẤT: Michael Saylor ra tuyên bố về vụ bán Bitcoin hôm nay! 'Tôi đã nói tôi sẽ không bao giờ bán Bitcoin cá nhân của mình'

Công ty phân tích tiết lộ ngưỡng quan trọng cho Bitcoin (BTC): "Đây có thể là dấu chấm hết cho thị trường gấu!" Dưới đây là chi tiết

XDC Tech tích hợp nền tảng stablecoin Bridge để đưa thanh toán stablecoin on-chain vào thương mại AI tự trị

BlackRock ra mắt quỹ thị trường tiền tệ được token hóa để làm tài sản dự trữ cho stablecoin

Một Kỷ Nguyên Mới Có Đang Bắt Đầu Cho Bitcoin? Chiến Lược Khởi Động Tháng Tám Với Các Đợt Bán Tháo Tiếp Theo: Công Bố Một Đợt Bán BTC Lớn Khác!

Giao dịch

Bài viết Nổi bật

Làm thế nào để Mua BANANA

Thảo luận

Danh mục Phổ biến

Thẻ Nổi bật