OpenAI Completely Overhauls Codex, Grows Its Own Mouse, Schedules Itself to Outwork Humans

marsbitDipublikasikan tanggal 2026-04-17Terakhir diperbarui pada 2026-04-17

Abstrak

OpenAI has fundamentally transformed Codex from a coding assistant into a versatile AI agent capable of autonomous computer control. The updated Codex can now operate in the background with its own independent cursor, allowing it to run applications like Xcode, test software, and fix bugs without interrupting the user’s workflow. It features a built-in browser for real-time visual editing of web pages, enabling developers to make changes through direct UI interactions. The release also introduces over 90 plugins for tools like JIRA, CircleCI, and SharePoint, integrating end-to-end development and productivity workflows. A new "heartbeat" feature allows Codex to self-schedule tasks, maintain context across sessions, and work autonomously over extended periods. Additional enhancements include memory capabilities, support for handling GitHub code reviews, multi-tab terminals, and built-in file generation for Excel, PDF, and PowerPoint. Currently, advanced computer control is available on macOS (excluding EU/UK users), with Windows support limited to data retrieval. This expansion positions Codex as a foundational step toward OpenAI’s planned "super-app," combining coding, automation, and multi-tool integration into a single platform.

Just now, OpenAI has turned Codex upside down!

Yesterday you were still using Codex to write code.

Today it can already view your screen, click your mouse, remember your preferences from last week, and schedule its own tasks.

Multiple AI Agents work for you in the background simultaneously, without affecting your mouse and keyboard at all.

Codex's "Secret Sauce": It can use apps directly in the background without taking over your entire computer.

From today, this tool used weekly by 3 million developers is no longer just a programming Agent.

You Do Your Work, It Runs Xcode in the Background

Now, Codex has its own cursor, which does not interfere with your mouse.

You write documents while it runs Xcode to test an App next to you, both happening at the same time.

This feature has significant origins; its lead, Ari Weinstein, is the co-creator of Apple Shortcuts, whose team was acquired by OpenAI last fall.

To understand what it can do, just watch a demo.

First, the user gives the command: "Run this Tic-Tac-Toe App in Xcode, play a game to test it yourself, and fix any bugs you find."

Then, Codex opens Xcode by itself, starts the iOS simulator, and begins playing with its own cursor.

During testing, it discovers a logic bug—after a human move, the computer draws two O's simultaneously.

After some thought, Codex decisively switches back to the code interface and locates the position of the bug.

After modifying the Swift code, it immediately recompiles and performs a second round of complete testing for verification.

In less than a minute. Run → Test → Find bug → Fix → Regression verification, the entire Debug closed loop, all in one go.

Currently, Computer Use only supports macOS; users in the EU and UK cannot use it for now.

The Windows version can pull information from other apps into Codex but does not yet support background cursor-level control.

In this update, Codex gains support for Intel Macs for the first time.

Click to Modify, No More Code Hunting for Front-End Debugging

The Codex client now has a built-in browser,底层 powered by OpenAI's own Atlas engine.

In practical terms,以前 front-end developers adjusting UI had to jump back and forth between code and the browser. Now you can operate on the rendered webpage.

Click the main title, leave a comment "Reduce font size and shorten the slogan"; click the top left corner, "Add a logo";发现 the chart's X-axis legend is out of bounds, click the error spot, write "Fix the out-of-bounds issue".

Codex understands visual and spatial context,修改 code in the background instantly, and the page refreshes in real-time.

OpenAI used a web application called Brickfolio, a Lego set tracker, for the demo.

Codex wrote the code from scratch, configured the environment, started the local server, and opened the rendered page in the built-in browser. The whole process took just seconds.

Then came the WYSIWYG (What You See Is What You Get) modification experience. It feels like reviewing a design mockup; you just annotate the problems, and all underlying iterations are handled by the AI.

In other words, the user只需 clicks and marks here and there on the page, and Codex changes the code in the background, producing results in real-time on the front end.

The built-in browser is currently limited to localhost preview. OpenAI stated that future expansions will include full browser control capabilities.

Over 90 Plugins Launched, The Entire Toolchain Welded into Codex

On the plugin front, OpenAI launched over 90 in one go.

Atlassian Rovo manages JIRA, CircleCI handles CI/CD, GitLab Issues tracks requirements, Microsoft Suite processes documents, Neon by Databricks operates databases—covering almost all tools a development team uses daily.

Usage is simple: just @ the plugin name in the input box.

For example, @SharePoint, ask Codex to read documents in the product catalog and generate an executive brief. It automatically searches the file tree, parses documents, and extracts core information, saving you from searching through various cloud drives.

Another example: @Superpowers, ask Codex to conceptualize a feature plan within the local code directory. It will traverse your file structure, read code and CSS, and provide a set of implementation suggestions that fit the current project architecture.

@CircleCI can help diagnose branch build failures; @Atlassian Rovo can read product specifications on Confluence, output summaries in format, and even convert feature points into standard JIRA tasks.

From upstream requirements to local coding, to CI/CD and task management, plugins串起来了 the entire chain.

AI Starts Scheduling Itself

Even more noteworthy is the new "Heartbeat" mechanism.

Now Codex can schedule future work for itself, automatically waking up at the set time to continue working, even across days or weeks. Moreover, it can reuse previous conversation threads, so the context accumulated last time is not lost.

For example, a user can ask Codex to check Slack, Gmail, Google Calendar, and Notion. It pulls relevant information from these four channels and throws out a prioritized to-do list.

The user adds, "Can you keep an eye on this for me?"

Codex immediately sets itself an hourly automatic patrol schedule, proactively reports on key points requiring decisions, and even asks, "Do you need me to draft a reply for you?"

This is no longer just a tool; it's a junior employee who doesn't sleep.

Coupled with the natively built-in image generation capability of gpt-image-1.5, product concept maps, front-end designs, and visual prototypes can all be completed in one go within the same workflow.

Daily Needs Filled in One Go

Besides these major features, there are also a batch of experience-level upgrades.

First, the memory function is available in preview; Codex can remember your preferences and corrected points, so you don't have to explain from scratch the next time you start a chat.

Second, GitHub code review comments can now also be processed in Codex.

Support for opening multiple terminal tabs simultaneously is here, and the ability to connect to remote development machines via SSH is also in internal testing. There's also a new summary panel to help you keep an eye on the Agent's work plans, information sources, and output files at any time.

In the demo, the user asked Codex to organize the recent open issues of the current project and generate a table grouped by theme.

Codex then pulled the code repository context in the background and, after a few minutes, threw out a core summary listing the project's current biggest pain points.

Click to generate an Excel file—no need to switch to external software; a complete spreadsheet preview can be opened right in the sidebar.

The same goes for PDFs and PPTs, all handled within the Codex window.

The First Piece of the Super App Puzzle

Looking back at the timeline, you can feel OpenAI's pace.

March 19: Foreign media reported that OpenAI plans to merge ChatGPT, Codex, and the Atlas browser into a desktop "Super App".

March 31: OpenAI secured $122 billion in funding, with a valuation of $852 billion, led by Amazon, NVIDIA, and SoftBank. The funding documents clearly stated that the capital would be used for the development and deployment of the Super App.

April 16: This wave of Codex updates landed.

Another telling data point: over 80% of OpenAI's internal employees are already using Codex, and not just engineers.

Writing weekly reports, organizing feedback, drafting product requirement documents, reviewing contracts, sending security training reminders—they use it for everything.

50% of Codex users are already using it for non-coding tasks.

This isn't a programming tool adding features. This is a Super App using the shell of a programming tool for a cold start.

If You Can't Beat Them, Infiltrate: Made an Official Plugin for Anthropic

Even more interestingly, OpenAI also made an official plugin for Claude Code, actively embedding Codex into the competitor's ecosystem.

It somewhat has the vibe of, rather than waiting for developers to switch camps, better to infiltrate their workflow first.

Currently, Codex emphasizes background execution, multi-Agent parallelism, and unattended operation; Claude Code's advantages lie in long-context reasoning and deep code understanding. More and more teams are choosing to use both.

However, what OpenAI wants is clearly not just a share of the pie.

$122 billion is bet on something far greater than a programming tool.

Reference: https://openai.com/index/codex-for-almost-everything/

This article is from the WeChat public account "新智元" (New Wisdom Yuan), edited: Hao Kun (Very Sleepy)

Pertanyaan Terkait

QWhat is the key new feature of OpenAI's Codex that allows it to operate in the background without taking over the user's entire computer?

AThe key new feature is called 'Computer Use', which gives Codex its own independent cursor, allowing it to operate applications in the background (like running Xcode) while the user continues their work uninterrupted with their own mouse and keyboard.

QWhich operating system currently has the most comprehensive support for Codex's new background control capabilities, and which regions are temporarily excluded?

AmacOS currently has the most comprehensive support for the new background cursor-level control. Users in the European Union and the United Kingdom are temporarily unable to use this feature.

QHow does the new 'heartbeat' mechanism in Codex function?

AThe 'heartbeat' mechanism allows Codex to schedule future work for itself. It can wake up automatically at a set time (even across days or weeks) to continue tasks and can reuse previous conversation threads, maintaining all accumulated context from earlier interactions.

QWhat was the significant business move mentioned in the article that suggests OpenAI is building a 'super app'?

AThe significant business move was OpenAI securing $122 billion in funding at an $852 billion valuation, with investors including Amazon, Nvidia, and SoftBank. The funding documents explicitly stated the capital would be used for the development and deployment of a desktop 'super app' that merges ChatGPT, Codex, and the Atlas browser.

QWhat surprising strategic move did OpenAI make regarding its competitor Anthropic's Claude Code?

AOpenAI created and released an official plugin for Anthropic's Claude Code, effectively embedding its own Codex tool into the ecosystem of its direct competitor. This is a strategic move to integrate into developers' workflows rather than waiting for them to switch platforms entirely.

Bacaan Terkait

Dialog dengan Ray Dalio: Saat Ini Berada dalam Gelembung AI, 1% Portofolio Investasi Adalah Bitcoin

Sumber: The Diary Of A CEO Ray Dalio, pendiri Bridgewater Associates yang meramalkan krisis keuangan 2008, memperingatkan bahwa ledakan AI saat ini menunjukkan tanda-tanda klasik gelembung ekonomi yang dapat pecah dan memicu resesi. Dalam wawancara podcast, dia menjelaskan dinamika "siklus besar" yang didorong oleh ketimpangan kekayaan, defisit pemerintah, dan perubahan geopolitik. Dalio mengidentifikasi pola di mana antusiasme berlebihan terhadap teknologi revolusioner baru, seperti AI, menyebabkan harga aset melambung dan pinjaman berlebihan. Ketika kondisi berubah (seperti kenaikan suku bunga atau kebutuhan tunai), gelembung ini dapat pecah, menyebabkan penurunan harga aset, kerugian luas, dan kontraksi ekonomi. Untuk melindungi kekayaan di masa ketidakpastian, Dalio sangat menekankan pentingnya **diversifikasi portofolio**—termasuk saham, obligasi, emas, dan real estat—daripada mengandalkan uang tunai saja. Dia mengungkapkan bahwa sekitar **1% portofolionya adalah Bitcoin**, yang diakuinya sebagai aset keras, tetapi dia lebih menyukai **emas fisik** karena sejarahnya sebagai penyimpan nilai dan aset bebas liabilitas. Mengenai dampak AI, Dalio percaya bahwa teknologi ini akan menggantikan tidak hanya tenaga fisik tetapi juga kemampuan kognitif manusia, berpotensi memperlebar kesenjangan antara pemilik modal dan pekerja. Masa depan akan menguntungkan mereka yang dapat memadukan kecerdasan manusia (seperti emosi dan intuisi) dengan kemitraan AI. Secara geopolitik, Dalio menggambarkan dunia yang memasuki fase "penurunan" dalam tatanan global, dengan Amerika Serikat menghadapi tantangan internal dan eksternal, termasuk konflik seperti di Iran yang mengungkap kelemahannya. Dia memprediksi dunia mungkin menjadi lebih terregionalisasi di masa depan. Secara keseluruhan, kunci untuk navigasi melalui periode kompleks ini adalah pemahaman akan pola sejarah, adaptasi, dan diversifikasi yang cermat.

marsbit3j yang lalu

Dialog dengan Ray Dalio: Saat Ini Berada dalam Gelembung AI, 1% Portofolio Investasi Adalah Bitcoin

marsbit3j yang lalu

Rekor! Beli Bersih Asing 7,2 Triliun Won dalam Sehari, Wall Street: Tekanan Likuiditas di Pasar Saham Korea Telah Mereda

**Ringkasan: Rekor Pembelian Asing dan Peningkatan Likuiditas di Pasar Saham Korea** Aliran modal asing menunjukkan perubahan signifikan di pasar saham Korea (KOSPI). Pada 31 Juli, investor asing melakukan pembelian bersih rekor sebesar 7,2 triliun Won Korea dalam sehari, menandai pembalikan dari tren penjualan bersih besar-besaran dalam beberapa bulan terakhir. Secara bulanan, penjualan bersih asing menyusut drastis menjadi 9,8 triliun Won di Juli, turun dari 48,4 triliun dan 44,5 triliun Won pada Juni dan Mei. Tekanan penjualan dari lembaga domestik juga mereda. Dana pensiun dan reksa dana domestik justru menjadi pembeli bersih 1,0 triliun Won di Juli, setelah dua bulan sebelumnya menjadi penjual bersih. Faktor pendukung lainnya adalah peraturan baru dari Komisi Jasa Keuangan (FSC) yang memberlakukan syarat lebih ketat bagi investor ritel untuk masuk ke ETF leverage saham tunggal, yang langsung mengurangi volume perdagangan instrumen tersebut hingga sekitar 50%. Kebijakan ini diperkirakan dapat menekan volatilitas pasar. Citigroup mempertahankan target indeks KOSPI di level 10.000 poin, menyoroti memudarnya angin penentu likuiditas. Analis mereka menilai faktor fundamental seperti industri chip memori yang solid, valuasi historis yang rendah, fundamental ekonomi Korea yang kuat, dan dukungan kebijakan berpotensi menjadi pendorong bagi pasar.

marsbit3j yang lalu

Rekor! Beli Bersih Asing 7,2 Triliun Won dalam Sehari, Wall Street: Tekanan Likuiditas di Pasar Saham Korea Telah Mereda

marsbit3j yang lalu

Pembaruan! AI Generasi Berikut OpenAI Pecahkan 10 Masalah Kelas Medali Fields

**OpenAI Model Astra Pecahkan 10 Masalah Matematika Kelas Fields Medal!** OpenAI mengumumkan terobosan besar dari model internal terbarunya, Astra. Model ini dilaporkan telah membuat kemajuan signifikan dalam **10 masalah matematika yang belum terpecahkan**, dengan biaya komputasi hanya sekitar **$2000**. Hasilnya dipublikasikan dalam makalah setebal 249 halaman. Beberapa pencapaian utama meliputi: 1. **Menyelesaikan masalah "non-sofic group"** yang diajukan Mikhail Gromov tahun 1999, dengan membangun contoh kelompok yang tak hingga dan finitely presented yang bukan sofic. Ini dianggap sebagai kemajuan bersejarah. 2. **Memecahkan batas lama dalam masalah pengepakan bola berdimensi tinggi** (sphere packing), meningkatkan batas yang telah bertahan sejak 1978 untuk dimensi tak hingga. 3. **Menyangkal dugaan "Connes Rigidity"** dengan membangun keluarga tak terhitung dari kelompok berbeda yang menghasilkan aljabar von Neumann yang sama persis. Semua bukti telah diverifikasi menggunakan asisten pembuktian formal Lean 4, memastikan ketepatannya. Para ahli matematika menyebut temuan ini sebagai **momen bersejarah**, setara dengan prestasi penghargaan Fields Medal, dan menandai kemampuan AI untuk melakukan penalaran matematika mendalam di berbagai bidang. OpenAI juga membagikan proses penalaran model, menunjukkan langkah maju yang besar menuju AGI (Artificial General Intelligence).

marsbit4j yang lalu

Pembaruan! AI Generasi Berikut OpenAI Pecahkan 10 Masalah Kelas Medali Fields

marsbit4j yang lalu

Bagaimana Membuat Diri Sendiri Tak Tergantikan oleh Kecerdasan Buatan

**Ringkasan: Bagaimana Membuat Diri Anda Tak Tergantikan oleh AI** Artikel ini membahas ancaman nyata dalam era AI: bukan kehilangan pekerjaan, tetapi "perbudakan gaji"—ketergantungan pada sistem dan orang lain untuk bertahan hidup. Solusinya adalah menjadi "individu super" yang "tak bisa dipekerjakan" dengan membangun bisnis atau karya sendiri. Kunci untuk bertahan dan berkembang di masa depan bukan sekadar keterampilan teknis, tetapi menguasai lima elemen yang sulit digantikan AI: 1. **Otonomi (Agency):** Kemampuan bertindak tanpa menunggu perintah. 2. **Rasa (Taste):** Pengalaman untuk mengetahui apa yang bernilai untuk ditawarkan. 3. **Kemampuan Persuasi (Persuasion):** Keterampilan menarik perhatian dan pengakuan. 4. **Ketekunan (Persistence):** Memahami bahwa kegagalan adalah bagian dari proses. 5. **Iterasi (Iteration):** Kemampuan memperbaiki kesalahan berdasarkan umpan balik. Lima elemen ini dapat dikembangkan dengan **membuat konten** (media). Dibandingkan pemrograman (code), konten lebih unggul karena nilainya subjektif, membutuhkan penilaian manusia, dan merupakan alat distribusi yang ampuh untuk membangun koneksi dan otoritas. Untuk memulai transformasi: 1. **Ubah lingkungan** Anda secara drastis untuk memicu perubahan identitas. 2. **Gali "bahan mentah"** Anda: Identifikasi pengetahuan mendalam, masalah yang pernah Anda selesaikan, dan minat unik masa kecil Anda. 3. **Temukan "poros pemikiran balik"**: Tentukan pendapat kontra-intuitif atau keyakinan Anda yang bertentangan dengan arus utama dalam bidang Anda. 4. **Luncurkan ide pertama** Anda besok. Gabungkan jawaban dari langkah 2 dan 3, lalu publikasikan. Umpan balik nyata dari dunia adalah guru terbaik. Intinya, bangunlah *karier seumur hidup* yang autentik berdasarkan pengalaman dan sudut pandang unik Anda. Dengan memanfaatkan AI sebagai alat dan fokus pada pengembangan diri yang tak tergantikan, Anda dapat mengambil kendali atas hidup dan masa depan Anda.

marsbit5j yang lalu

Bagaimana Membuat Diri Sendiri Tak Tergantikan oleh Kecerdasan Buatan

marsbit5j yang lalu

Kunci Bitcoin Disimpan Offline Berkat Lemparan Dadu, Tetapi Tidak Semua Orang Akan Melakukannya

Berdasarkan insiden kerentanan generator angka acak pada perangkat hardware wallet Coldcard, artikel ini membahas metode pembuatan kunci Bitcoin menggunakan dadu untuk menghasilkan entropi mandiri. Claude Shannon mengukur ketidakpastian dengan konsep entropi, di mana satu lemparan dadu enam sisi setara dengan sekitar 2,585 bit. Praktik melempar dadu 50 hingga 99 kali dapat menghasilkan frasa pemulihan 12 kata yang aman (128 bit entropi), melampaui ketergantungan pada generator perangkat. Namun, insiden Coldcard mengungkap bahwa meskipun seed utama dibuat dari dadu aman, fungsi lain seperti dompet kertas, kunci kloning, dan password masih berpotensi menggunakan generator cacat. Peneliti keamanan Kevin Loaec menekankan bahwa perlindungan hanya berlaku untuk seed utama, bukan keseluruhan sistem. Proses manual ini membutuhkan ketelitian tinggi, rentan kesalahan, dan tidak praktis bagi kebanyakan pengguna baru. Oleh karena itu, meski kuat secara matematis, metode ini lebih cocok untuk pengguna berpengalaman. Artikel menyarankan pemilik Coldcard untuk memperbarui firmware, memeriksa fungsi yang pernah digunakan, dan mempertimbangkan skema multisignature dengan perangkat dari produsen berbeda untuk mitigasi risiko. Tujuan jangka panjang adalah perangkat yang dapat menghasilkan keacakan kuat secara mandiri, tanpa memerlukan prosedur rumit dari pengguna.

cryptonews.ru8j yang lalu

Kunci Bitcoin Disimpan Offline Berkat Lemparan Dadu, Tetapi Tidak Semua Orang Akan Melakukannya

cryptonews.ru8j yang lalu

Trading

Spot

OpenAI Completely Overhauls Codex, Grows Its Own Mouse, Schedules Itself to Outwork Humans

Abstrak

You Do Your Work, It Runs Xcode in the Background

Click to Modify, No More Code Hunting for Front-End Debugging

Over 90 Plugins Launched, The Entire Toolchain Welded into Codex

AI Starts Scheduling Itself

Daily Needs Filled in One Go

The First Piece of the Super App Puzzle

If You Can't Beat Them, Infiltrate: Made an Official Plugin for Anthropic

Pertanyaan Terkait

Bacaan Terkait

Dialog dengan Ray Dalio: Saat Ini Berada dalam Gelembung AI, 1% Portofolio Investasi Adalah Bitcoin

Rekor! Beli Bersih Asing 7,2 Triliun Won dalam Sehari, Wall Street: Tekanan Likuiditas di Pasar Saham Korea Telah Mereda

Pembaruan! AI Generasi Berikut OpenAI Pecahkan 10 Masalah Kelas Medali Fields

Bagaimana Membuat Diri Sendiri Tak Tergantikan oleh Kecerdasan Buatan

Kunci Bitcoin Disimpan Offline Berkat Lemparan Dadu, Tetapi Tidak Semua Orang Akan Melakukannya

Trading

Kategori Populer

Tag Populer