🎧How OpenAI’s Codex Team Uses Their Coding Agent
Thibault Sottiaux and Andrew Ambrosino on product strategy, the workflows they rely on, and why speed creates a new bottleneck

by Rhea Purohit

TL;DR: Today, we're releasing a new episode of our podcast AI & I, where Dan Shipper sits down with two members of the team building OpenAI's coding agent, Codex: Thibault Sottiaux, head of Codex, and Andrew Ambrosino, member of technical staff on the Codex app. Watch on X or YouTube, or listen on Spotify or Apple Podcasts.

A little after 4 p.m. PT on Super Bowl Sunday, a wave of people took their eyes off the game to download a coding agent. It wasn't the wings, the beer, or Bad Bunny that inspired them. It was one of the many AI ads that aired: specifically, OpenAI's plug for its coding agent, Codex. Thibault Sottiaux, head of Codex, and Andrew Ambrosino, a member of technical staff on the Codex app, say their systems came under heavy load almost immediately after the spot aired. Even better, a lot of people also reached out to tell them that the ad inspired them to build, they told Dan Shipper on AI & I this week.

The conversation caps off a few busy weeks for the Codex team: Since the start of February, they've shipped a desktop app; GPT-5.3 Codex, a new flagship model; and a research preview of a model that's almost too fast to follow. The momentum is showing up in the numbers, too. Usage has grown fivefold since the start of the year, and more than a million people now use Codex each week.

Dan talks to the pair about the strategy decisions behind what they've built, the workflows they rely on inside Codex, and how a lightning-fast model potentially solves the next bottleneck for coding agents. Here is a link to the episode transcript.
You can check out their full conversation:

Here are some of the themes they touch on:

A look inside OpenAI's strategy to ship new products

Sottiaux and Ambrosino talk Dan through the decisions and tradeoffs that went into their new launches.

A warmer coding agent that's still for builders

As we've written, GPT-5.3 Codex feels more user-friendly than its predecessors (warmer and more creative) while still maintaining its technical prowess. That shift was echoed in OpenAI's decision to run a Super Bowl ad for Codex rather than ChatGPT, signaling a bet that coding agents are ready for a mainstream audience.

Still, OpenAI views Codex as its most powerful coding tool, one that requires a certain level of technical fluency. Sottiaux describes the target user as "technical" or "technical adjacent" (someone with familiarity in areas like data science, for example) and says that to get the most out of it, you should be able to read code. For the wider, less technical audience, he adds, OpenAI plans to eventually bring a similar experience into ChatGPT, which will not assume engineering literacy of its users.

Even so, the team is adamant that professional developers deserve a "dedicated experience." Ambrosino notes that while the Codex app shares clear DNA with ChatGPT (features such as the central chat-style interface and the auto-named conversations), it was purpose-built to "showcase the power of the models and the way [they] could change the [software] development lifecycle."

OpenAI believes your coding agent doesn't belong in a terminal or the IDE

The decision to build a dedicated graphical user interface (GUI), a visual, point-and-click interface rather than a text-only terminal approach, for the Codex app was, by the team's own admission, a break from trending design choices.
Ambrosino describes the app as a "daily driver," with the terminal and the integrated development environment (IDE), an all-in-one environment for writing code where many developers have traditionally worked, reserved for the occasional specialized task. According to them, a terminal works well for firing off quick tasks, but starts to feel limiting once agents become multimodal (drawing diagrams, generating images, and responding to voice instructions) or once you're running several in parallel and need to keep track of them all.

OpenAI designed Codex to dynamically show only the tools and views you need at that moment. "We came to the conclusion that…these models are great at knowing what's needed…for what type of task," Ambrosino says. Sottiaux adds that the AI is already acting on far more than just code (like filing tickets in project management software Linear and posting to Slack) and cramming all of that into an IDE "would feel very odd."

How the Codex team teaches AI to read between the lines

Achieving a balance between the model being good at following instructions and intuiting user intent is something the team obsesses over. Codex has historically excelled at the former, but when they optimize too hard in that direction, the model starts to overindex on literal wording and miss intent in ways a human never would. Sottiaux takes the example of a typo in a prompt that ends up verbatim in the code, rather than the model inferring what you obviously meant.

The team is also investing in what they call "personalities," essentially a measure of how supportive or blunt the model should be. While the previous default leaned heavily toward terse and direct, now there's a friendlier, more supportive option, and users can toggle between the two. Both Sottiaux and Ambrosino still use the pragmatic "personality." "You should feel like you have your own little personal Codex," Sottiaux says, "that works in exactly the way that you want it to work."
How the Codex team uses its own AI

Two features make the Codex app especially powerful: "automations," which let you schedule prompts to run hourly, daily, or at whatever cadence you set, and "skills," which bundle instructions so that Codex can connect to external tools and run workflows that go beyond code generation, including research, reporting, and writing. These are a few automations and skills that Sottiaux and Ambrosino find useful:
Speed is a dimension of intelligence

We said GPT-5.3-Codex-Spark, the smaller, speed-optimized version of OpenAI's GPT-5.3 Codex, is fast enough to blow your hair back, and it is. "We do slow it down ever so slightly, just so you can see the words come in a little bit smoother," Ambrosino says. The team sees speed unlocking different ways of working
Code review is the next bottleneck

With speed close to being solved, the next bottleneck is review. Models can generate code faster than ever. But will it be bug-free? Will the button in the settings panel do what it's supposed to? That still requires a human to click through the app and check for consistency.

The OpenAI team is exploring what that process looks like with AI involved. The Codex app already has a review mode that annotates diffs (side-by-side comparisons showing exactly which lines of code were added, removed, or changed). Speed helps here, too, Sottiaux adds. A faster model that helps you understand the code you're reviewing offsets some of the pressure created by the sheer volume now being produced.

Ambrosino hints at a more ambitious direction: If a model can prove a bug fix works by retracing the exact click path a user would take, code review, as we know it, might matter less; you'd verify the outcome directly rather than reading the code as a proxy. The team already has skills in the Codex app that click around the app, screenshot the results, and attach them to a pull request to show what changed (and why it works).

What do you use AI for? Have you found any interesting or surprising use cases? We want to hear from you, and we might even interview you.

Timestamps
You can check out the episode on X, Spotify, Apple Podcasts, or YouTube. Links are below:
Miss an episode? Catch up on Dan's recent conversations with LinkedIn cofounder Reid Hoffman; the team that built Claude Code, Cat Wu and Boris Cherny; Vercel cofounder Guillermo Rauch; podcaster Dwarkesh Patel; and others, and learn how they use AI to think, create, and relate. If you're enjoying the podcast, here are a few things I recommend:
Rhea Purohit is a contributing writer for Every focused on research-driven storytelling in tech. You can follow her on X at @RheaPurohit1 and on LinkedIn.
