A Research Preview of Codex

(openai.com)

511 points meetpateltech | 2 comments | 16 May 25 15:02 UTC | HN request time: 0.413s | source

Show context

johnjwang ◴[16 May 25 16:27 UTC] No.44007301[source]▶

Some engineers on my team at Assembled and I have been a part of the alpha test of Codex, and I'll say it's been quite impressive.

We’ve long used local agents like Cursor and Claude Code, so we didn’t expect too much. But Codex shines in a few areas:

Parallel task execution: You can batch dozens of small edits (refactors, tests, boilerplate) and run them concurrently without context juggling. It's super nice to run a bunch of tasks at the same time (something that's really hard to do in Cursor, Cline, etc.)

It kind of feels like a junior engineer on steroids, you just need to point it at a file or function, specify the change, and it scaffolds out most of a PR. You still need to do a lot of work to get it production ready, but it's as if you have an infinite number of junior engineers at your disposal now all working on different things.

Model quality is good, but hard to say it's that much better than other models. In side-by-side tests with Cursor + Gemini 2.5-pro, naming, style and logic are relatively indistinguishable, so quality meets our bar but doesn’t yet exceed it.

replies(15): >>44007420 #>>44007425 #>>44007552 #>>44007565 #>>44007575 #>>44007870 #>>44008106 #>>44008575 #>>44008809 #>>44009066 #>>44009783 #>>44010245 #>>44012131 #>>44014948 #>>44016788 #

Jimmc414 ◴[16 May 25 16:54 UTC] No.44007575[source]▶

>>44007301 #

> We’ve long used local agents like Cursor and Claude Code, so we didn’t expect too much.

If you don't mind, what were the strengths and limitations of Claude Code compared to Codex? You mentioned parallel task execution being a standout feature for Codex - was this a particular pain point with Claude Code? Any other insights on how Claude Code performed for your team would be valuable. We are pleased with Claude Code at the moment and were a bit underwhelmed by comparable Codex CLI tool OAI released earlier this month.

replies(1): >>44007708 #

t_a_mm_acq ◴[16 May 25 17:07 UTC] No.44007708[source]▶

>>44007575 #

Post realizing CC can operate same code base, same file tree on different terminals instances, it's been a significant unlock for us. Most devs have 3 running concurrently. 1. master task list + checks for completion on tasks. 2. operating on current task + documentation. 3. side quests, bugs, additional context.

rinse and repeat once task done, update #1 and cycle again. Add in another CC window if need more tasks concurrently.

downside is cost but if not an issue, it's great for getting stuff done across distributed teams..

replies(1): >>44007858 #

1. naiv ◴[16 May 25 17:21 UTC] No.44007858[source]▶

>>44007708 #

do you have then instance 2 and 3 listening to instance 1 with just a prompt? or how does this work?

replies(1): >>44009901 #

2. naiv ◴[16 May 25 21:21 UTC] No.44009901[source]▶

>>44007858 (TP) #

to answer my own questions , it is actually laid out in chapter 6 of https://www.anthropic.com/engineering/claude-code-best-pract...

↑