I can highly recommend these talks to get your eyes slightly opened to how stuck we are in a local minimum.
Sure, code sweatshops have a very different percentage of the above, but that's a completely different game altogether.
Vibe coding is going to make this so much worse; the tech debt of load-bearing code that no one really understands is going to be immense.
Just to quote one little bit from the piece regarding Google: "In other words, there have been numerous dead ends that they explored, invalidated, and moved on from. There's no knowing up front."
Every time you change your mind or learn something new and you have to make a course correction, there's latency. That latency is your development velocity. The way to find the right answer isn't to think very hard and miraculously come up with the perfect answer. It's to try every goddamn thing that shows promise. The bottleneck for that is 100% development speed.
If you can shrink your iteration time, then there are fewer meetings trying to determine prioritization. There are fewer discussions and bargaining sessions you need to do. Because just developing the variations would be faster than all of the debate. So the amount of time you waste in meetings and deliberation goes down as well.
If you can shrink your iteration time between versions 2 and 3, between versions 3 and 4, etc., the advantage compounds over your competitors. You find promising solutions earlier, which lead to new promising solutions earlier. Over an extended period of time, this is how you build a moat.
I use Python differently because uv made many things faster and less costly. Stuff I used to do in bash is now in Python. Stuff I wouldn't do at all because 3rd-party modules were an incompressible expense, I now do because the cost is low.
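For context, this is the kind of throwaway script uv makes cheap: inline dependency metadata that uv resolves on the fly. A minimal sketch; the file name and URL are just illustrative:

    # /// script
    # dependencies = ["requests"]
    # ///
    # Run with: uv run check_health.py
    import requests

    # The sort of one-off that used to be a curl-in-bash job; now pulling
    # in a 3rd-party module costs effectively nothing.
    resp = requests.get("https://example.com/health")
    print(resp.status_code, resp.elapsed.total_seconds())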
Same with AI.
Every week, there was a small tool I actively chose not to develop, because I knew that automating the thing would save less time than coding it would take.
E.g., I regularly send documents from my hard drive, or forward mails, to a specific email address for accounting. It would be nice to be able to do those in one click. But developing a Nautilus script or Thunderbird extension to save at most a minute a day doesn't make sense.
Except now, with Claude Code, it does. In a week, they paid off. And now I'm racking up the minutes.
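To give a flavor, one of those generated tools is roughly this: a Nautilus script that mails the selected files to accounting. A sketch rather than the actual script; the addresses and the local SMTP relay are hypothetical placeholders:

    #!/usr/bin/env python3
    # Drop into ~/.local/share/nautilus/scripts/ and mark executable.
    import os
    import smtplib
    from email.message import EmailMessage
    from pathlib import Path

    # Nautilus hands the selected files to the script via this env var,
    # newline-separated.
    paths = os.environ.get("NAUTILUS_SCRIPT_SELECTED_FILE_PATHS", "").splitlines()

    msg = EmailMessage()
    msg["From"] = "me@example.com"        # placeholder
    msg["To"] = "accounting@example.com"  # placeholder
    msg["Subject"] = "Documents for accounting"

    for p in filter(None, paths):
        msg.add_attachment(Path(p).read_bytes(),
                           maintype="application", subtype="octet-stream",
                           filename=Path(p).name)

    with smtplib.SMTP("localhost") as s:  # assumes a local relay
        s.send_message(msg)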
Now each week, I'm getting a new tool that is not only saving me minutes, but also reducing context switching. Those turn into hours, which turn into days. These compound.
And of course, getting an MVP or a new feature demo out of the door quickly allows you to get feedback faster.
In general, AI lets you get a shorter feedback loop. Trash bad concepts sooner. Get crucial info faster.
Those do speed up a project.
Research and thinking is always going to be the bottleneck.
But with LLMs I'm not so sure. I feel like I can skip the effort of typing, which is still effort, despite years of coding. I feel like I actually did end up spending quite a lot of time doing trivial nonsense like figuring out syntax errors and version mismatches. With an LLM I can conserve more of my attention on the things that really matter, while the AI sorts out the tedious things.
This in turn means that I can test more things at the top architectural level. If I want to do an experiment, I don't feel a reluctance to actually do it, since I now don't need to concentrate on it, rather I'm just guiding the AI. I can even do multiple such explorations at once.
With LLMs, you can type so much faster! So we should be going faster! It feels faster!
(We are not going faster.)
But your definition, the right one, is spot on. The pace of learning and decisions is exactly what drives development velocity. My one quibble is that if you want to learn whether something is worth doing, implementing it isn't always the answer. Prototyping vs. production-quality implementation is different, even within that. But yeah, broadly, you need to test and validate as many _ideas_ as possible, in order to make as many correct _decisions_ as possible.
That's one place I'm pretty bullish on AI: using it to explore/test ideas, which otherwise would have been too expensive. You can learn a ton by sending the AI off to research stuff (code, web search, your production logs, whatever), which lets you try more stuff. That genuinely tightens the feedback loop, and you go faster.
I wrote a bit more about that here: https://tern.sh/blog/you-have-to-decide/
This is /especially/ true in software in 2025, because most products are SaaS or subscription based, so you have a consistent revenue stream that can cover ongoing development costs, which gives you the necessary runway to iterate repeatedly. Development costs then become relatively stable for a given team size, and the velocity of that team entirely determines how often you can iterate, which determines how quickly you find an optimal solution and derive more value.
With the LLM, I really can spend most of my time on the verification problem.
It's basically the wetware equivalent of page thrashing.
My experience is that I write better code faster by turning off the AI assistants and configuring the IDE to produce suggestions that are as deterministic and fast as possible; that way they become a rapid shorthand. This makes for a fast way of writing code that doesn't lead to mental model thrashing, since the model can be updated incrementally as I go.
The exception is using LLMs to straight up generate a prototype that can be refined. That also works pretty well, and largely avoids the expensive exchanges of information back and forth between human and machine.
This has been my experience as well :/
Depending on your subject matter you might only need an idea or two per 100 LOC generated. So much of what I used to do turns out to be grunt work that was simply pattern matching on simple heuristics, but I can churn out 5-10 good ideas per hour it seems, so I'm definitely rate limited on coding.
Similar to your comment on architectural experiments, one thing I have been observing is that the critical path doesn't go 10x faster, but by multiplexing small incidental ideas I can get a lot more done. E.g., "it would be nice if we had a new set of integration tests that stub this API in some slightly tedious way, go build that".
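For the curious, the kind of slightly tedious stub that's perfect to delegate looks something like this. A minimal sketch using Python's stdlib mocking; invoice_sync.sync_one and billing_client are hypothetical stand-ins for the code under test and the third-party API it calls:

    import unittest
    from unittest.mock import patch

    # Hypothetical module under test; it calls billing_client.get_invoice().
    from invoice_sync import sync_one

    class InvoiceSyncTest(unittest.TestCase):
        # Patch the API at the seam where sync_one uses it.
        @patch("invoice_sync.billing_client")
        def test_missing_invoice_is_skipped(self, mock_billing):
            mock_billing.get_invoice.return_value = None  # API: no such invoice
            self.assertFalse(sync_one("INV-42"))
            mock_billing.get_invoice.assert_called_once_with("INV-42")

    if __name__ == "__main__":
        unittest.main()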
It’s very rare to not touch up code, even when writing new features. Knowing where to do so in advance (and planning to not have to do that a lot) is where velocity is. AI can’t help.
That's what slows me down with AI tools, and why I ended up sticking with GitHub Copilot, which does not do any of that unless I prompt it to.
The current trend in anti-vibe-coding articles is to take whatever the vibe coding maximalists are saying and then stake out the polar opposite position. In this case, vibe coding maximalists are claiming that LLM coding will dramatically accelerate time to market, so the anti-vibe-coding people feel like they need to claim that development speed has no impact at all. Add a dash of clickbait (putting "development speed" in the headline when they mean typing speed) and you get the standard LLM war clickbait article.
Both extremes are wrong, of course. Accelerating development speed is helpful, but it's not the only factor that goes into launching a successful product. If something can accelerate development speed, it will accelerate time to market and turnaround on feature requests.
I also think this mentality appeals to people who have been stuck in slow moving companies where you spend more time in meetings, waiting for blockers from third parties, writing documents, and appeasing stakeholders than you do shipping code. In some companies, you really could reduce development time to 0 and it wouldn't change anything because every feature must go through a gauntlet of meetings, approvals, and waiting for stakeholders to have open slots in their calendars to make progress. For anyone stuck in this environment, coding speed barely matters because the rest of the company moves so slow.
For those of us familiar with faster moving environments that prioritize shipping and discourage excessive process and meetings, development speed is absolutely a bottleneck.
The vibe coders can deliver happy-path results pretty fast, but I've already seen that within 2 months it starts to fall apart quickly and has to be extensively refactored, which ultimately ends up taking more time than if it had been done with quality in mind in the first place.
And supposedly the free market makes companies “efficient and logical”
You can't test or evaluate something that doesn't work yet.
Even CEOs of car companies get fired because they mess this up. Sonos lost a lot of value, and got its CEO fired, because they messed up and couldn't fix it in time.
Speed is not everything. Developing the right features (what users want) and quality are the most important things, but development speed allows you to test features, fix things fast, and course-correct.
Whether you call yourself an engineer, developer, programmer, or even a coder is mostly a localized thing, not an evaluation of expertise.
We're confusing everyone when we pretend a title reflects how good we are at the craft, especially titles we already use to refer to ourselves without judgement. At least use script kiddie or something.
Two/three months to code everything ("It's maximum priority!"), about four to QA, and then about a year to deploy to individual country services by the ops team.
During the test and deploy phases, the developers were just twiddling their thumbs, because ops refused to allow them access and product refused to take in new projects due to the possibility of developers having to go back to the code.
It took the CEO to intervene and investigate the issues, and the CTO's college best friend that was running DevOps was demoted.
Maybe the real skynet will kill us with ticking time bomb software bugs we blindly accepted.
But there are people with great product taste who can know by trying a product whether it meets a real user need - some of these are early-adopter customers, sometimes they are great designers, sometimes PMs. And they really do need to try a product (or prototype) to really know whether it works. I was always frustrated as a junior engineer when the PM would design a feature in a written spec, we would implement it, and then when trying it out before launch, they would want to totally redesign it, often in ways which required either terrible hacks or significant technical design changes to meet the new requirements. But after 15 years of seeing some great ideas on paper fall flat with our users, and noticing that truly exceptional product people could tell exactly what was wrong after the feature was built but before it was released to users, I learned to be flexible about those sorts of rewrites. And it’s exactly that sort of thing that vibecoding can accelerate
When we were writing a compiler at Sycor, there were teams waiting for us to finish our development. We were successful, being about an order of magnitude faster than the effort we replaced.
And just because google cancels products doesn't suggest anything about development speed.
If I were an LLM advocate (having much fun currently with gemini), I would let the criticism roll and make book using LLMs.
My new paradigm is something like:
- write a few paragraphs about what is needed
- have the bot take in the context and produce a prototype solution outside of the main application
- have the bot describe main integration challenges
- do that integration myself — although I’m still somewhat lazy about this and keep trying to have the bot do it after the above steps; it seems to only have maybe 50% success rate
- obviously test thoroughly
Perhaps I've just misunderstood the point, but it seems like a nonsensical argument.
Also, in the past I've done interactive maps and charts for different media organizations, and people would often debate for a considerable amount of time whether to, for example, make a bar or line chart (the actual questions and visualizations themselves were usually more sophisticated).
I remember occasionally suggesting prototyping both options and trying them out, and intuitively that usually struck people as impractical, even though it would often take less time than the discussions and yield more concrete results.
I think those “fall apart in 2 months” kinds of projects will still keep happening, but some of us had that experience and are refining our use of the tools. So I think in the future we will see a broader spread of “percent generated code” and degrees of success
The whole Lean Startup was about figuring out how to validate ideas without actually developing them. And it is as relevant as ever, even with AI (maybe, especially with AI).
In fact, it's enough to look at the appalling rate of product success. We commonly agree that 90% of startups fail. The majority of that cohort have built things that shouldn't have been built at all in the first place. That's utter waste.
If only, instead of focusing on building more, they stopped and reevaluated whether they were building the right thing in the first place. Yet, most startups are completely immersed in the "development as a bottleneck" principle. And I say that from our own experience of 20+ years of helping such companies build their early-stage products. The biggest challenge? Convincing them to build less, validate, learn, and only then go back to further development.
When it comes to existing products, it gets even more complex. The quote from Leah Tharin explicitly mentions waiting weeks/months until they were able to get statistically significant data. What follows is that within that part of experimentation, they were blocked.
Another angle to take a look at it is the fundamental difference in innovation between Edison/Dyson and Tesla.
The first duo was known for "I have not failed. I found 10,000 ways that don't work." They were flailing around with ideas till something eventually clicked.
Tesla, in contrast, would be at the Einstein end of the spectrum with "If I had an hour to solve a problem, I'd spend 55 minutes thinking about the problem and 5 minutes thinking about [or in Tesla's case, making] solutions."
While most of the product companies would be somewhere in between, I'd argue that development is a bottleneck only if we are very close to Edison/Dyson's approach.
But fine, let's take the subset of features / projects that can be tested or somehow validated. In my experience (having worked for 13+ years at companies that prefer to A/B test almost everything), more than half of the tests fail. People might initially think the solution is to have better ideas, cook them longer, do better analysis. That's usually wrong. I've seen PhDs with 20+ years of experience in a given industry (Search) launch experiments, and they still fail.
The solution is to have some sort of "just enough" analysis like user studies, intuition, and business needs, and launch as fast and as many as you can. Therefore, development speed is A bottleneck (there's no Silver Bullet so it's not THE bottleneck).
We could go with that perception, however, only if we assume that whatever is in the backlog is actually the right thing to build. If we knew that every feature has value to the customers and (even better) they are sorted from the most valuable to the least valuable one.
In reality, many features have negative value, i.e., they hurt performance, customer satisfaction, any key metric a company employs.
The big question: can we check some of these before we actually develop a fully-fledged feature? The answer, very often, is positive. And if we follow up with an inquiry about how to validate such ideas without development, we will find a way more often than not.
Teresa Torres' Continuous Discovery Habits is an entire book about that :)
One of her recurring patterns is the Opportunity Solution Tree, which is a way of navigating across all the possible experiments to focus on the right ones (and ignore, i.e., not develop, all the rest).
We have literally one half-hour-long sync meeting a week. The rest is as lightweight as possible, typically averaging below 10 minutes daily with clients (when all the decisions happen on the fly).
I've worked in the corpo world, too, and it is anything but.
We do use vibe coding a lot in prototyping. Depending on the context, we sometimes have a lot of AI-agent-generated code, too.
What's more, because of working on multiple projects, we have a fairly decent pool of data points. And we don't see much of a speed improvement from the perspective of a project (I wrote more on it here: https://brodzinski.com/2025/08/most-underestimated-factor-es...).
However, developers sure report their perception of being more productive. We do discuss how much these perceptions are grounded in reality, though. See this: https://metr.org/blog/2025-07-10-early-2025-ai-experienced-o... and this: https://substack.com/home/post/p-172538377
So, I don't think I'm biased toward bureaucratic environments, where developers code in MS Word rather than VS Code.
But these are all just one dimension of the discussion. The other is a simple question: are there ways of validating ideas before we turn them into implemented features/products?
The answer has always been a wholehearted "yes".
If development pace were all that counted, the Googles and Amazons of this world would be beating the crap out of every aspiring startup in any niche the big tech cared about, even remotely. And that simply is not happening.
Incumbents are known to be losing ground, and old-school behemoths that still kick butt (such as IBM) do so because they continuously reinvent their businesses.
Do we always have to build it before we know that it will work (or, in 9 cases out of 10, that it will not work)?
Even more so, do we have to build a fully-fledged version of it to know?
If yes, then I agree, development is the bottleneck.
Development speed absolutely is a bottleneck. But coding speed? Like, typing? Yeah, I can definitely type faster than I can think about code, or anything really (typing at 100wpm is a fun party trick but not super useful in the end). Many times over... Even single-finger typists who peck at the keyboard probably can; auto-complete has existed for a long time...
The effort it takes to implement a feature makes it more likely you think twice before you start.
If the effort goes to zero, so does the thinking.
We will turn from programmers to just LLM customers sooner or later.
Because testing if it works can be done by non-programmers.
Cognitively, these are very different tasks. With the former, we actively drive technical decisions (decide on architecture, implementation details, even naming). The latter offers all these decisions made, and we first need to untangle them all before we can scrutinize the details.
What's more, often AI-generated code results in bigger PRs, which again adds to the cognitive load.
And some developers fall into a rabbit hole of starting another thing while they wait for their agent to produce the code. Adding context switching to an already taxing challenge basically fries brains. There's no way for such a code review to consistently catch the issues.
I see development teams defining healthy routines around working with generated code, especially around limiting context switching, but also taking tasks back to be done by hand.
And don't take that as a complaint. It's a basic behavioral observation. What we say we do is different from what we really do. By the same token, what we say we want is different from what we really want.
At the risk of being a bit sarcastic: we say we want regular exercise to keep fit, but we really want doomscrolling on a sofa with a beer in hand.
In the product development context, we have a very different attitude towards an imagined (hell, even wireframed) solution than an actual working piece of software. So it's kinda obvious we can't get it right on the first attempt.
We can be working in the right direction, and many product teams don't even do that. For them, development speed is only a clock counting down the time remaining before VCs pull the plug.
Check out all of the bullshit “AI” companies that YC is funding.
BigTech is not “losing ground”; all of them are reporting increasing revenues and profits.
GPT-2 was barely capable of writing two lines of code. GPT-3.5 could write a simple code snippet, and be right more often than it was wrong. GPT-4 was a leap over that, enabling things like "vibe coding" for small simple projects, and GPT-5 is yet another advancement in the same direction. Each AI upgrade brings forth more capabilities - with every upgrade, the AI can go further before it needs supervision.
I can totally see the amount of supervision an AI needs collapsing to zero within our lifetimes.
VP of Product put all the pressure on dev teams to deliver all the features against the specs. Then they release the new product/new version with plenty of fanfare.
And then literally no one measures which parts have actually delivered any value. I'd bet a big part of that code added no value, so it's pure waste. Some other parts were actually harmful. They frustrated users, drove key metrics down, or what have you. They are worse than waste.
But no one cared to check. Good product people, and there are precious few of them, would follow up with validation of what worked and what did not. They would argue against "major" releases whenever possible.
And seriously, if Amazon can avoid major releases, almost anyone could.
Suddenly, we might flip the script and have a VP of Product not asking "when will it be done?" but rather trying to figure out what the next most sensible experiments are.
They don't understand that this AI was built decades ago and has been improved on several times over: Compilers & Interpreters. Furthermore, you don't need billion-dollar neural-network supercomputers, just a vanilla laptop.
It's because of how you talk about the job, though. We automate every other kind of "coding" - why can't we automate yours?
Accompanying many early-stage startups in their journey, I see how often the development (which we're responsible for) takes a back seat. Sometimes the pivotal role will be customer support, sometimes it will be business development, and often product management will drive the whole thing.
And there's one more follow-up thought to this observation. Products that achieve success inevitably get into a spiral of gaining more features. That, in turn, makes them more clunky and less usable, and ultimately opens the way for new players who disrupt the niche.
At some point, adding more features in general makes things worse--too complicated, too overwhelming, making it harder to accomplish the core task. And yet, adding new stuff never ceases.
In the long run, the best tactic may actually be to go slower (and stop at some point), but focus on the meaningful changes.
LLMs are a tool that added a new dimension to explore. While I, like many, haven't felt actual gains, others are finding them, and time will allow us to better judge whether those can lead to long-term impacts on the economy.
Just based on what I've been reading and experiencing:
- Short-term POCs can reach the validation stage faster.
- Mature cloud software needs a lot of extra tooling (LLMs don't understand the codebase, there's a lack of places to derive good context from, and so on).
- Anything in between for cloud seems to be hit or miss, where people are mostly trading first-iteration time for more refactoring later down the line.
From another perspective, areas of software where things are a lot more about numbers (cpu time, memory consumption, and so on), may benefit a lot from faster development/coding as the validation phase is either shorter or can be executed in parallel.
The key reality here is that I've been observing higher expectations for deliveries without proof that we actually got better at coding in general. Which means that sacrifices are being made somewhere.
No code -> no software.
Because they generate so much code that often passes initial tests, looks reasonable, and fails in nonhuman ways, in a pretty opinionated style tbh.
I have less context (and need to spend much more effort and supervision time to get up to speed) to fix, refactor, and integrate the solutions than if I were only trusting short, few-line windows at a time.
A question: what if all those activities are to build a feature that will harm user retention or a product no one wants?
A follow-up question: what if we could have known that up front, or there was a simple way to learn that?
Because so often we build stuff that shouldn't have been built in the first place (appalling startup success rate is probably a good enough statistical measure of that). And yes, there are ways to learn that we're building the wrong thing, other than building a fully-fledged version of it.
Other than that the discovery process of what you should build is the hardest and costliest part, the main conclusion from the article seems to be that if you outsource the first iterations to AI via vibe-coding, you will have a much harder time changing and evolving it from there (iterating); to this, I agree.
That is because you are trained in the old way of writing code: manual crafting of software line by line, slowly, deliberately, thoughtfully. New generations of developers will not use the same workflow as you, just like you do not use the same workflow as folks who programmed punch cards.
I like this metaphor. Looking at a map, we may get a pretty good understanding of whether it's a place we'd like to spend time, say, on vacation.
We don't physically go to a place to scrutinize it.
And we don't limit ourselves to maps only. We check reviews, ask friends, and what have you. We do cheap validation before committing to a costly decision.
If we planned vacations the way we build software products, we'd just go there (because the map is not the territory), learn that the place sucks, and then we'd complain that finding good vacation spots is costly and time-consuming. Oh, and we'd mention that traveling is a bottleneck in finding good spots.
This is often a CTO putting pressure on a dev manager when the bottleneck is ops, or product, or putting pressure on product when the bottleneck is dev.
The normal rationalization is that "you should be putting pressure on them".
The actual reason is that they are putting pressure on you as a show of force, rather than actually wanting it to go faster.
This is why the only response to a bad manager is to run away.
The only way these tools can possibly be faster for non-trivial work is if you don't give a shit enough about the output to not even read it. And if you can do that and still achieve your goal, chances are your goal wasn't that difficult to begin with.
That's why we're now consistently measuring individuals to be slower using these tools even though many of them feel faster.
It is about designing good experiments, validating, and learning, so that when we're down to development, we build something that's way more likely to succeed.
The fact that we were advised to build non-technical experiments is but a small part. And with the current AI capabilities, we actually have a new power tool for prototyping that falls neatly into the whole puzzle.
Here's a bit more elaborate argument (sorry for a LinkedIn link): https://www.linkedin.com/posts/pawelbrodzinski_weve-already-...
Disregard parts that explicitly assume that they are relevant only because, in 2013, development was expensive. There are very few parts that you would throw out.
It is telling that, while the article's theme is product management (and its relationship with the pace of development), that context is largely ignored in some comments. It's as if the article's scope was purely what happens within the IDE and/or AI agent of choice.
The whole point is that the perspective necessarily should be broader. Otherwise, we make it a circular argument, really: development is a bottleneck of development.
Well, hard to disagree on that.
Claude crapped out a workable landing page in ~30 seconds of prompting. I updated the copy on the page, total time less than an hour.
The odds of me spending more than an hour just picking a color theme for the page or finding the SVG icons it used is pretty much 100%.
------------
I had a bug in some async code, it hit rarely but often enough it was noticeable. I had narrowed down what file it was in, but after over an hour of staring at the code I wasn't finding it.
Popped into cursor, asked it to look for async bugs in the current file. "You forgot to clean up a resource on this line here."
Bug fixed.
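(For anyone curious, that class of bug usually has this shape. A reconstructed sketch, not the actual code; here a semaphore stands in for the leaked resource:)

    import asyncio

    sem = asyncio.Semaphore(5)  # stands in for a small connection pool

    async def handler(i):
        await sem.acquire()
        try:
            if i % 7 == 0:             # the rare path
                raise ValueError(i)
            await asyncio.sleep(0.01)  # the actual work
        finally:
            # The bug was a missing release on the rare path: each failure
            # leaked one permit, until unrelated requests quietly hung.
            sem.release()

    async def main():
        results = await asyncio.gather(
            *(handler(i) for i in range(50)), return_exceptions=True)
        print(sum(isinstance(r, ValueError) for r in results),
              "rare failures, no leaked permits")

    asyncio.run(main())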
------------
"Here is my nginx config, what is wrong with the block I just added for this new site I'm throwing up?"
------------
"Write a regex to do nnnnnn"
------------
"This page isn't working on mobile, something is wrong, can you investigate and tell me what the issues may be?"
Oh that won't go well, all of the models get super confused about CSS at some point and end up in doom spirals applying incorrect fixes again and again.
> Googles and Amazons of this world would be beating the crap out of every aspiring startup in any niche the big tech cared about, even remotely. And that simply is not happening.
This is already a well explored and understood space, to the extent that big tech cos have at times spun teams off to work independently to gain the advantage of startup-like velocities.
The more infra you have, the more overhead you have. Deploying a company's first service to production is really easy, no infra needed, no dev-ops, just publish.
Deploying the 5th service, eh.
Deploying the 50th service, well, by now you need to have a host of meetings before work even starts, to make sure you aren't duplicating effort and that the libraries you use mesh with the department's strategic technical vision. By the time those meetings are done, a startup will have already put 3 things into prod.
The communication overhead within large orgs is also famously non-linear.
I spent 10 years working at Microsoft, then 3 years at HBO Max (a lean tech company: 200 engineers, amazing dev ops), and now I'm working at startups of various sizes.
At Microsoft, pre-Azure, it could take weeks just to get a machine provisioned to test an idea out on. Actually getting a project up and running in a repo was... hard at times. Build systems were complex, tooling was complex, and you sure as hell weren't getting anything pushed to users without a lot of checks in place. Now many of those checks were in place for damn good reasons, wrongly drawn lines on a map inside Windows are a literal international incident[1], and we had separate localizations for different variants of English around the world. (And I'd argue that Microsoft's agility at deploying software around the entire world at the same time is unmatched; the people I worked with there were amazing at sorting through the cultural and legal problems!)
Also if Google launches a new service and it goes down from too much traffic, it is embarrassing. Everything they do has to be scalable and load balanced, just to avoid bad press. If a startup hits the front page of HN and their website goes down from being too popular, they get to write a follow up blog post about how their announcement was so damn popular their site crashed! (And if they are lucky, hit the front page of HN again!)
The differences in designing for different levels of scale are huge.
At Microsoft it was "expect potentially a billion users." At HBO it was "expect tens of millions of users." At many startups it is "if we hit 10k users we'll turn a profit, and we can figure out how to scale out later."
10K DAU is a load balancer and 3 instances of NodeJS (for rolling updates), each running on a potato of a CPU.
> So, I don't think I'm biased toward bureaucratic environments, where developers code in MS Word rather than VS Code.
I've worked in those environments, and the level of engineering quality can be much higher. The number of bugs that can be hammered out and avoided in spec reviews is huge. Technology designs end up being serviceable for years to decades instead of "until the next rewrite". The actual code tends to flow much faster as well, or at least as fast as it can flow in the large sprawling code bases that exist at big tech companies. At other times, those specs are needed so that one has a path forward while working through messy legacy code bases.
Both styles have their place - sometimes you need to iterate quickly and get lots of code down and see what works, other times it is worth thinking through edge cases, usage scenarios, and performance characteristics. Heck, I've done memory bus calculations for different designs. When you are working at that level you don't just "write code and see what works"; you first spend a few days (or a week!) with some other smart engineers and try to narrow down the potential field of what you should even be trying to do!
[1]https://www.upi.com/Archives/1995/09/09/Microsoft-settles-In...
With a more complex code base (and a less popular tech stack), the perceived gains quickly diminish. Beyond a certain level of tech debt, AI-generated code is utterly useless. It's no surprise that we see people who vibe-coded their products with no technical knowledge whatsoever, and now they call professional engineers to untangle the mess.
A software agency I know well responded to the rise of AI with something along the lines of "Now, we'll have plenty of work cleaning up all that mess!" Admittedly, they always specialized in complex/rescue engineering gigs.
However, the "development as a bottleneck" discussion was set here in a broader context. It's not only how efficiently we are able to deliver bits of functionality, but primarily whether we should be building these things in the first place.
Equally for early-stage startups and established products alike, so many features are built because someone said so. At the end of the day, they don't deliver any value (if we're lucky) or are plainly harmful (if we're out of luck).
In such cases, it would have been better if developers actually sipped coffee and read Hacker News rather than coded/developed/engineered stuff.
The context of the article is product development, with a bias toward the commercial part of the ecosystem. And of course, as any picture painted with broad strokes, some generalizations were inevitable.
As a scientist, you definitely are familiar with the weight (or lack thereof) of anecdotal evidence. Unless the claim is "it can never work" or "it always works," my individual experience is just that--an individual experience.
Or on a smaller scale, what's the last genuine Atlassian success?
Yet, when it comes to product innovation, the momentum is always on the side of the new players. Always has been.
Project management/work organization software? Linear. Async communication? Slack. Social Media? TikTok. One has to be curious how Zoom is doing so well, given that all the big competition actually controls the channels for setting up meetings. Self-publishing? Substack. Even with AI, everyone plays catch-up with Sam Altman, and many of the most prominent companies are newcomers.
We could go on and on.
Yes, Big Techs will survive because they have enough momentum to survive events such as Ballmer-era MS. But that doesn't mean they lead product innovation.
And it's expected. Conflicting priorities, growing bureaucracies, shareholders' expectations, old business lines (and more), all make them less flexible.
Paul Buchheit's stories about Gmail and AdSense are good examples. I was an early Gmail user when it was invitation-only and invitations were sparingly distributed (only as fast as the infrastructure could handle).
So, while I understand the difference in PR costs, it's not like they don't have tools to run smaller experiments.
I agree with the huge bureaucracy cost. On the other hand, they really have (relatively) infinite resources if they care to deploy them. And sometimes they do. And they still fail.
They often fail even when they try a Skunk Works-like approach. Google Wave was famously developed as a corporate Lean Startup (before there was Lean Startup). It was a disaster. Precisely because they did close to zero validation pre-release.
A side note: huge flop though it was (although Buzz and Google+ were bigger), it didn't hurt them long-term in PR or reputation.
An innovative product is one where customers in aggregate are willing to pay more for it than it costs to create and run. Any idiot can sell a bunch of dollar bills for 95 cents.
Going back to the latest batch of YC companies, their value play can easily be duplicated by any company in their vertical, either by throwing a few engineers at it or by creating a statement of work for the consulting company I work for, where I can pull together a few engineers and knock it out in a few months, and they will already have customers to sell it to.
There was one recent YC company (of course, one of the BS AI companies) that was hiring a "founding full stack engineer" for $150K. It looks like they were two non-technical "serial entrepreneurs" without even an MVP that YC threw money at.
You can't imagine how many times some harebrained, underfunded startup reached out to me to be a "CTO" that paid less than I made as a mid-level employee at BigTech, with the promise of Monopoly money "equity".
Go ahead and code as much as you want. Unless you can communicate the utility of that code to a paying customer it has no value or relevance.
People criticize Microsoft's historical fiefdom model, and it had its issues, but it also allowed orgs to find what worked for them and basically run independently. Of course it also had orgs fighting with each other and killing off good products.
Xbox was also a skunk works project at Microsoft (a few good books have been written about it!) and so was Microsoft Band. Xbox succeeded, Band failed for a number of reasons not related to the product or execution itself. (Politics and some historical corporate karma).
IMHO the only company good at deploying infinite resources quickly is Apple. 1 billion developing the first Apple Watch (Microsoft spent under 50 million on two generations of Band!), and then they kept going after the market, even though the first version was kinda meh. In comparison, Google Wear was on-again, off-again for years until they finally took it seriously recently. I'm sure they spent lots of $, but the end result is nowhere near what Apple pulled off.
Nobody wants to believe it, but just try compiling C++ on Windows and again in a Linux VM. Linux in a VM on the same host compiles at least twice as fast. It's insanity. I tried a script that rsyncs the project files to my server from 2013, runs the build, and rsyncs the artifacts back. Running the build on a Xeon 2500 is still far faster with Linux than Windows on my two-year-old i9. Even with the overhead of sending binaries over the internet. Absolutely disgusting.
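The script is tiny; roughly this shape (a sketch, with the host, paths, and build command as placeholders, assuming a cmake-configured build dir on the remote box):

    #!/usr/bin/env python3
    # Sync sources to the Linux box, build there, pull the artifacts back.
    import subprocess

    HOST = "buildbox"          # placeholder ssh host
    REMOTE = "~/build/proj/"   # placeholder remote path

    def run(*cmd):
        print("+", " ".join(cmd))
        subprocess.run(cmd, check=True)

    run("rsync", "-az", "--delete", "--exclude", "build/", "./", f"{HOST}:{REMOTE}")
    run("ssh", HOST, f"cd {REMOTE} && cmake --build build -j$(nproc)")
    run("rsync", "-az", f"{HOST}:{REMOTE}build/", "./build/")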
Move faster and move better (to move faster) are the same thing. You reduce costs by going faster, and with lean you go faster by avoiding time wasters.
And few of us can usefully compare what we do with what Amazon, Google, Facebook, or other giants do.
Good luck on flipping their script. Meanwhile I’ll be over here making book
Certainly there's no simple F(num_lines_changed) value function. There are many other parameters. But to suggest, as many here somehow do, that lines of code touched is independent of effective development is plain ludicrous.
The article uses "development" to refer only to the part where code is generated, while you are saying "development" is the process as a whole.
You both agree that latency in the real-world validation feedback loop leads to longer cycles and fewer promising solutions and that is the bottleneck.
Sonos decided they wanted to centralize their architecture so they could tap into it to make extra surveillance money. They trashed things that worked perfectly and replaced them with a cloudshit architecture that nobody asked for and that _cannot_ deliver the same low-latency, quality experience as before. They could have developed things 1000% faster; they would have just driven off a cliff sooner.
Even if people could write apps instantly, nothing would prevent them from being stupid and greedy.
Like, the initial plan always sounds great and looks great. Then it goes to actually do the changes and proclaims victory after I've left it alone doing other stuff, because it takes a while. Then I review what it did and what it didn't do, and I inevitably find that it only did half of what it said it would do, and did half of what it did do incorrectly, despite what it told me it would do.
The use case here is a large code base that needs changes. Not new feature development on a green field (or a green corner of an established product). And it's just so unbearably frustrating. It's like giving the task to a Junior on probation. I tell them something, they go off for 10 minutes and tell me they're done and I look and find seven holes I need to tell them to fix. But they aren't the Junior that picks up stuff and gets better and needs less supervision. Instead it seems like the context gets more and more polluted and the Junior gets closer and closer to failing his probation.
Many grey hairs added recently, because yeah, we also "have to be faster by using AI" now ...
Like, I gave it access to our code base, wanted to try a very simple bug fix. I only told it to look at one service I knew needed changes, because it says it works better in smaller code bases. It wanted to send so many tokens to sonnet that I hit the limits before it even started actually doing any coding.
Instant fail.
Then I just ran Claude Code, gave it the same instructions and I had a mostly working fix in a few minutes (never mind the other fails with Claude I've had - see other comment), but Aider was a huge disappointment for me.
I mean, I think ultimately the state space in designing a feature is way smaller than, say, go (the game). Maybe a few hundred common patterns and maybe a billion reasonable ways to combine them. I think it's only a matter of time before we ask it to design a feature, and it produces five options that are all better than what we'd have come up with.
Thank you for articulating something I knew but haven't been able to express as eloquently.
It frustrates me to no end to watch half a dozen non-technical bureaucrats argue for days about something that can be tried (and discarded) in a few hours with zero consequences.
"Let's write a position paper so that everyone involved can agree before we do anything."
Noooo! Just do it! See if it works in practice! Validate the marketing! Kick the tyres! Go for a test drive. Just. Get. Behind. The. Wheel.
Strange, I'd been more of the impression that this is an argument from pro vibe-coders. As more data comes in, the "productivity increases" of AI are not showing up as expected. So as people question, how come things are not getting done faster even though you say you are 10x faster at coding? The vibe-coders answer by saying that coding isn't the bottleneck, as opposed to capitulating and saying that maybe they're not that much faster at coding after-all.
How many dinners a day can you have?
You would still rely on alternative proxies, like recommendations or reviews.
I think there's another issue, but it could relate to your first two statements here. Even to try ideas, to explore the space of solutions, you need to have ideas to try. When entering development, you need clarity on what you're trying. It's very hard to make decisions on even a single attempt. I see engineers working a task the entire time simply not sure what the task is really about.
And in a way, the coding agents need even more clarity in what you ask of them to deliver good results.
So even inside of what we consider "development" or "coding", the bottleneck is often: "what am I supposed to do here?" and not so much "I don't know how to do this" or "I have so much to implement".
This becomes obvious as well once you throw in more engineers and you can't break up the work, because you have no clue what so many people could even all do. Knowing what all the needed tasks even are is hard and a big bottleneck.
> If you give them a task and spell it out, they can knock out code for it at a really good pace and wow upper management.
This is so true. I sometimes spend entire days, even weeks, where all I do is provide those types of engineers the clarity to "unblock" them. Yet I always wonder: if I had just spent that time coding myself, I might have gotten more done.
But it's also this that I think bottlenecks development. The set of people who really know what needs to be done, at the level of detail that these developers will need to be told, or that coding agents will need to be told, is very small, and that's your bottleneck: you have like 1 or 2 devs on a project who know what to do, and everyone else needs a Standard Operating Procedure handed to them for every task. And now everyone is always just waiting on the same 2 devs to tell them what to do.
Asking an LLM for code and then reading/reviewing it is a huge speedup for me in a lot of cases, compared to when I would need to write the same thing myself (but I agree it may not work well in big/complex systems... yet).
And I'm with you with a critical view on their all-in move toward AI. It's just what all the VCs do, and it's hard to say who's parroting who in this setup (I think that others are parroting YC, but feel free to challenge me on that).
Having said all that, I wouldn't be surprised if a couple of companies from this year's cohort made it big. If you look at YC's biggest successes year by year, you will often (but not always) find a household name.
Was there anyone who predicted these would be the greatest hits? Of course not! That's the whole point of having an investment portfolio. You can be wrong a lot of times if you secure an early investment in a unicorn every other year or so.
Also, "one recent example" of poor investment decision doesn't invalidate 2 decades of rather successful investment portfolios (as a whole, not individually).
In no way is it a YC defense. I'm very critical of the whole startup funding ecosystem, and they are a prominent player. Yet, if they were consistently stupid with their decisions, they wouldn't exist, let alone be the most desired accelerator out there.
Also, if it's that simple to copy what they do and what the companies in their portfolio do, why wouldn't Google et al. take their almost infinite funds and get the competing offers for non-BS ideas up and running in no time?
I bet that if you had an idea that could pay off thousandfold, you'd get enough eager ears to hear you out in any big tech. And still, it's the makeshift mass of startups that come through with new products.
One has to wonder why things like Shopify, Stripe, Zapier, or Figma did not come from the big tech. Each would have an ideal match. Even if you look at the AI landscape, how come Lovable made such a career? After all, they repackage the AI capabilities rented elsewhere. Somehow, with all the ingenuity of building ChatGPT, OpenAI and the rest didn't get it.
And that's only the things that they have released. I'd bet that there are lots more that never make it to the public.
And I expect no less from Microsoft, by the way. Microsoft is, in fact, a great case in point of how failed releases don't hurt the company's PR long-term. How many failures have they scored trying to catch up with the missed opportunities of the 2000s? Smartphones & tablets, search, music players, social media.
They were late to move the Office to the cloud, and kept pumping dollars into the Explorer/Edge lost cause, too.
I don't know enough details, but Xbox seems more like an outlier than a norm.
Yet they rebounded with Azure and made some good bets with AI, and are doing better than ever. However, we don't see a stream of new product bets coming from them.
Oh, and on Apple: I wouldn't discount the role of cult-like following in repeated product success. None of the other big techs has such a relationship with its user base. You don't see many raving fans of Facebook or Google. And you definitely have millions of people who would buy any new Apple product simply because it is a new Apple product.
It's like Joel Spolsky but on a global scale. In the 2000s, whatever Joel Spolsky touched turned into gold. Stack Overflow? Check. Trello? Check. Was there something unique about these products? Details, sure. But the biggest thing was Joel's leverage.
Having run a highly popular blog for developers, he could instantly reach out to his early adopters. Given that many of the readers were actual fans, they'd jump on the opportunity, whatever it was. So the early traction was not a problem (which was especially crucial for the developers' forum).
Scale that up to the big tech context, and you get Steve Jobs.
A side note: I wonder how long it will take Tim Cook to dismantle that. You can already see cracks.
The tools are absolutely useful, but they need to be applied in the right places, and they are decidedly not a silver bullet or general-purpose software engineering tool in the manner they're being billed at present. We still use them despite our findings, but we use them judiciously and where they actually help.
It would be good to define what "smaller code bases" means. Here is what I am working on: a 10-year-old project full of legacy, consisting of about 10 services and 10 front-end projects. I've also tried it on a project similar to MUI or Mantine UI, and naturally on many smaller projects. I also tried it on a TypeScript codebase, where it failed for me (but it is hard to judge from one attempt). Lastly, I am using it on smaller projects. Overall, the question is more about the task than about code base size. If the task does not involve loading too much context, then code base size might be irrelevant.
https://medium.com/@kazeemibrahim18/the-post-ipo-performance...
I have found that spending more time thinking generally reduces the amount of failed attempts. It's amazing what "thinking hard" beforehand can do to eliminate reprioritization scrambling.
This feels wrong to me, unless we qualify the statement with: "...if you want the exact same level of understanding of it."
Otherwise, the bottleneck in development would be pull/merge request review, not writing the code to do something. But almost always, it's the other way around - someone works on a feature for 3-5 days, the pull/merge request does not really spend the same time in active review. I don't think you need the exact same level of intricate understanding over some code when reviewing it.
It's quite similar with the AI stuff, I often nitpick and want to rework certain bits of code that AI generates (or fix obvious issues with it), but using it for the first version/draft is still easier than trying to approach the issue from zero. Ofc AI won't make you consistently better, but will remove some of the friction and reduce the cognitive load.
This is a first-degree smart argument. It presents a seemingly non-obvious idea that makes sense in retrospect.
However I happen to work at the experimentation team of a hyperscaler so I have a different perspective.
First, we aren't always saturating all of the potential experiments we could be running. The reasons are different, but essentially it takes time and effort to build those experimental features. If that cost trended to 0, we could make sure to have a queue of experiments deep enough.
Also, on our side we need to do development work to support new features and products. We have a backlog long enough to keep us perpetually busy. If dev cost trended to 0, we could always be ready to provide our customers what they need.
Speaking of new products, each one our company comes up with brings extra effort to support on our side, and yet more effort to produce dozens of A/B tests to validate new functionality.
And this is not even talking about ongoing maintenance effort. Bugfixes, upgrades, etc. take a non-trivial amount of effort to keep up with.
And this is only inside our little experimentation team. What about security, reliability, scalability, efficiency... it makes me wonder if OP has experience running products at scale.
Instead I'd like to think that dropping the cost of development by orders of magnitude changes the equation of how we create products.