

413 points martinald | 84 comments
tangotaylor ◴[] No.46204312[source]
> Engineers need to really lean in to the change in my opinion.

I tried leaning in. I really tried. I'm not a web developer or game developer (more robotics, embedded systems). I tried vibe coding web apps and games. They were pretty boring. I got frustrated that I couldn't change little things. I remember getting frustrated that my game character kept getting stuck on imaginary walls and kept asking Cursor to fix it and it just made more and more of a mess. I remember making a simple front-end + backend with a database app to analyze thousands of pull request comments and it got massively slow and I didn't know why. Cursor wasn't very helpful in fixing it. I felt dumber after the whole process.

The next time I made a web app I just taught myself Flask and some basic JS and I found myself moving way more quickly. Not in the initial development, but later on when I had to tweak things.
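
The kind of minimal Flask setup described is only a handful of lines. An illustrative sketch (not the commenter's actual app; the endpoint and in-memory store are made up for the example):

```python
# A minimal Flask backend a small JS front end can call -- illustrative
# sketch only; names here are hypothetical, not from the comment above.
from flask import Flask, jsonify, request

app = Flask(__name__)

comments = []  # in-memory store; a real app would use a database


@app.route("/comments", methods=["GET"])
def list_comments():
    # return every stored comment as JSON
    return jsonify(comments)


@app.route("/comments", methods=["POST"])
def add_comment():
    # append the posted JSON body and report the new count
    comments.append(request.get_json())
    return jsonify({"count": len(comments)}), 201


if __name__ == "__main__":
    app.run(debug=True)
```

The point stands either way: a stack this small is easy to tweak by hand later, which is where the commenter found hand-written code paying off.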

The AI helped me a ton with looking things up: documentation, error messages, etc. It's essentially a supercharged Google search and Stack Overflow replacement, but I did not find it useful letting it take the wheel.

replies(9): >>46204550 #>>46205027 #>>46206045 #>>46206421 #>>46206931 #>>46210894 #>>46211263 #>>46211291 #>>46216142 #
1. r_lee ◴[] No.46204550[source]
Posts like the one OP made are why I'm losing my mind.

Like, is there truly an agentic way to go 10x or is there some catch? At this point while I'm not thrilled about the idea of just "vibe coding" all the time, I'm fine with facing reality.

But I keep having the same experience as you, or rather leaning more on that supercharged Google/SO replacement

or just a "can you quickly make this boring func here that does xyz" "also add this" or for bash scripts etc.

And that's only when I've done most of the plumbing myself.

replies(19): >>46204630 #>>46204766 #>>46204828 #>>46204843 #>>46204925 #>>46205328 #>>46205478 #>>46205659 #>>46205781 #>>46205890 #>>46205913 #>>46205924 #>>46205931 #>>46206330 #>>46207518 #>>46209875 #>>46214153 #>>46214479 #>>46214591 #
2. adam_patarino ◴[] No.46204630[source]
I keep using this excel analogy. When excel came out everyone said accountants were dead. Now we have more accountants than ever.

The analogy carries to what you’re saying here. Accountants or folks who know excel deeply can get a lot more from it than folks who are novice to it.

AI coding can be really helpful for an engineer. Keep at it!

3. adriand ◴[] No.46204766[source]
> Like, is there truly an agentic way to go 10x or is there some catch?

Yes. I think it’s practice. I know this sounds ridiculous, but I feel like I have reached a kind of mind meld state with my AI tooling, specifically Claude Code. I am not really consciously aware of having learned anything related to these processes, but I have been all in on this since ChatGPT, and I honestly think my brain has been rewired in a way that I don’t truly perceive except in terms of the rate of software production.

There was a period of several months a while ago where I felt exhausted all the time. I was getting a lot done, but there was something about the experience that was incredibly draining. Now I am past that and I have gone to this new plateau of ridiculous productivity, and a kind of addictive joy in the work. A marvellous pleasure at the orchestration of complex tasks and seeing the results play out. It’s pure magic.

Yes, I know this sounds ridiculous and over-the-top. But I haven’t had this much fun writing software since my 20s.

replies(3): >>46204830 #>>46204842 #>>46207458 #
4. lionkor ◴[] No.46204828[source]
I have a feeling that people who are genuinely impressed by long term vibe coding on a single project are only impressed because they don't know any better.

Take writing a book, or blog post; writing a good blog post, or a chapter of a book, takes lots of skill and practice. The results are very satisfying and usually add value to both the writer's life as well as the reader's. When someone who has done that uses AI and sees the slop it generates, he's not impressed, probably even frustrated.

However, someone who can barely write a couple of coherent sentences would be baffled at how well AIs can put together sentences and paragraphs and sustain a somewhat coherent train of thought through an entire text. People who struggled in school with writing an introduction and a conclusion will be amazed at AI's writing. They might even assume that "those paragraphs actually add no meaning and are purely fluff" is a totally normal part of writing and not an AI artifact.

replies(1): >>46206458 #
5. r_lee ◴[] No.46204830[source]
That's really interesting.

May I ask what kinds of projects, stack and any kind of markdown magic you use?

And any specific workflow? And are there times when you have to step in manually?

replies(2): >>46204929 #>>46204969 #
6. mrwrong ◴[] No.46204842[source]
> Yes, I know this sounds ridiculous and over-the-top.

in that case you should come with more data. tell us how you measured your productivity improvement. all you've said here is that it makes you feel good

replies(4): >>46205066 #>>46205352 #>>46206166 #>>46208843 #
7. bonzini ◴[] No.46204843[source]
I did find some benefit in lowering the cost of exploratory work, but that's it—certainly worth 20€/month, but not the price of any of the "ultimate" plans.

For example today I had to write a simple state machine (for a parser that I was rewriting so I had all the testcases already). I asked Claude Code to write the state machine for me and stopped it before it tried compiling and testing.

Some of the code (of course including all the boilerplate) worked, some made no sense. It saved a few minutes and overall the code it produced was a decent first approximation, but waiting for it to "reason" through the fixes would have made no sense, at least to me. The time savings mostly came from avoiding the initial "type the boilerplate and make it compile" part.
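
The commenter's parser isn't shown, but the shape of a small hand-rolled state machine like the one described looks roughly like this (an illustrative sketch in Python; the quoted-field grammar is made up for the example):

```python
# A small 3-state machine of the kind described: splitting a line of
# quoted/unquoted fields. Illustrative only -- not the commenter's parser.
def parse_fields(line):
    """Split 'a,"b,c",d' into ['a', 'b,c', 'd']."""
    FIELD, QUOTED, AFTER_QUOTE = range(3)
    state, buf, out = FIELD, [], []
    for ch in line:
        if state == FIELD:
            if ch == '"' and not buf:
                state = QUOTED          # opening quote starts a quoted field
            elif ch == ",":
                out.append("".join(buf))  # field boundary
                buf = []
            else:
                buf.append(ch)
        elif state == QUOTED:
            if ch == '"':
                state = AFTER_QUOTE     # closing quote ends the field body
            else:
                buf.append(ch)          # commas are literal inside quotes
        elif state == AFTER_QUOTE:
            if ch == ",":
                out.append("".join(buf))
                buf = []
                state = FIELD
            # stray characters after a closing quote are ignored
    out.append("".join(buf))
    return out
```

Boilerplate like this is exactly the part the commenter says the LLM saved time on: typing out the state/transition skeleton is mechanical, while deciding the transitions is the actual work.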

When completing the refactoring there were a few other steps where using AI was useful. But overall the LLM did maybe 10% of the work and saved, optimistically, 20-30 minutes over a morning.

Assuming I have similar savings once a week, which is again very optimistic... That's a 2% reduction or less.

8. itgoon ◴[] No.46204925[source]
Like another reply says: practice.

How many hours have you spent writing code? Thousands? Tens of thousands? Were you able to achieve good results in the first hundred hours?

Now, compare it to how much time you've spent working with agents. Did you dedicate considerable time to figuring out how to use them? Do you stop using the agent and do things manually when it isn't going right, or do you spend time figuring out how to get the agent to do it?

replies(2): >>46204957 #>>46204996 #
9. adriand ◴[] No.46204929{3}[source]
Currently three main projects. Two are Rails back-ends and React front-ends, so they are all Ruby, Typescript, Tailwind, etc. The third is more recent, it's an audio plugin built using the JUCE framework, it is all C++. This is the one that has been blowing my mind the most because I am an expert web developer, but the last time I wrote a line of C++ was 20 years ago, and I have zero DSP or math skills. What blows my mind is that it works great, it's thread safe and performant.

In terms of workflow, I have a bunch of custom commands for tasks that I do frequently (e.g. "perform code review"), but I'm very much in the loop all the time. The whole "agent can code for hours at a time" thing is not something I personally believe. It depends on the task how involved I get, however. Sometimes I'm happy to just let it do work and then review afterwards. Other times, I will watch it code and interrupt it if I am unhappy with the direction. So yes, I am constantly stepping in manually. This is what I meant about "mind meld". The agent is not doing the work, I am not doing the work, WE are doing the work.

replies(2): >>46205135 #>>46206605 #
10. ilidur ◴[] No.46204957[source]
And then all the heuristics you've learnt change under you and you're stuck doing 100-1000 more hours of learning with a drop in quality during that time.
11. itgoon ◴[] No.46204969{3}[source]
Research -> Plan -> Implement

Start by having the agent ask you questions until it has enough information to create a plan.

Use the agent to create the plan.

Follow the plan.

When I started, I had to look at the code pretty frequently. Rather than fix it myself, I spent time thinking about what I could change in my prompts or workflow.

12. crowbahr ◴[] No.46204996[source]
You can't really compare those two. Agents are non-deterministic. I can tell Clod to go update my unit test coverage and it will choke itself, burn 200k tokens and then loudly proclaim "Great! I've updated unit test coverage".

I'll kill that terminal, open it again and run the exact same command. 30k tokens, actually adds new tests.

It's hard to "learn" when the feedback cycle can take 30 minutes and result in the agent sitting in the corner touching itself and crooning about what a good boy it is. It's hard to _want_ to learn when you can't trust the damn thing with the same prompt twice.

replies(1): >>46205011 #
13. funkydata ◴[] No.46205011{3}[source]
That comment made my day. I actually had an intern like that!
14. adriand ◴[] No.46205066{3}[source]
Work that would have taken me 1-2 weeks to complete, I can now get done in 2-3 hours. That's not an exaggeration. I have another friend who is as all-in on this as me and he works in a company (I work for myself, as a solo contractor for clients), and he told me that he moved on to Q1 2026 projects because he'd completed all the work slated for 2025, weeks ahead of schedule. Meanwhile his colleagues are still wading through scrum meetings.

I realize that this all sounds kind of religious: you don't know what you're missing until you actually accept Jesus's love, or something along those lines. But you do have to kinda just go all-in to have this experience. I don't know what else to say about it.

replies(4): >>46205140 #>>46205843 #>>46206372 #>>46213628 #
15. efields ◴[] No.46205135{4}[source]
I maintain a few rails apps and Claude Code has written 95% of the code for the last 4 months. I deploy regularly.

I make my own PRs then have Copilot review them. Sometimes it finds criticisms, and I copy and paste that chunk of critique into Claude Code, and it fixes it.

Treat the LLMs like junior devs that can lookup answers supernaturally fast. You still need to be mindful of their work. Doubtful even. Test, test, test.

replies(1): >>46206548 #
16. beepbooptheory ◴[] No.46205140{4}[source]
My sympathies go out to the friend's coworkers. They are probably wading through a bunch of stuff right now, but given the context you have given us, it's probably not "scrum meetings"..

I don't even care about the llm, I just want the confidence you have to assess that any given thing will take N weeks. You say 1-2 weeks.. that's a big range! Something that "would" take 1 week takes ~2 hours, something that "would" take 2 weeks also takes ~2 hours. How does that even make sense? I wonder how long something that would have taken three weeks would take?

Do you still charge your clients the same?

replies(1): >>46205325 #
17. adriand ◴[] No.46205325{5}[source]
> They are probably wading through a bunch of stuff right now, but given the context you have given us, its probably not "scrum meetings"..

This made me laugh. Fair enough. ;)

In terms of the time estimations: if your point is that I don't have hard data to back up my assertions, you're absolutely correct. I was always terrible at estimating how long something would take. I'm still terrible at it. But I agree with the OP. I think the labour required is down 90%.

It does feel to me that we're getting into religious believer territory. There are those who have firsthand experience and are all-in (the believers), there are those who have firsthand experience and don't get it (the faithless), and there are those who haven't tried it (the atheists). It's hard to communicate across those divides, and each group's view of the others is essentially, "I don't understand you".

replies(5): >>46205910 #>>46206281 #>>46206742 #>>46206867 #>>46207540 #
18. groby_b ◴[] No.46205328[source]
You can go faster once you understand the domain well enough that you could have written it yourself. This allows you to write better designs and steer LLMs in the right direction.

"Vibe coding", though, is pushing an ever-growing pile of non-understanding and complexity in front of you until you get stuck. (But it does work until you've amassed a big enough pile, so it's good for smaller tasks - and then suddenly extremely frustrating once you reach that threshold.)

Can you go 10x? Depends. I haven't tried any really large project yet, but I can compress fairly large things that would've taken a week or two pre-LLM into a single lazy Sunday.

For larger projects, it's definitely useful for some tasks. ("Ingest the last 10k commits, tell me which ones are most likely to have broken this particular feature") - the trick is finding tasks where the win from the right answer is large, and the loss from the wrong one is small. It's more like running algorithmic trading on a decent edge than it is like coding :)

It definitely struggles to successfully do fully agentic work on very large code bases. But... I've also not tried too much in that space yet, so take that with a grain of salt.

replies(1): >>46207612 #
19. klank4 ◴[] No.46205352{3}[source]
Here's what's worked best with Gemini: using these rules I made a DSL that transpiles to C with CUDA support to train small models, in about 3 hours... (all programs must run against an image data set, and must only generate embeddings)

Do not; vibe code from top down (ex. Make me a UI with React, with these buttons and these behaviors to each button)

Do not; chat casually with it. (ex. I think it would look better if the button was green)

Do; constrain phrasing to the next data transform goal (ex. You must add a function to change all words that start with lowercase to start with uppercase)

Do; vibe code bottom up (ex. You must generate a file with a function to open a plaintext file and appropriate tests; now you must add a function to count all words that begin with "f")

Do; stick to must/should/may (ex. You must extend the code with this next function)

Do; constrain it to mathematical abstractions (ex. sys prompt: You must not use loops, you must only use recursion and functional paradigms. You must not make up abstractions and stick to mathematical objects and known algorithms)

Do; constrain it to one file per type and function. This makes it quick to review, regenerate only what needs to change.

Using those patterns, Gemini 2.5 and 3 have cranked out banging code with little wandering off in the weeds and hallucinating.
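
Written out by hand, the bottom-up targets from the example prompts above look something like this (an illustrative sketch of the three hypothetical functions named in the Do's, not actual Gemini output):

```python
# One small, testable data transform per step -- the bottom-up style
# described above. Function names follow the example prompts; illustrative.
def read_words(path):
    """Open a plaintext file and return its whitespace-separated words."""
    with open(path, encoding="utf-8") as f:
        return f.read().split()


def count_f_words(words):
    """Count all words that begin with 'f'."""
    return sum(1 for w in words if w.startswith("f"))


def capitalize_lowercase(words):
    """Change words that start with a lowercase letter to start uppercase."""
    return [w[0].upper() + w[1:] if w and w[0].islower() else w
            for w in words]
```

Each step is one pure function plus its tests, so reviewing or regenerating any one of them in isolation stays cheap - which is the whole point of the one-file-per-function constraint.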

Programming has been mired in the made-up semantics of individual coders for the luls, to create mystique and obfuscate the truth to ensure job security; at the end of the day it's matrix math and state sync between memory and display.

replies(3): >>46209003 #>>46217690 #>>46218926 #
20. JeremyNT ◴[] No.46205478[source]
> Like, is there truly an agentic way to go 10x or is there some catch? At this point while I'm not thrilled about the idea of just "vibe coding" all the time, I'm fine with facing reality.

Below is based on my experience using (currently) mostly GPT-5 with open source code assistants.

For a new project with straightforward functionality? I think you (and "you" being "basically anybody who can code at all") can probably manage to go 10x the pace of a junior engineer of yesteryear.

Things get a lot trickier when you have complex business logic to express and backwards compatibility to maintain in an existing codebase. Writing out these kinds of requirements in natural language is its own skillset (which can be developed), and this process takes time in and of itself.

The more confusing the requirements, the more error-prone the process becomes, though. The model can do things "correctly" but oops, maybe you forgot something in your description, and now the whole thing will be wrong. And the fact that you didn't write the code means that you missed your opportunity to fix and think about stuff in the first pass of implementation (i.e. you need to seriously review stuff, which also slows you down).

Sometimes iterating over English instructions will take longer than just writing/expressing things in code from the start. But sometimes it will be a lot faster too.

Basically the easy stuff is way easier but the more complex stuff is still going to require a lot of hand holding and a lot of manual review.

21. coffeefirst ◴[] No.46205659[source]
That's my finding as well. The smaller the chunk, the better, and it saves me 5m here and an hour there. These really add up.

This is cool. It's extra cool on annoying things like "fix my types" or "find the syntax error" or "give me the flags for ffmpeg to do exactly this."

If I ever meet someone who drank the koolaid and wants to show me their process, I'm happy to see it. But I've tried enough to believe my own eyes, and when I see open source contributors I respect demo their methods, they spend so much time and energy waiting on the machine and trying to keep it on the rails that, whatever else it is, it does not appear to be faster.

22. KallDrexx ◴[] No.46205781[source]
EVERY DX survey that comes out (surveying over 20k developers) says the exact same thing.

Staff engineers get the most time savings out of AI tools, and their weekly time savings is 4.4 hours for heavy AI users. That's a little more than a 10% productivity gain, so not anywhere close to 10x.

What's more telling about the survey results is they are also consistent in their findings between heavy and light users of AI. Staff engineers who are heavy users of AI save 4.4 hours a week while staff engineers who are light users of AI save 3.3 hours a week. To put it another way, the DX survey is pretty clear that the time savings between heavy and light AI users is minimal.

Yes, surveys are all flawed in different ways, but an N of 20k is nothing to sneeze at. Every study with actual data points shows that code generation is not a significant time savings, and zero studies show otherwise. All the productivity gains DX reports come from debugging and investigation/code base spelunking help.

replies(3): >>46206608 #>>46207179 #>>46207511 #
23. mrwrong ◴[] No.46205843{4}[source]
this is just not a very interesting way to talk about technology. I'm glad it feels like a religious experience to you, I don't care about that. I care about reality
replies(1): >>46206396 #
24. digitalsushi ◴[] No.46205890[source]
does anyone remember that episode of star trek tng where the kid is given a little laser engraver that carves a dolphin from a block of wood? and the kid is like "i didn't make this" and the teacher (who abducted him, ew) is like "yeah but it's what you wanted to make, the tool just guided you"

so in 2026 we're going to get in trouble doing code "the old way", the pleasurable way, the way an artist connects with the work. we're not chefs any longer, we're plumbers now, pouring food from a faucet.

we're annoyed because our output can suddenly be measured by the time unit. the jig is up. our secret clubhouse has a lightbulb the landlord controls.

some of us were already doing good work, saving money, making the right decisions. we'll be fine.

some of us don't know how to do those things - or won't do those things - and our options are funneled down. we're thrashing at this, like dogs being led to the pound.

there's before, there's during, and there's after; the during is a thing we so seldom experience, and we're in it, and 2024 felt like nothing, 2025 feels like the struggle, and 2026 will be the reconciliation.

change sucks. but it's how we continue. we continue differently or we don't exist.

replies(3): >>46206408 #>>46210545 #>>46211507 #
25. WesleyJohnson ◴[] No.46205910{6}[source]
Not to pick apart your analogy, but asserting that atheists haven't tried religion is misinformed.
replies(1): >>46206554 #
26. jjice ◴[] No.46205913[source]
> or just a "can you quickly make this boring func here that does xyz" "also add this" or for bash scripts etc.

I still write most of the interesting code myself, but when it comes to boring, tedious work (that's usually fairly repetitive, but can't be well abstracted any more), that's when I've found gen AI to be a huge win.

It's not 10x, because a lot of the time, I'm still writing code normally. For very specific, boring things (that also are usually my least favorite parts of code to write), it's fantastic and it really is a 10x. If you amortize that 10x over all the time, it's more like a 1.5x to 3x in my experience, but it saves my sanity.

Things like implementing very boring CRUD endpoints that have enough custom logic that I can't use a good abstraction and writing the associated tests.
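
That "boring CRUD with a bit of custom logic" shape, as a framework-free sketch (the `TicketStore` name and validation rules here are made up for illustration):

```python
# The tedium is the validation + persistence + test boilerplate, not hard
# logic. Hypothetical example, not the commenter's actual endpoints.
class TicketStore:
    def __init__(self):
        self._rows = {}
        self._next_id = 1

    def create(self, payload):
        # the small bits of custom logic that block a generic abstraction:
        title = (payload.get("title") or "").strip()
        if not title:
            raise ValueError("title is required")
        if payload.get("priority", 1) not in (1, 2, 3):
            raise ValueError("priority must be 1-3")
        row = {"id": self._next_id, "title": title,
               "priority": payload.get("priority", 1)}
        self._rows[self._next_id] = row
        self._next_id += 1
        return row

    def get(self, row_id):
        return self._rows.get(row_id)

    def delete(self, row_id):
        return self._rows.pop(row_id, None) is not None
```

Each endpoint needs its own near-identical validation, handlers, and tests - exactly the repetitive-but-not-abstractable work being described.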

I would dread doing work like that because it was just so mind numbing. Now, I've written a bunch of Cursor rules (that was actually pretty fun) so I can now drop in a Linear ticket description and have it get somewhere around 95% done all at once.

Now, if I'm writing something that is interesting, I probably want to work on it myself purely because it's fun, but also because the LLM may suck at it (although they're getting pretty damn good).

replies(1): >>46229606 #
27. vbezhenar ◴[] No.46205924[source]
I tried claude code to write a very simple app for me: basically a Golang mock server which dumps requests to the console. I'd write this kind of app in an hour. I spent around 1.5 hours with claude code and in the end I had code which I liked, almost the same code I'd write myself. It's not vibe coding; I carefully instructed it to write code in the way I prefer, one small step after another.
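
The app in question was Go, but a request-dumping mock server is only a handful of stdlib lines in most languages; a Python analogue for illustration (not the commenter's actual code):

```python
# A mock server that dumps every incoming request to the console and
# replies "ok" -- Python stdlib sketch of the Go app described above.
from http.server import BaseHTTPRequestHandler, HTTPServer


class DumpHandler(BaseHTTPRequestHandler):
    def _dump(self):
        # read the body (if any) and print method, path, headers, body
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        print(self.command, self.path)
        print(self.headers, body.decode(errors="replace"))
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"ok")

    # every verb gets the same dump-and-ack behaviour
    do_GET = do_POST = do_PUT = do_DELETE = _dump


if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8080), DumpHandler).serve_forever()
```

An hour-ish by hand sounds right for either language, which is what makes it a fair baseline for the 1.5-hours-with-Claude comparison.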

So for me, it's pretty obvious that with better training, I'd be able to achieve speed-ups with the same result in the end. Not 10x, but 2x is possible. The very first attempt to use AI took almost the same time as writing the code myself, and I have a lot to improve.

That said, I have a huge problem with this approach. It's not fun to work like that. I started to program 25 years ago because it was fun for me. It's still fun for me today. I love writing all these loops and ifs. I can accept minimal automation like static autocomplete, but that's about it.

28. csomar ◴[] No.46205931[source]
I have been building a game (preview here: https://qpingpong.codeinput.com) as an exercise in "vibe coding". There is only one rule: I am not allowed to write a single line of code. But I can prompt as much as I want.

So far I am hitting a "hard-block" on getting the AI to make changes once you have a large code base. One "unblocker" was to restructure all the elements as their own components. This makes it easier for the LLM (and you?) to reason about each component (React) in isolation.

Still, even at this "small/simple game" stage, it is not only hard for the LLM to get any change done but very easy for it to break things. The only way I can see around it is to structure very thorough tests (including E2E tests) so that any change by the LLM is thoroughly tested for regression.

I've been working on this for a month or so. I could have coded it faster by hand except for the design part.

replies(1): >>46211894 #
29. stocksinsmocks ◴[] No.46206166{3}[source]
Nobody has a robust, empirical metric of programmer productivity. Nobody. Ticket count, function points, LoC, and others tell you nothing about the fitness of the product. It’s all feels.
replies(1): >>46206333 #
30. beepbooptheory ◴[] No.46206281{6}[source]
But then does this not give you pause, that it "feels religious"? Is there not some morsel of critical/rational interrogation on this? Aren't you worried about becoming perhaps too fundamentalist in your belief?

To extend the analogy: why charge clients for your labor anymore, which Claude can supposedly do in a fraction of the time? Why not just ask if they have heard the good word, so to speak?

31. Lich ◴[] No.46206330[source]
Feeling the same. I’m guessing the folks getting good results are literally writing extremely detailed pseudocode by hand?! Like:

Write a class Person who has members (int) age, (string) first name, (string) last name…

But if you can write it in that much detail… don’t you already know the code you want to write and how you should write it? Writing plain pseudocode feels more verbose.
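
To make the point concrete: the pseudocode prompt above is barely shorter than the code it asks for. The same class, rendered directly in Python:

```python
# The exact Person described in the pseudocode prompt above -- the prompt
# and the code are nearly the same length.
from dataclasses import dataclass


@dataclass
class Person:
    age: int
    first_name: str
    last_name: str
```

With `@dataclass` generating the constructor and equality for free, the hand-written version arguably carries *more* information per character than the prose prompt.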

replies(1): >>46207855 #
32. mrwrong ◴[] No.46206333{4}[source]
ok, but there's a spectrum between fully reproducible empirical evidence and divine revelation. I'm not convinced it's impossible to measure productivity in a meaningful way, even if it isn't perfect. it at least seems better to try than... whatever this is
33. no_wizard ◴[] No.46206372{4}[source]
If your work maps exceedingly well to the technology it is true, it goes much faster. Doubly so when you have enough experience and understanding of things to find its errors or suboptimal approaches and adjust it that much faster.

The second you get to a place where the mapping isn’t there though, it goes off rails quickly.

Not everyone programs in such a way that they may ever experience this but I have, as a Staff engineer at a large firm, run into this again and again.

It’s great for greenfield projects that follow CRUD patterns though.

34. coderatlarge ◴[] No.46206396{5}[source]
it seems to me if these things were real and repeatable there would be published traces that show the exact interactions that led to a specific output and the cost in time and money to get there.

do such things exist?

35. ◴[] No.46206408[source]
36. stocksinsmocks ◴[] No.46206458[source]
I’m impressed by getting the output of at least a mediocre developer at less than 1% of the cost. Brute force is an underrated strategy. I’ve been having a great experience.

That developers in the Hacker News comment bin report experiences that align with their personal financial interests doesn’t really dissuade me.

37. lomase ◴[] No.46206548{5}[source]
Can we see any of this software created by this amazing LLMs?
38. timeon ◴[] No.46206554{7}[source]
Brain-rot can be associated with heavy LLM usage.
39. timeon ◴[] No.46206605{4}[source]
Why do you need to use Tailwind if the code is generated? Can't there be something more efficient?
replies(1): >>46206943 #
40. y0eswddl ◴[] No.46206608[source]
do you have a link to the latest survey? my google-fu is failing me at the moment
replies(3): >>46206835 #>>46207132 #>>46207512 #
41. jakebasile ◴[] No.46206742{6}[source]
So, you say that AI has made you "ridiculously faster", but then admit you've always been terrible at estimating how long something would take?
42. jatora ◴[] No.46206835{3}[source]
your google-fu isn't failing. there's simply only a couple of large studies on this, and of those, zero that have a useful methodology.
replies(1): >>46213472 #
43. bonesss ◴[] No.46206867{6}[source]
Religions are about faith, faith is belief in the absence of evidence. Engineering output is tangible and measurable, objectively verifiable and readily quantifiable (both locally and in terms of profits). Full evidence, testable assertions, no faith required.

Here we have claims of objective results, but also admissions we’re not even tracking estimations and are terrible at making them when we do. People are notoriously bad at estimating actual time spent versus output, particularly when dealing with unwanted work. We’re missing the fundamental criteria of assessment, and there are known biases unaccounted for.

Output in LOC has never been the issue, copy and paste handles that just fine. TCO and holistic velocity after a few years is a separate matter. Masterful orchestration of agents could include estimation and tracking tasks with minimal overhead. That’s not what we’re seeing though…

Someone who has even a 20% better method for deck construction is gonna show me some timetables, some billed projects, and a very fancy new car. If accepting Mothra as my lord and saviour is a prerequisite to pierce an otherwise impenetrable veil of ontological obfuscation in order to see the unseeable? That deck might not be as cheap as it sounds, one way or the other.

I’m getting a nice learning and productivity bump from LLMs, there are incredible capabilities available. But premature optimization is still premature, and claims of silver bullets are yet to be demonstrated.

replies(2): >>46207584 #>>46207920 #
44. Deegy ◴[] No.46206943{5}[source]
Extensive tailwind training data in the models. Sure there's something more efficient but it's just safer to let the model leverage what it was trained on.
replies(1): >>46208432 #
45. KallDrexx ◴[] No.46207132{3}[source]
Unfortunately I do not.

Past survey results are hidden in some presentations I've seen, and I have full access to the latest survey because my company pays for it. So I'm not sure it's legal for me to reproduce it.

46. valbaca ◴[] No.46207179[source]
> That's a little more than 10% productivity, so not anywhere close to 10x.

No Silver Bullet still alive and well in 2026

47. newsoftheday ◴[] No.46207458[source]
> Yes, I know this sounds ridiculous and over-the-top. But I haven’t had this much fun writing software since my 20s.

But... you're not writing it. The culmination of many sites, many people, Stack Overflow, etc. wrote it, through the filtering mechanism being called AI.

You didn't write a damn thing.

replies(1): >>46214135 #
48. jonfw ◴[] No.46207511[source]
For those surveys to mean anything (for or against AI), we'd have to have an effective measure of developer productivity
replies(1): >>46207857 #
49. KallDrexx ◴[] No.46207512{3}[source]
The latest survey is here apparently: https://getdx.com/report/ai-assisted-engineering-q4-impact-r....

No idea if signing up gives the full survey for free. We get it through our company paying for DX's services

50. GoatInGrey ◴[] No.46207518[source]
It's been my experience that reaching for an LLM is a significant context switch that breaks flow state. Comparable to a monkey entering your office and banging cymbals together for a minute, returning to programming after writing up instructions for an LLM requires a refocusing process to reestablish the immersion you just forfeited. This can be a worthwhile trade with particularly tedious or annoying tasks, but not always.

I suspect that this explains the current bifurcation of LLM usage, where individuals either use LLMs for everything or use them minimally, with the in-between space shrinking by the day.

51. newsoftheday ◴[] No.46207540{6}[source]
> It does feel to me that we're getting into religious believer territory. There are those who have firsthand experience and are all-in (the believers), there are those who have firsthand experience and don't get it (the faithless), and there are those who haven't tried it (the atheists). It's hard to communicate across those divides, and each group's view of the others is essentially, "I don't understand you".

What a total crock. Your prose reminds me of the ridiculously funny Mike Myers in "The Love Guru".

52. adriand ◴[] No.46207584{7}[source]
Here's an example from this morning. At 10:00 am, a colleague created a ticket with an idea for the music plugin I'm working on: wouldn't it be cool if we could use nod detection (head tracking) to trigger recording? That way, musicians who use our app wouldn't need a foot switch (as a musician, you often have your hands occupied).

Yes, that would be cool. An hour later, I shipped a release build with that feature fully functional, including permissions plus a calibration UI that shows if your face is detected and lets you adjust sensitivity, and visually displays when a nod is detected. Most of that work got done while I was in the shower. That is the second feature in this app that got built today.

This morning I also created and deployed a bug fix release for analytics on one platform, and a brand-new report (fairly easy to put together because it followed the pattern of other reports) for a different platform.

I also worked out, argued with random people on HN and walked to work. Not bad for five hours! Do I know how long it would have taken to, for example, integrate face detection and tracking into a C++ audio plugin without assistance from AI? Especially given that I have never done that before? No, I do not. I am bad at estimating. Would it have been longer than 30 minutes? I mean...probably?

replies(3): >>46208255 #>>46210903 #>>46220079 #
53. rbalicki ◴[] No.46207612[source]
A git bisect where you can't easily check whether the feature is broken (because the testing step isn't easily described as code) would be killer
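For the cases where the check is scriptable, git already automates the loop with `git bisect run`; the parent's point is exactly about the cases where it isn't. A self-contained sketch of the scriptable baseline (throwaway repo, invented commit names and check command):

```shell
#!/bin/sh
# Build a throwaway repo where one commit breaks a "feature", then let
# `git bisect run` find the first bad commit automatically.
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email demo@example.com
git config user.name demo

echo good > feature.txt
git add feature.txt && git commit -qm good1
git commit -qm good2 --allow-empty
echo broken > feature.txt
git commit -qam bad1
git commit -qm bad2 --allow-empty

# Mark the endpoints, then hand git a check: exit 0 = good, nonzero = bad.
git bisect start HEAD HEAD~3
git bisect run grep -q good feature.txt

# refs/bisect/bad now points at the first bad commit.
first_bad=$(git show -s --format=%s refs/bisect/bad)
git bisect reset
echo "$first_bad"
```

The "killer" version would be the same loop with the `grep` check replaced by an agent deciding pass/fail from a description of the feature.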
54. skapadia ◴[] No.46207855[source]
But the AI coding agent can then ask you follow-up questions, consider angles you may not have, and generate other artifacts like documentation, data generation and migration scripts, tests, and CRUD APIs, all in context. If you can reliably do all that from plain pseudocode, that's way less verbose than having to write out every different representation of the same underlying concept by hand.

Sure, some of that, like CRUD APIs, you can generate via templates as well. Heck, you can even have the coding agent generate the templates and the code that will process/compile them, or generate the code that generates the templates given a set of parameters.

55. ponector ◴[] No.46207857{3}[source]
In my experience the productivity measured in created merge requests increased massively.

More merge requests because now the same senior developers are creating more bugs, 4x compared to 2025. Same developers, same codebase, but now with Cursor!

replies(1): >>46208799 #
56. fauigerzigerk ◴[] No.46207920{7}[source]
I think you have to make a distinction between indvidual experience and claims about general truths.

If I know someone as an honest and serious professional, and they tell me that some tool has made them 5x or 10x more productive, then I'm willing to believe that the tool really did make a big difference for them and their specific work. I would be far more sceptical if they told me that a tool has made them 10% more productive.

I might have some questions about how much technical debt was accumulated in the process and how much learning did not happen that might be needed down the road. How much of that productivity gain was borrowed from the future?

But I wouldn't dismiss the immediate claims out of hand. I think this experience is relevant as a starting point for the science that's needed to make more general claims.

Also, let's not forget that almost none of the choices we make as software engineers are based on solid empirical science. I have looked at quite a few studies about productivity and defect rates in software engineering projects. The methodology is almost always dodgy and the conclusions seem anything but robust to me.

57. beepbooptheory ◴[] No.46208255{8}[source]
Just having a 'count-in' type feature for recording would be much much more useful. Head nodding is something I do all the time anyway as a musician :).

I don't know what your user makeup is like, but shipping a CV feature same day sounds so potentially disastrous... There are so many things I would think you would at least want to test, or even just consider with the kind of user empathy we all should practice.

58. camdenreslink ◴[] No.46208432{6}[source]
Surely there is an order of magnitude more training data on plain CSS than tailwind, right?
replies(1): >>46210528 #
59. Izkata ◴[] No.46208799{4}[source]
Lines of code changed would be a high one as well, though that's already well-known as a bad measure.
60. nasmorn ◴[] No.46208843{3}[source]
Just as an aside I also think I am way more productive now but a really convincing datapoint would be someone who does project work and now has 5x the hourly rate they had last year. If there are not plenty of people like this, it cannot be 10x
replies(1): >>46209815 #
61. pdimitar ◴[] No.46209003{4}[source]
Awesome comment, thank you. No idea why it was flagged as dead. Vouched for it to not be.
62. thfuran ◴[] No.46209815{4}[source]
That's not a very convincing argument. Even if you can do 10x the work, that doesn't necessarily mean you can easily find customers ready to pay 5x the hourly rate.
replies(2): >>46215756 #>>46220199 #
63. estimator7292 ◴[] No.46209875[source]
It seems to very heavily depend on your exact project and how well it's represented in the training set.

For instance, AI is great at React Native bullshit that I can't be bothered with. It absolutely cannot handle embedded development, particularly if you're not using the Arduino framework on an ATmega328. I'm presently doing bare-metal AVR on a new chip, and none of the AI agents have a single clue what they're doing. Even when fed with the datasheet and an entire codebase of manually written code for this thing, AI just produces hot wet garbage.

If you're on the 1% happy path AI is great. If you diverge even slightly from the top 10 most common languages and frameworks it's basically useless.

The weird thing is that if you go in reverse it works great. I can feed bits of AVR assembly in and the AI can parse it perfectly. Not sure how that works; I suspect it's a fundamentally different type of transformation that these models are really good at.

64. frikk ◴[] No.46210528{7}[source]
In my experience the LLMs work better with frameworks that have more rigid guidance. Something like Tailwind has a body of examples that work together, language to reason about the behavior needed, higher levels of abstraction (potentially), etc. This seems to be helpful.

The LLMs can certainly use raw CSS and it works well, the challenge is when you need consistent framing across many pages with mounting special cases, and the LLMs may make extrapolate small inconsistencies further. If you stick within a rigid framework, the inconsistencies should be less across a larger project (in theory, at least).

65. frikk ◴[] No.46210545[source]
I like your perspective. What do you think "2026: the reconciliation" looks like?

Also I need to track down that Star Trek TNG episode... it sounds poignant.

66. jakubmazanec ◴[] No.46210903{8}[source]
> An hour later, I shipped a release build

I would love to see that pull request, and how readable and maintainable the code is. And do you understand the code yourself, since you've never done this before?

67. RScholar ◴[] No.46211507[source]
I sure do. I believe it's the first-season episode "When the Bough Breaks" (S01E16). That show tackled so many heavy topics right out of the gate... I respect the hell out of the courage to try, even if it produced some pretty epic whiffs along with the home runs and standing doubles.
68. allochthon ◴[] No.46211894[source]
I have a hobby project on the side involving radio digital signal processing in Rust that I've been pure vibe coding, just out of curiosity to see how far I can get. On more than one occasion the project has gotten bogged down in a bug that is immensely challenging to resolve. And since it isn't in an area I have experience with, and I don't have a solid "theory of the program" (it's a gray box because I've been vibe coding it), I've definitely seen CC get stuck and introduce regressions on tricky issues we previously worked through.

The use of Claude Code with my day job has been quite different. In my day job, I understand the code and review it carefully, and CC has been a big help.

69. maddmann ◴[] No.46213472{4}[source]
I think there is going to be a 2-3 year lag in understanding how LLMs actually impact developer productivity. There are way too many balls in the air, and anyone claiming specific numbers on productivity increase is likely very, very wrong.

For example, citing staff engineers introduces a bias: they have years of traditional training and are obviously not representative of software engineers in general.

replies(1): >>46218235 #
70. wilsonnb3 ◴[] No.46213628{4}[source]
Assuming 40 hours a week of work time, you’re claiming a ~25x speed up, which is laughably absurd to me.

It will take you 2.5 months to accomplish what would have taken you five years, that is the kind of productivity increase you’re describing.

It doesn’t pass the smell test. I’m not sure that going from assembly to python would even have such a ludicrous productivity enhancement.

71. dent9 ◴[] No.46214135{3}[source]
Lol, that's like saying that because you found the solution on Stack Overflow you didn't write the program.

News flash buddy: YOU never wrote any code yourself either. Literally every single line of code you've ever committed to a repo was first written by someone else and you just copied it and modified it a little.

72. dent9 ◴[] No.46214153[source]
Agentic AI is a pretty huge scam. Every organization worth its salt has so many IAM protections in place that an AI developer is useless, because you can't give it SSO and OIDC credentials to access the company resources. And no IT team will ever let it. So all these folks trying to convince us that AI will ever deploy anything useful are lying their butts off; IT won't even let their own developers deploy anything, so why would an AI be treated any differently?
73. hattmall ◴[] No.46214479[source]
>Like, is there truly an agentic way to go 10x

Yes, absolutely.

>or is there some catch?

Yes, absolutely.

The catch is that to go 10x you have to either do a lot of work of the variety that AI excels at, mainly boilerplate and logical but tedious modifications. There's a lot of code I can write where I will probably need to check the syntax and implementations for 10 or more functions/methods, even though I know what they are and how I want the code to flow. AI never really nails it, but it gets close enough that I can fix it with considerable time savings. The major requirement here is that I, for the most part, already knew almost exactly what I wanted to do. This is the really fancy auto-complete that is actually a pretty reasonable assistant.

The other way is that you have to start from a position of 0.1x (or less) and go to ~1x.

There are a tremendous amount of people employed in tech roles, but outside of actual tech companies that have very very low throughput.

I've recently worked in a very large non-tech firm but one that is part of a major duopoly and is for the most part a household name worldwide. They employ 1000s of software developers whose primary function is to have a vague idea of who they should email about any question or change. The ratio of emails to lines of code is probably 25:1.

The idea that you could simply ask an AI to modify code, and it might do it correctly, in only a day is completely mind blowing to people whose primary development experience is from within one of these organizations.

replies(1): >>46214613 #
74. godelski ◴[] No.46214591[source]

  > for bash scripts etc
Everyone keeps telling me that it's good for bash scripts but I've never had real success.

Here's an example from today. I wanted to write a small script to grab my Google Scholar citations, and I'm terrible with web stuff, so I ask the best way to parse the curl output. First off, it suggests I use a Python package (seriously? For one line of code? No thanks!), but then it gets the wrong grep. So I pull up the page source, copy paste some of it, and try to parse it myself. I already have a better grep command, and for the second time it's telling me to use Perl regex (why does it love -P as much as it loves delve?). Then I'm pasting in my new command, showing it my output, and asking for the awk and sed parts while googling the awk I always forget. It messes up the sed parts, so I fix it, which means editing the awk part slightly, but I already had the SO post open that I needed anyways. So I saved maybe one minute total?

Then I give it a skeleton of a script file with the variables I wanted, fully expecting it to be a simple cleanup. No. Its output is definitely below average; I've never seen an LLM produce bash functions without being explicitly told (not that the same isn't also true for the average person). But hey, it saved me the while loop for the args, so that was nice. So it cost as much time as it gave back.

Don't get me wrong, I find LLMs useful, but they're nowhere near game-changing like everyone says they are. I'm maybe 10% more productive? But I'm not convinced that's even true. And sure, I might have been able to do less handholding by using agents and having them build test cases, but for a script that took 15 minutes to write? Feels like serious overkill. And this is my average experience with them.

Is everyone just saying it's so good at bash because no one is taking the time to learn bash? It's a really simple language that every Linux user should know the basics of...
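For what it's worth, the pipeline described above doesn't need Perl regex at all; plain grep and sed do it. A minimal sketch against an inlined HTML snippet (the `gsc_rsb_std` class name is my assumption about Scholar's current markup, so the live page may differ; the snippet stands in for the curl output):

```shell
#!/bin/sh
# Stand-in for the curl output of a Scholar profile page; the class name
# gsc_rsb_std is an assumption about the page's markup, not verified.
html='<td class="gsc_rsb_std">1234</td><td class="gsc_rsb_std">567</td>'

# No grep -P needed: basic -o plus a sed cleanup pulls the first number
# (total citations) out of the stats-table cells.
citations=$(printf '%s\n' "$html" \
  | grep -o 'gsc_rsb_std">[0-9]*' \
  | sed 's/.*">//' \
  | head -n 1)
echo "$citations"
```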

75. godelski ◴[] No.46214613[source]
Okay, I got two questions and I never seem to get satisfactory answers but I'm actually curious.

1) What kind of code are you writing that's mostly boilerplate?

2) Why are you writing code that's mostly boilerplate and not code that generalizes boilerplate? (read: I'm lazy. If I'm typing the same things a lot I'm writing a script instead)

I'd think maybe the difference is in what we program but I see say similar things to you that program the types of things I program so idk

76. nasmorn ◴[] No.46215756{5}[source]
Not everyone bills hourly. I mostly do fixed price contracts
77. GrinningFool ◴[] No.46217690{4}[source]
This is remarkably similar to the process we had to follow a couple of decades ago, when offshoring to IT mills: spell out every little detail in small steps, iterate often, and you'll usually get most of what you want.
78. KallDrexx ◴[] No.46218235{5}[source]
FWIW I only mentioned staff engineers because the survey found staff+ engineers reported the highest time savings. The survey itself had time-savings averages for junior (3.9), mid-level (4.3), senior (4.1), and staff (4.4).
79. _superposition_ ◴[] No.46218926{4}[source]
This. I find constraints to be very important. It's fairly obvious an LLM can tackle a class or function. It's still up to the human to string it all together. I'm not quite sure how long that will last, though. Seems more of an engineering problem to me. At the end of the day, you absolutely can get good outputs from these things if you provide the proper input. Everything else is orchestration.
80. agent281 ◴[] No.46220079{8}[source]
I appreciate this example. This does seem like a pretty difficult feature to build de novo. Did you already have some machine vision work integrated into your app? How are you handling machine vision? Is it just a call to an LLM API? Or are you doing it with a local model?
81. agent281 ◴[] No.46220199{5}[source]
I understood their comment as going from

$100 / hour * 100 hours

to

$100 / hour * 500 hours

not to

$500 / hour * 100 hours

replies(2): >>46222662 #>>46230122 #
82. thfuran ◴[] No.46222662{6}[source]
But it specifically mentions having 5x the previous hourly rate.
83. davidjytang ◴[] No.46229606[source]
3X means what you got done in 30 days now only takes 10 days, yeah?

66.7% time saving just by having AI do the boring stuff.

84. nasmorn ◴[] No.46230122{6}[source]
Yeah, the last one. The others would require 5x deal flow, which LLMs might not help deliver at all. But the last one should exist if 10x is true. Not every client can have already fully priced in LLM improvements; people have contracts negotiated pre-LLM. I have not heard of this though, so I have to remain sceptical.