Most active commenters
  • infecto(13)
  • ForHackernews(5)
  • acdha(4)
  • hn_throwaway_99(3)
  • xyzzy9563(3)
  • macrolime(3)
  • freejazz(3)

←back to thread

502 points alazsengul | 78 comments | | HN request time: 2.053s | source | bottom
Show context
pm90 ◴[] No.44564397[source]
I think the amount of turmoil around these deals is giving more weight to the possibility that we’re in a massive bubble thats quite divorced from any kind of fundamentals. Sooner or later the bubbles gonna burst.
replies(13): >>44564436 #>>44564444 #>>44564507 #>>44564837 #>>44564856 #>>44564871 #>>44565061 #>>44566422 #>>44568840 #>>44570092 #>>44570792 #>>44571345 #>>44572790 #
nikcub ◴[] No.44564871[source]
> divorced from any kind of fundamentals

Anthropic ARR went $1B -> $4B in the first half of this year. They're getting my $200 a month and it's easily the best money I spend. There's definitely something there.

replies(22): >>44564952 #>>44564962 #>>44565035 #>>44565278 #>>44565374 #>>44565387 #>>44565422 #>>44565447 #>>44565517 #>>44565637 #>>44565761 #>>44565844 #>>44566449 #>>44567425 #>>44568353 #>>44569351 #>>44569976 #>>44570595 #>>44571349 #>>44572134 #>>44575913 #>>44579934 #
1. hn_throwaway_99 ◴[] No.44565447[source]
"Sooner or later the bubble's gonna burst" and "There's definitely something there" aren't mutually exclusive - in fact they often go together.

It makes me perhaps a little sad to say that "I'm showing my age" by bringing up the .com boom/bust, but this feels exactly the same. The late 90s/early 00s were the dawn of the consumer Internet, and all of that tech vastly changed global society and brought you companies like Google and Amazon. It also brought you Pets.com, Webvan, and the bajillion other companies chronicled in "Fucked Company".

You mention Anthropic, which I think is in a good a position as any to be one of the winners. I'm much less convinced about tons of the others. Look at Cursor - they were a first moving leader, but I know tons of people (myself included) who have cancelled their subscription because there are now better options.

replies(9): >>44565540 #>>44565789 #>>44566666 #>>44567060 #>>44567139 #>>44570863 #>>44572666 #>>44581313 #>>44582976 #
2. pqtyw ◴[] No.44565540[source]
Not really much a of stuck bubble this time, though. Besides Nvidia and a handful of other HW companies, at least. Almost all of the very high valuations are for private companies and usually the amount of actual money involved in is relatively low.
3. ttrmw ◴[] No.44565789[source]
what're you finding better than cursor now?
replies(3): >>44565962 #>>44566263 #>>44566387 #
4. rock_hard ◴[] No.44565962[source]
Devin is light years ahead of Cursor. It’s not even the same category!

I stopped writing code by hand almost entirely and my output (measured in landed PRs) has been 10x

And when I write code myself then it’s gnarly stuff and I want AI to get out of my way…so I just use Webstorm

5. andrewmutz ◴[] No.44566263[source]
Cline is absolutely fantastic when you combine it with Sonnet 4. Always use plan mode first and always have it write tests first (have it do TDD). It changed me from a skeptic to a believer and now I use it full time.
replies(1): >>44566470 #
6. g42gregory ◴[] No.44566387[source]
Claude Code with Pro, Max100, or Max200 subscriptions. Works with any IDE including none.

For the time being, nothing comes close, at least for me.

replies(4): >>44566592 #>>44569808 #>>44572658 #>>44580956 #
7. v5v3 ◴[] No.44566470{3}[source]
How much is it costing you?
replies(2): >>44566526 #>>44566949 #
8. jjmarr ◴[] No.44566526{4}[source]
As much as you theoretically want to spend, since it's pay-per-use.

I spend $200/month by using Sonnet 4. Could be higher if you want to use Opus.

replies(1): >>44568981 #
9. 6Az4Mj4D ◴[] No.44566592{3}[source]
Can you please share your Claude usage workflow?

I use Github copilot and often tend to be frustrated. It messes up old things while making new. I use Claude 4 model in GH CP.

replies(3): >>44567488 #>>44569394 #>>44572987 #
10. ForHackernews ◴[] No.44566666[source]
I genuinely don't understand what value Cursor itself brings. It's like a wrapper for some APIs, right? As far as I can tell there's like four actual AI firms in the world and everyone else is trying to whitelabel. It reminds me of the hosting industry in the early 2000s.
replies(7): >>44566793 #>>44566961 #>>44567150 #>>44567216 #>>44570921 #>>44571747 #>>44601127 #
11. xyzzy9563 ◴[] No.44566793[source]
It's a very well done wrapper that improves your coding productivity a lot.
replies(1): >>44567025 #
12. ewoodrich ◴[] No.44566949{4}[source]
I use Roo Code (Cline fork) and spend roughly $15-30/mo by subscribing to Github Copilot Pro for $10/mo for unlimited use of GPT-4.1 via the VS Code LM API, and a handful of premium credits a month (I use Gemini 2.5 Pro for the most part).

Once I max out the premium credits I pay-as-you-go for Gemini 2.5 Pro via OpenRouter, but always try to one shot with GPT 4.1 first for regular tasks, or if I am certain it's asking too much, use 2.5 Pro to create a Plan.md and then switch to 4.1 to implement it which works 90% of the time for me (web dev, nothing too demanding).

With the different configurable modes Roo Code adds to Cline I've set up the model defaults so it's zero effort switching between them, and have been playing around with custom rules so Roo could best guess whether it should one shot with 4.1 or create a plan with 2.5 Pro first but haven't nailed it down yet.

replies(1): >>44572178 #
13. cheema33 ◴[] No.44566961[source]
> I genuinely don't understand what value Cursor itself brings. It's like a wrapper for some APIs, right?

By similar token Windows is mostly a wrapper around Intel and AMD and now Qualcomm CPUs. Cursor/Windsurf add a lot of useful functionality. So much so so that Microsoft GitHub Copilot is losing marketshare to these guys.

14. gruez ◴[] No.44567025{3}[source]
The problem is that they have no moat and the underlying provider can easily cut them out.
replies(3): >>44567141 #>>44567442 #>>44567882 #
15. joshdavham ◴[] No.44567060[source]
> It makes me perhaps a little sad to say that "I'm showing my age"

Please don't say stuff like that.

As a 20-something who was in diapers during the dot-com boom, I really appreciate your insight. Thanks for sticking around on HN!

replies(2): >>44573951 #>>44578419 #
16. infecto ◴[] No.44567139[source]
Feels nothing like the same. The .com bubble was largely companies with no business, unchanged revenue but still having massive swings in price in private and public markets.

Cursor has a $500mm ARR your anecdote might be meaningful in the medium turn but so far growth as not slowed down.

replies(3): >>44567213 #>>44567522 #>>44572767 #
17. infecto ◴[] No.44567141{4}[source]
I think you underestimate the difficulty in getting the tool chain running efficiently in the IDE. It's a significant moat and I suspect their spend is too attractive to cut them off from an API especially when most of the model providers are not exactly competing fully in this space yet or at least not with the same enthusiasm.
replies(1): >>44570868 #
18. infecto ◴[] No.44567150[source]
Amazing how folks make comments without even trying it and especially making a comment similar to how Dropbox is simply rsync, right?

It is a lot less trivial than people like yourself make it out to be to get an effective tool chain and especially do it efficiently.

replies(1): >>44569552 #
19. acdha ◴[] No.44567213[source]
> The .com bubble was largely companies with no business

Ah, yes, companies like Amazon.com, eBay, PayPal, Expedia, and Google. Never heard of those losers again. Not to mention those crazy kids at Kozmo foolishly thinking that people would want to have stuff delivered same-day.

The two lessons you should learn from the .com bubble are that the right idea won’t save you from bad execution, and that boom markets–especially when investors are hungry for big returns–can stay inflated longer than you think. You can be early to market, have a big share, and still end up like Netscape because Microsoft decided to take the money from under the couch cushions and destroy your revenue stream. That seems especially relevant for AI as long as model costs are high and nobody has a moat: even if you’re right on the market, if someone else can train users to expect subsidized low prices long enough you’ll run out of runway.

replies(2): >>44567342 #>>44569012 #
20. lelanthran ◴[] No.44567216[source]
> I genuinely don't understand what value Cursor itself brings. It's like a wrapper for some APIs, right? As far as I can tell there's like four actual AI firms in the world and everyone else is trying to whitelabel. It reminds me of the hosting industry in the early 2000s.

Yes, there's (maybe?) four, but they're at the very bottom of the value chain.

Things built on top of them will be higher up the value chain and (in theory anyway) command a larger margin, hence a VC rush into betting on which company actually makes it up the value chain.

I mean, the only successes we see now are with coding agents. Nothing else has made it up the value chain except coding agents. Everything else (such as art and literature generation) is still on the bottom rung of the value chain.

That, by definition alone, is where the smallest margins are!

21. infecto ◴[] No.44567342{3}[source]
You’re right that many .com companies lacked fundamentals but you’re cherry-picking survivors. For every Amazon, there were dozens of Pets.coms. The current AI wave does feel different in terms of revenue traction (e.g., Cursor’s $500M ARR), but the broader lesson still applies: hype cycles don’t discriminate between good and bad execution in the short term.

Cursor’s growth is impressive, but sustained dominance isn’t guaranteed. Distribution, margins, and defensibility still matter and we haven’t seen how durable any of that is once incentives tighten and infra costs stop being subsidized.

replies(1): >>44570067 #
22. xyzzy9563 ◴[] No.44567442{4}[source]
Not really, it's pretty hard to get the editor and code editing via AI working as well as they did.
replies(1): >>44569002 #
23. fzzzy ◴[] No.44567488{4}[source]
I use github copilot chat right now. First I use ask mode to ask it a question about the state of the codebase outlining my current understanding of the condition of the code. "I'm trying to x, I think the code currently does y." I include a few source files that I am talking about. I correct any misconceptions about the plan the llm may have and suggest stylistic changes to the code. Then once the plan seems correct, I switch to agent mode and ask it to implement the change on the codebase.

Then I'll look through the changes and decide if it is correct. Sometimes can just run the code to decide if it is correct. Any compilation errors are pasted right back in to the chat in agent mode.

Once the feature is done, commit the changes. Repeat for features.

replies(2): >>44567641 #>>44570827 #
24. overfeed ◴[] No.44567522[source]
> The .com bubble was largely companies with no business, unchanged revenue but still having massive swings in price in private and public markets.

There also were companies like Sun and Cisco who had real, roaring business and lots of revenue that depended on loose start-up purse-strings, and VC exuberance...

Sun and Cisco both survived the .com bust, but were never the same, nor did theu ever reach their high-water marks again. They were shovel-sellers, much like Amazon and Nvidia in 2025.

replies(2): >>44567787 #>>44569737 #
25. 6Az4Mj4D ◴[] No.44567641{5}[source]
Does it remember context from chat mode and when you switch to agent mode?
replies(2): >>44568543 #>>44568645 #
26. ido ◴[] No.44567787{3}[source]
Or yahoo- they were the premier sellers of ad space online (like google today) and made a lot of money from over-funded tech companies overpaying for online advertising during the boom years.
27. yunwal ◴[] No.44567882{4}[source]
> and the underlying provider can easily cut them out

what? Do you think providers (or their other customers) don’t care about the business implications of a decision like this? All so that cursor can bring their significant customer base to a nearly-indistinguishable competitor?

28. addandsubtract ◴[] No.44568543{6}[source]
Yes. I think it used to be separate tabs, but now chat/agent mode is just a toggle. After discussing a concept, you can just switch to agent mode and tell it to "implement the discussed plan."
29. Paradigma11 ◴[] No.44568645{6}[source]
Yes, it can't change between edit and ask/agent without losing context but ask <-> agent is no problem. You can also change to your custom chat modes https://code.visualstudio.com/docs/copilot/chat/chat-modes without losing context. At least that's what I just did in VSCode Insiders.

Here are some nice copilot resources: https://github.com/github/awesome-copilot

Also, I am using tons of markdown documents for planning, results, research.... This makes it easy to get new agent sessions or yourself up to context.

30. macrolime ◴[] No.44568981{5}[source]
You can use Claude Code as a provider if you want it subscription based

https://docs.cline.bot/provider-config/claude-code

31. macrolime ◴[] No.44569002{5}[source]
If it's so hard, then why are there multiple open source projects that are just as good?
replies(1): >>44571571 #
32. macrolime ◴[] No.44569012{3}[source]
Thing is that it took 10-15 years for the stocks of these companies to reach the same marketcap again.
replies(1): >>44570001 #
33. csomar ◴[] No.44569394{4}[source]
GitHub Copilot models are intentionally restricted, which unfortunately makes them less capable.

I'm not the original poster, but regarding workflow, I've found it works better to let the LLM create one instead of imposing my own. My current approach is to have 10 instances generate 10 different plans, then I average them out.

34. ForHackernews ◴[] No.44569552{3}[source]
That's why Dropbox is a trillion dollar company and not a feature called iCloud, right?
replies(1): >>44569719 #
35. infecto ◴[] No.44569719{4}[source]
Are you trying to make a point or just being defensive for no reason? You called out something without having any experience or knowledge and then did the classic “it’s just a wrapper”.
replies(1): >>44569742 #
36. infecto ◴[] No.44569737{3}[source]
For sure but unlike then we are in a very different buying environment. Investors are more discerning even though folks here would like tot think differently. Cisco had something at peak like a 179x pe. That is a vastly different world than what we see Nvidia at today. I am not saying it cannot fail or collapse but to say this feels like the .com bubble is wrong.
37. ForHackernews ◴[] No.44569742{5}[source]
Sorry, I thought the point was clear: Dropbox (file sync) is a feature, not a product. Cursor (AI in your IDE) likewise, is a feature, not a product.

I am old and I remember when you could make a lot of money offering "Get Your Business On The Information Superhighway" (HTML on Apache) and we're in that stage of LLMadness today, but I suspect it will not last.

replies(1): >>44569875 #
38. cft ◴[] No.44569808{3}[source]
My problem with Claude code versus Cursor is that with Cursor I could "shop around" the same context with different foundational model providers, often finding bugs this way or or getting insights.

Sometimes one model would get stuck in their thinking and submitting the same question to a different model would resolve the problem

replies(1): >>44571324 #
39. infecto ◴[] No.44569875{6}[source]
“It's like a wrapper for some APIs, right?”

Don’t be sorry it shows your true colors. The point stands that you continue to step around. Cursor and other tools like it are more than a trivial wrapper but of course you have never used them so you have no idea. At least give yourself some exposure before projecting.

Dropbox is still a $5+bn business. Cursor is still growing, will it work out, I don’t know but lots of folks are seeing the value in these tools and I suspect we have not hit peak yet with the current generation. I am not sure what a service business like a small biz website builder has to do with Cursor or other companies in adjacent spaces.

replies(1): >>44570241 #
40. acdha ◴[] No.44570001{4}[source]
That’s what people predicted bit, for example, on Amazon’s case it was less than 3 years because they just kept posting solid numbers. The thing which all of those companies have in common is that they stood out from the Pets.com types in having profitable revenue - they didn’t need a miracle to be profitable, only for customers to keep buying.
41. acdha ◴[] No.44570067{4}[source]
My point in listing survivors was simply to make the point that while there were plenty of doomed businesses, there were also many giants which were big at the time and could be told apart by looking at their fundamentals — they had real people paying them money for tangible things at a price which could be profitable. Amazon famously reported low numbers due to reinvestment but they were profitable in most business segments a few years after entering, which was quite different from the “lose money on every sale, make it up on volume” plays many dotcoms made.
replies(1): >>44570432 #
42. ForHackernews ◴[] No.44570241{7}[source]
What "true colors"? I think I've been pretty consistent that I think Cursor is a commodity. You're surprisingly hostile and defensive about your preferred autocomplete plugin. You're right, I haven't used Cursor, but I've used similar tools like Copilot.

Your characterization of hosting as "a small biz website builder" is revealing. https://finance.yahoo.com/quote/GDDY/ is the one that made it and is now a $24B firm, but there were at least dozens of these companies floating around in the early 2000s.

Why are you so sure Cursor is the new GoDaddy and not the new Tripod? https://www.tripod.lycos.com/

replies(1): >>44570406 #
43. infecto ◴[] No.44570406{8}[source]
You’ve been dismissive from the start without any real engagement with the product. I pointed out that you haven’t used Cursor, and rather than reflect on that, you’ve responded with sarcasm and condescension.

The only person being defensive here is you. My point was simple: tools like Cursor are more than just “wrappers.” Whether it becomes a massive business or not, revenue is growing, and clearly many users find enough value to justify the subscription. You don’t have to like it but writing it off without firsthand experience just weakens your argument.

At this point, you’re debating a product you haven’t tried, in a market you’re not tracking. Maybe sit this one out unless you have something constructive to say beyond “it’s just a wrapper”.

44. infecto ◴[] No.44570432{5}[source]
How does that refute the statement you quoted? I said the vast majority of companies during the bubble had no business, were run on hype dollars and had insane P/E ratios. That supports a handful of companies making it through the bloodbath, but also a cherry picked examples that neither refutes my claim or supports yours.
replies(1): >>44572751 #
45. ghm2180 ◴[] No.44570827{5}[source]
I also do the same. I am on the 200$ maxpro plan. I often let the plan go to pretty fine level of detail, e.g. describe exactly what test conditions to check, what exact code conditions to follow. Do you write this to a separate plan file? I find myself doing this a lot since after compaction Claude starts to have code drift.

Do you also get it to add to it's to-do list?

I also find that having the o3 model review the plan helps catch gaps. Do you do the same?

46. code51 ◴[] No.44570863[source]
Anthropic is actually a good point to focus on since Claude is very good proof that it's not about the scaling. We are not quite there yet but we are "programming" through how we shape and filter the input data for training it seems. With time, we'll understand the methods to better represent.

Current situation doesn't sound too good for "scaling hypothesis" itself.

replies(1): >>44574146 #
47. dboreham ◴[] No.44570868{5}[source]
Interesting to see multiple posts here saying this. Pretty clearly it isn't true. The IDE is owned by Microsoft. The model is owned by Anthropic or Google or whoever. A business can't be made from a thin sandwich filling between the two.
replies(1): >>44570961 #
48. rickyhatespeas ◴[] No.44570921[source]
The value is the UX/DevX. Though, they are essentially just a fork of VS Code so it's hard to justify using instead of VS Code + Copilot or Continue which is almost the same UX now.

That's the problem with most "AI" products/companies that still isn't being answered. Why do people use your tool/service if you don't own the LLM which is most of the underlying "engine"? And further, how do you stay competitive when your LLM provider starts to scale RL with whatever prompting tricks you're doing, making your product obsolete?

replies(1): >>44574142 #
49. infecto ◴[] No.44570961{6}[source]
Hey guys we have a guy here stating pretty clearly it isn’t true. He clearly is the authority on the topic because he said so.

The shell of the IDE is open source. It’s true there is some risk on the supply of models and compute but again none of those, except MSFT which does not even own any of the SOTA models, have any direct competition. OpenAI has codex but it’s half baked and being bundled in ChatGPT. It is in nobodies interest to cut off Cursor as at this point they are a fairly sustained and large customer. The risk exists but feels pretty far fetched until someone is actively competing or Cursor gets bought out by a OpenAI.

Again, what proof do you have that there is zero complexity or most being driven by the sandwich filling. Most of OpenAIs valuation is being driven by the wrapper ChatGPT not API usage. I have written a number of integrations with LLM APIs and while some of it just works, there is a lot of nuance to doing it effectively and efficiently at scale. If it was so simple why would we not see many other active competitors in this space with massive MAUs?

50. virgildotcodes ◴[] No.44571324{4}[source]
I’m unaffiliated, but I’ve really been enjoying this - https://github.com/BeehiveInnovations/zen-mcp-server

It allows you to have CC shoot out requests to o3, 2.5 pro and more. I was previously bouncing around between different windows to achieve the same thing. With this I can pretty much live in CC with just an editor open to inspect / manually edit files.

51. xyzzy9563 ◴[] No.44571571{6}[source]
They're not just as good unless you are willing to spend huge amounts in API credits.
52. komali2 ◴[] No.44571747[source]
Don't they have their own model for the inline completions? For me I find it really nice to preserve some brain energy by doing one repetitive change and just tab spamming it to get it done everywhere. I could get it done maybe just as fast with a macro in vim, but, Cursor lets me preserve the brain energy for something harder.

Meanwhile other "wrappers" e.g. in nvim or whatever, don't have this feature, they just have slightly better autocomplete than bare LSP.

53. greggh ◴[] No.44572178{5}[source]
Looking at Cline, wondering what the real selling points for Roo Code are. Any chance you can say what exactly made you go with Roo Code instead of Cline?
replies(1): >>44573212 #
54. hn_throwaway_99 ◴[] No.44572658{3}[source]
This was my answer as well. And I think it just highlights all the serious dangers for the "API wrapper companies" compared to the foundation model companies.

User experience is definitely worth something, and I think Cursor had the first great code integration, but then there is very little stopping the foundation model companies from coming in and deciding they want to cut out the middleman if so desired.

55. SJC_Hacker ◴[] No.44572666[source]
> It also brought you Pets.com, Webvan, and the bajillion other companies chronicled in "Fucked Company".

The irony with Webvan, they had the right idea about 15 years too early. Now we have InstaCart, DoorDash, etc. You really needed the mobile revolution circa 2010 for it to work.

Pets.com is essentially Chewy (successful pet focused online retailer)

So, neither of those ideas were really terrible in the same vain as say Juicera, or outright frauds like Theranos. Overvalued and ill-timed, sure

replies(1): >>44575118 #
56. acdha ◴[] No.44572751{6}[source]
You said “largely” and I think that’s painting with too broad a brush. The dotcom world included a bunch of companies which are still around (or were acquired later after surviving the collapse), and it wasn’t hard to tell who those were even at the time. There was a lot of lazy boosterism and criticism painting the whole field as the same, and that was a disservice to readers who could’ve used a more thoughtful triage approach. That’s especially the case for companies like Kozmo which actually had a popular idea and had the potential to be profitable (they were in most urban markets) but made the mistake of expanding too quickly or taking on more debt than they could service.
replies(1): >>44582530 #
57. freejazz ◴[] No.44572767[source]
>Feels nothing like the same. The .com bubble was largely companies with no business, unchanged revenue but still having massive swings in price in private and public markets.

I'm an attorney that got pitched the leading legal AI service and it was nothing but junk... so I'm not sure why you think that's different from what's going on right now.

replies(2): >>44576948 #>>44581893 #
58. mkozlows ◴[] No.44572987{4}[source]
Github Copilot is weirdly bad, and all the alternatives are better. Sometimes people think "they have the same model, must be the same," but it's not.
59. ewoodrich ◴[] No.44573212{6}[source]
Cline has two modes (Plan and Act) which work pretty well but Roo Code has 5 modes by default. (Code, Ask, Architect, Orchestrator, Debug) and it's designed so that users can add custom modes. e.g. I added a Code (simple) mode with instructions about the scale/complexity of tasks it can handle or decide to pass it to Code for a better model. I also changed the Architect mode to evaluate whether to redirect the user to Code or Code (simple) after generating a plan.

Roo Code just has a lot more config exposed to the user which I really appreciate. When I was using Cline I would run into minor irritating quirks that I wished I can change but couldn't vs. Roo where the odds are pretty good there are some knobs you could turn to modify that part of your workflow.

60. walthamstow ◴[] No.44573951[source]
Seconded
61. lizardking ◴[] No.44574142{3}[source]
Cursor isn't my preferred environment for development, but for me, it yields much better results than Copilot. Continue and Cline bugged out on me so badly, and so frequently, that I didn't find them worth using. YMMV
replies(1): >>44576078 #
62. yourapostasy ◴[] No.44574146[source]
> Current situation doesn't sound too good for "scaling hypothesis" itself.

But the “scaling hypothesis” is the easiest, fastest story to raise money. So it will be leveraged until conclusively broken by the next advancement.

63. fuzzieozzie ◴[] No.44575118[source]
Don't forget that Amazon's market cap was higher in 1999 than 2009!
replies(1): >>44584507 #
64. danudey ◴[] No.44576078{4}[source]
If you're interested, I recommend checking out Kiro from Amazon.

Every time I've tried Copilot or Cursor, it's happily gone off and written or rewritten code into a state it seemed very proud of, and which didn't even work, let alone solve the problem I put to it.

Meanwhile, Kiro:

1. Created a requirements document, with user stories and acceptance criteria, so that we could be on the same page about the goals

2. Once I signed off on that, it then created a design document, with code examples, error handling cases, and an architecture diagram, for me to review

3. After that looked good, it set about creating an itemized task list for each step of the implementation, broken down into specific tasks and sub-tasks and including which of the acceptance criteria from step 1 that task addressed

4. I could go through the document task by task, ask it to work on it, and then review the results

At one point, it noticed that the compiler had reported a minor issue with the code it had written, but correctly identified that resolving that issue would involve implementing something that was slated for a future task, so it opted to ignore the issue until the appropriate time.

For once, I found myself using an AI tool that handled the part of the job I hate the most, and am the worst at: planning, diagramming, and breaking down tasks. Even if it hadn't been able to write any working code at all, it already created something useful for me that I could have built off of, but it did end up writing something that worked great.

In case anyone is curious about the files it created, you can see them here: https://github.com/danudey/rust-downloader/pull/4

Note that I'm not really familiar with Rust (as most of the code will demonstrate), so it would probably have been far faster for an experienced Rust programmer to implement this. In my case, though, I just let it do its thing in the background and checked in occasionally to validate it was doing what I expected.

replies(2): >>44576648 #>>44586035 #
65. ForHackernews ◴[] No.44576648{5}[source]
Wow, that sounds like it might be useful!
66. namadaza ◴[] No.44576948{3}[source]
Was the leading legal AI service Harvey.ai by any chance? I feel like big VC money goes to solving legal analysis, but I'm seeing a lot of wins with document drafting/templating.

Briefpoint.ai, casely.ai, eve.legal etc. I work with an attorney who trained his paralegals to use chatgpt + some of these drafting tools, says it's significantly faster than what they could've done previously.

replies(1): >>44577135 #
67. freejazz ◴[] No.44577135{4}[source]
It was Vincent.

> I feel like big VC money goes to solving legal analysis, but I'm seeing a lot of wins with document drafting/templating.

What do you mean "wins?" Like motions won with AI drafted papers? I'm skeptical.

>I work with an attorney who trained his paralegals to use chatgpt + some of these drafting tools, says it's significantly faster than what they could've done previously.

I'd be concerned about malpractice, personally. The case reviews I've seen from Vincent (which is ChatGPT + the entire federal docket) are shocking in how facially wrong they can be. It's one thing for an attorney to use ChatGPT when they do know the law and issues (hasn't seemed to help the various different partners getting sanctioned for filing AI drafted briefs) but to leave the filtering to a paralegal? That's insane, imo.

68. hn_throwaway_99 ◴[] No.44578419[source]
Thanks, that's very kind!

I probably mean it less as "I'm too old" and more of "Wow, time really flies".

To me, who started my career in the very late 90s, the .com boom doesn't really seem that long ago. But then I realize that there is about the same amount of time between now and the .com boom, and the .com boom and the Altair 8800, and I think "OK, that was a loooong time ago". It really is true what they say, that the perception of time speeds up the older you get, which is both a blessing and a curse.

Regarding AI, it's a bit fascinating to me to think that we really only had enough data to get generative AI to work in the very recent past, and nearly as soon as we had enough data, the tech appeared. In the past I would have probably guessed that the time between having enough data and the development of AI would have been a lot longer.

69. d_sc ◴[] No.44580956{3}[source]
Amp Code is also very good, they released about 2 weeks ago their Oracle feature which leverages o3 to do reviews (https://ampcode.com/news/oracle). Amp leans the closer to Claude Code more than other solutions I’ve seen so far, the team there is really leaning into the agentic approach.

I watch the changes on Kilo Code as well (https://github.com/Kilo-Org/kilocode). Their goal is to merge the best from Cline & Roo Code then sprinkle their own improvements on top.

70. justinclift ◴[] No.44581313[source]
We still use Windsurf at work, but I've cancelled my personal subscription as (frankly) the code which Claude Sonnet 3.7 (thinking) generates is just not good enough, and much of the time its analysis of issues is also not correct. :(

That being said _sometimes_ its analysis is actually correct, so it's not a total miss. Just not something I'm willing to pay for when Ollama and free models exist.

71. infecto ◴[] No.44581893{3}[source]
Never heard Vincent to be leading in the legal space but I know vlex is more popular outside of the US so perhaps that’s why.

I am not sure why you would think your single anecdote is defensible or evidence to prove much. My perspective is valuations that are going on right now don’t have multiples that are that wild especially if we aren’t compare it to the com bubble.

replies(1): >>44587410 #
72. infecto ◴[] No.44582530{7}[source]
You’re sidestepping the core point. Of course some companies had fundamentals, even Kozmo had product market fit in a narrow sense. But the broader ecosystem was bloated with capital chasing flimsy ideas, and most dot-coms had no viable path to profit. That’s not “too broad a brush”, it’s backed by the collapse itself.

Kozmo is a great case study: decent demand, terrible unit economics, and zero pricing power. They didn’t just scale too fast, they scaled a structurally unprofitable model. There was no markup, thin margins, and they held inventory without enough throughput.

Many of these companies may fail but it’s a much different environment and the path to profitability is moving a lot quicker.

73. leguy ◴[] No.44582976[source]
>there are now better options.

Care to share your opinions on which options are better?

74. AceJohnny2 ◴[] No.44584507{3}[source]
What

March 1999: ~$27.7B

Jan 2009: ~$25B (back to $27.7B & rising by Feb)

Huh.

75. lizardking ◴[] No.44586035{5}[source]
What you are describing is effectively my work flow with any LLM tooling, it's just not formalized by the tooling itself. Sounds interesting, I will take it for a spin.
76. freejazz ◴[] No.44587410{4}[source]
>I am not sure why you would think your single anecdote is defensible or evidence to prove much.

Evidence? Prove? What are you talking about. This is just a discussion between people, not some courtroom melodrama you are making it out to be.

>My perspective is valuations that are going on right now don’t have multiples that are that wild especially if we aren’t compare it to the com bubble.

Okay, I could be equally rude to you, but I wont.

replies(1): >>44595204 #
77. infecto ◴[] No.44595204{5}[source]
I responded in the exact same format as yourself and adding my additional thoughts. If that counts a being rude then were you being rude? I don’t believe in the US market Vincent is considered the best tool. That said I don’t believe a lot of the tools in the legal space that try to do everything are that powerful and the ones on my lens that are going well have focused on specific domains and problems. Even Harvey tried to do too much from the start.

As for valuations, when looking at current VC multiples and equity markets, I don’t see the same bubble from a qualitative perspective. Absolutely there is over hype coming from CEOs in public markets but there is a lot of value being driven. I don’t believe the giants are going to do well, maybe the infrastructure plays will but I think we will see a carve out of a new generation of companies driving the change. Unlike ‘99, I am seeing a lot more startups and products with closer to the ground roadmaps to profitability. In 99 so many were running off of hopes and dreams.

If you would actually like to converse I would love to see your perspective but if all you can be is mad please please don’t respond. Nobody is having a courtroom drama other than what’s playing out in your head.

78. wizardpisces771 ◴[] No.44601127[source]
These four AI companies are nothing more than re-packaging GPUs and training data.