Most active commenters
  • ssl-3(3)
  • Duanemclemore(3)
  • brookst(3)

←back to thread

GPT-5.2

(openai.com)
1053 points atgctg | 69 comments | | HN request time: 2.925s | source | bottom
Show context
tenpoundhammer ◴[] No.46236826[source]
I have been using chatGPT a ton over the last months and paying the subscription. Used it for coding, news, stock analysis, daily problems, and a whatever I could think of. I decided to give Gemini a go when version three came out to great reviews. Gemini handles every single one of my uses cases much better and consistently gives better answers. This is especially true for situations were searching the web for current information is important, makes sense that google would be better. Also OCR is phenomenal chatgpt can't read my bad hand writing but Gemini can easily. Only downsides are in the polish department, there are more app bugs and I usually have to leave the happen or the session terminates. There are bugs with uploading photos. The biggest complaint is that all links get inserted into google search and then I have to manipulate them when they should go directly to the chosen website, this has to be some kind of internal org KPI nonsense. Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.
replies(36): >>46236861 #>>46236896 #>>46236956 #>>46236971 #>>46236980 #>>46237123 #>>46237253 #>>46237258 #>>46237321 #>>46237407 #>>46237452 #>>46237531 #>>46237626 #>>46237654 #>>46237786 #>>46237888 #>>46237927 #>>46238237 #>>46238324 #>>46238527 #>>46238546 #>>46238828 #>>46239189 #>>46239400 #>>46239512 #>>46239719 #>>46239767 #>>46239999 #>>46240382 #>>46240656 #>>46240742 #>>46240760 #>>46240763 #>>46241303 #>>46241326 #>>46241523 #
1. solarkraft ◴[] No.46236896[source]
> Only downsides are in the polish department

What an understatement. It has me thinking „man, fuck this“ on the daily.

Just today it spontaneously lost an entire 20-30 minutes long thread and it was far from the first time. It basically does it any time you interrupt it in any way. It’s straight up data loss.

It’s kind of a typical Google product in that it feels more like a tech demo than a product.

It has theoretically great tech. I particularly like the idea of voice mode, but it’s noticeably glitchy, breaks spontaneously often and keeps asking annoying questions which you can’t make it stop.

replies(10): >>46237173 #>>46237180 #>>46237398 #>>46237664 #>>46237832 #>>46238374 #>>46238810 #>>46239600 #>>46240229 #>>46240650 #
2. mnky9800n ◴[] No.46237173[source]
The colab integration is where it shines the most imo.
3. radicaldreamer ◴[] No.46237180[source]
Google’s standard problem is that they don’t even use their own products. Their Pixel and Android team rocks iPhones on the daily, for example.
replies(6): >>46237220 #>>46237510 #>>46238473 #>>46239140 #>>46239896 #>>46241324 #
4. onethought ◴[] No.46237220[source]
I mean there is benefit to understanding competitor well as well?
replies(3): >>46237469 #>>46238131 #>>46238276 #
5. sundarurfriend ◴[] No.46237398[source]
ChatGPT web UI was also like this for the longest time, until a few months ago: all sorts of random UI bugs leading either to data loss or misleading UI state. Interrupting still is very flaky there too. And on the mobile app, if you move away from the app while it's taking time to think, its state would somehow desync from the actual backend thinking state, and get stuck randomly; sometimes restarting the app fixes it, sometimes that chat is that unusable from that point on.

And the UI lack of polish shows up freshly every time a new feature lands too - the "branch in new chat" feature is really finicky still, getting stuck in an unusable state if you twitch your eyebrows at wrong moment.

replies(2): >>46239212 #>>46239320 #
6. LogicFailsMe ◴[] No.46237469{3}[source]
Outweighed by the value of having to suffer with the moldy fruits of their own labor. That was the only way the Android Facebook app became usable as well.
7. RBerenguel ◴[] No.46237510[source]
I would think this is not true
replies(2): >>46238818 #>>46240374 #
8. mmaunder ◴[] No.46237664[source]
Yeah I eventually noped out as I said in another comment and am charging hard with Codex and am so happy about 5.2!!
9. adamkochanowicz ◴[] No.46237832[source]
I also love that I can leave the microphone on (not in live voice mode) while dictating to ChatGPT and pause and think as much as needed.

With Gemini, it will send as soon as I stop to think. No way to disable that.

replies(1): >>46238611 #
10. ssl-3 ◴[] No.46238131{3}[source]
There certainly is.

To posit a scenario: I would expect General Motors to buy some Ford vehicles to test and play around with and use. There's always stuff to learn about what the competition has done (whether right, wrong, or indifferent).

But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

replies(2): >>46238792 #>>46239960 #
11. Forgeties79 ◴[] No.46238276{3}[source]
I wonder how many apple employees walk in to the office with android phones
replies(1): >>46239485 #
12. KronisLV ◴[] No.46238374[source]
> It has me thinking „man, fuck this“ on the daily.

That's sometimes me with the CLI. I can't use the Gemini CLI right now on Windows (in the Terminal app), because trying to copy in multiple lines of text for some reason submits them separately and it just breaks the whole thing. OpenCode had the same issue but even worse, it quite after the first line or something and copied the text line by line into the shell, thank fuck I didn't have some text that mentions rm -rf or something.

More info: https://github.com/google-gemini/gemini-cli/issues/14735#iss...

At the same time, neither Codex CLI, nor Claude Code had that issue (and both even showed shortened representations of copied in text, instead of just dumping the whole thing into the input directly, so I could easily keep writing my prompt).

So right now if I want to use Gemini, I more or less have to use something like KiloCode/RooCode/Cline in VSC which are nice, but might miss out on some more specific tools. Which is a shame, because Gemini is a really nice model, especially when it comes to my language, Latvian, but also your run of the mill software dev tasks.

In comparison, Codex feels quite slow, whereas Claude Code is what I gravitate towards most of the time but even Sonnet 4.5 ends up being expensive when you shuffle around millions of tokens: https://news.ycombinator.com/item?id=46216192 Cerebras Code is nice for quick stuff and the sheer amount of tokens, but in KiloCode/... regularly messes up applying diff based edits.

replies(1): >>46240654 #
13. Der_Einzige ◴[] No.46238473[source]
That’s because they will be bullied out of the dating market if they have a “green bubble”.
replies(2): >>46239184 #>>46240021 #
14. wheelerwj ◴[] No.46238611[source]
How did you do this?
replies(1): >>46238726 #
15. toomuchtodo ◴[] No.46238726{3}[source]
Record button in the app if you’ve got the feature.
16. GenerWork ◴[] No.46238792{4}[source]
>But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

I think you'd be surprised about the vehicle makeup at Big 3 design facilities.

replies(1): >>46239474 #
17. arjie ◴[] No.46238810[source]
Any time its safety stuff triggers, Gemini wipes the context. It's unusable because of this because whatever is going on with the safety stuff, it fires too often. I'm trying to figure out some code here, not exactly deporting ICE to Guantanamo or whatever.
replies(2): >>46238852 #>>46238902 #
18. renewiltord ◴[] No.46238818{3}[source]
Yeah, I've heard that Sundar Pichai dogfoods the latest Pixel at least once a month and sometimes two or three times.
19. dzhiurgis ◴[] No.46238852[source]
On a flip side chatgpt app now has years of history that sometimes useful (search is pretty ok, but could improve) but otherwise I'd like to remove most of it - good luck doing so.
20. rvnx ◴[] No.46238902[source]
The more Gemini and Nano-Banana soften their filters, the more audience it will take from other platforms. The main risk is payment providers banning them, I can't imagine bank card providers to remove payments to Google.
21. free652 ◴[] No.46239140[source]
You cant buy an iPhone without a director approval. And it's like 3 gen behind as well. So no, they don't use iPhones.
replies(4): >>46239225 #>>46239229 #>>46239259 #>>46242398 #
22. gcr ◴[] No.46239212[source]
i basically can't use the ChatGPT app on the subway for these reasons. the moment the websocket connection drops, i have to edit my last message and resubmit it unchanged.

it's like the client, not the server, is responsible for writing to my conversation history or something

replies(2): >>46239585 #>>46240677 #
23. dominotw ◴[] No.46239225{3}[source]
you have to get premission from director for your presonal phone? wtf
replies(1): >>46239469 #
24. ummonk ◴[] No.46239229{3}[source]
Google tells its employees what products they're allowed to buy for personal use?
replies(1): >>46239950 #
25. gcr ◴[] No.46239259{3}[source]
lots of googlers use BYOD iPhones and the corp suite for this use case is fairly well-supported
replies(1): >>46240218 #
26. p_ing ◴[] No.46239320[source]
> ChatGPT web UI was also like this for the longest time

Copilot Chat has been perfect in this respect. It's currently GPT 5.0, moving to 5.1 over the next month or so, but at least I've never lost an (even old) conversation since those reside in an Exchange mailbox.

replies(1): >>46239940 #
27. testdelacc1 ◴[] No.46239469{4}[source]
For the work phone.
28. ssl-3 ◴[] No.46239474{5}[source]
Maybe so.

I'm only familiar with Ford production and distribution facilities. Those parking lots are broadly full of Fords, but that doesn't mean that it's like this across the board.

replies(1): >>46241530 #
29. azinman2 ◴[] No.46239485{4}[source]
Effectively zero.

Disclosure: I work at Apple. And when I was at Google I was shocked by how many iPhones there were.

replies(1): >>46239961 #
30. spruce_tips ◴[] No.46239585{3}[source]
it took me a lot of tinkering to get this feeling seamless in my own apps that use the api under the hood. i ended up buffering every token into a redis stream (with a final db save at the end of streaming) and building a mechanism to let clients reconnect to the stream on demand. no websocket necessary.

works great for kicking off a request and closing tab or navigating away to another page in my app to do something.

i dont understand why model providers dont build this resilient token streaming into all of their APIs. would be a great feature

replies(1): >>46240622 #
31. deepGem ◴[] No.46239600[source]
There is no competing product for GPT Voice. Hands down. I have tried Claude, Gemini - they don't even comes close.

But voice is not a huge traffic funnel. Text is. And the verdict is more or less unanimous at this time. Gemini 3.0 has outdone ChatGPT. I unsubscribed from GPT plus today. I was a happy camper until the last month when I started noticing deplorable bugs.

1. The conversation contexts are getting intertwined.Two months ago, I could ask multiple random queries in a conversation and I would get correct responses but the last couple of weeks, it's been a harrowing experience having to start a new chat window for almost any change in thread topic. 2. I had asked ChatGPT to once treat me as a co-founder and hash out some ideas. Now for every query - I get a 'cofounder type' response. Nothing inherently wrong but annoying as hell. I can live with the other end of the spectrum in which Claude doesn't remember most of the context.

Now that Gemini pro is out, yes the UI lacks polish, you can lose conversations, but the benefits of low latency search and a one year near free subscription is a clincher. I am out of ChatGPT for now, 5.2 or otherwise. I wish them well.

replies(3): >>46239866 #>>46240011 #>>46240050 #
32. esyir ◴[] No.46239866[source]
Just a note, chatGPT does retain a persistent memory of conversations. In the settings menu, there's a section that allows you to tweak/clear this persistent memory
33. sam345 ◴[] No.46239896[source]
That's inexcusable.
34. Max-Limelihood ◴[] No.46239940{3}[source]
I lost thousands of conversations I'd had back in the move from "Bing" to "Copilot". Moved straight to Claude and never touched a GPT again.
replies(1): >>46240101 #
35. snypher ◴[] No.46239950{4}[source]
Seems like they meant for a work device.
36. snypher ◴[] No.46239960{4}[source]
The CEO of Ford was driving a competition EV for months;

https://www.caranddriver.com/news/a62694325/ford-ceo-jim-far...

37. jimmaswell ◴[] No.46239961{5}[source]
This is flabbergasting, how could such a large proportion of highly technical people willingly subject themselves to being shackled by iOS? They just happily put up with having one choice of browser, (outside Europe) no third party app stores, and being locked into the Apple ecosystem? I can't think of a single reason I would ever switch from an S22-25+U to an iPhone. I only went from 22U to 25U because my old one got smashed, otherwise the 22U would still be perfectly fine.
replies(3): >>46240161 #>>46240255 #>>46240848 #
38. wkat4242 ◴[] No.46240011[source]
What's that near free subscription? I don't see it here
replies(2): >>46240335 #>>46240523 #
39. dkga ◴[] No.46240021{3}[source]
What is a green bubble? iPhone's carbon footprint?
replies(1): >>46240236 #
40. rapind ◴[] No.46240050[source]
I found the gemini cli extremely lacking and even frustrating. Why google would choose node…

Codex is decent and seemed to be improving (being written in rust helps). Claude code is still the king, but my god they have server and throttling issues.

Mixed bag wherever you go. As model progress slows / flatlines (already has?) I’m sure we’ll see a lot more focus and polish on the interfaces.

replies(1): >>46240653 #
41. Duanemclemore ◴[] No.46240101{4}[source]
I downloaded my archive and completely ended my GPT subscription last week based on some bad computer maintenance advice. Same thing here - using other models, never touching that product again.
replies(1): >>46240320 #
42. dumbfounder ◴[] No.46240161{6}[source]
Because it’s better.
replies(2): >>46240498 #>>46240934 #
43. brookst ◴[] No.46240218{4}[source]
Which makes tons of sense because iPhone users are higher CLV than Android users. If Google had to choose between major software defects in Android or iOS, they would focus quality on iOS every time.
44. amluto ◴[] No.46240229[source]
Claude regularly computes a reply for me, then reports an error and loses the reply. I wonder what fraction of Anthropic’s compute gets wasted and redone.
replies(1): >>46241116 #
45. brookst ◴[] No.46240236{4}[source]
iMessage renders other iMessage users as blue bubbles, SMS/RCS as green bubbles.

People who can’t understand that many people actually prefer iOS use this green/blue thing to explain the otherwise incomprehensible (to them) phenomenon of high iOS market share. “Nobody really likes iOS, they just get bullied at school if they don’t use it”.

It’s just “wake up sheeple” dressed up in fake morality.

replies(2): >>46240895 #>>46241172 #
46. brookst ◴[] No.46240255{6}[source]
Because many of them just want to use their phone as a tool, not tinker with it.

Same way many professional airplane mechanics fly commercial rather than building their own plane. Just because your job is in tech doesn’t mean you have to be ultra-haxxor with every single device in your life.

47. topato ◴[] No.46240320{5}[source]
now I kind of HAVE to know... what was the aforementioned bad advice was?! So mysterious!
replies(1): >>46240726 #
48. topato ◴[] No.46240335{3}[source]
yeah, the best Ive seen is like 1.99 for two months, then back to normal pricing....
49. sib ◴[] No.46240374{3}[source]
You'd be wrong (source - worked in the Android org).
50. ◴[] No.46240498{7}[source]
51. deepGem ◴[] No.46240523{3}[source]
They had 9.99 for the first year.
52. rishabhaiover ◴[] No.46240622{4}[source]
exactly. they need to bring in spotify level of caching of streaming music that it just works if you're in a subway. Constant availability should be table stakes for them.
53. hexnuts ◴[] No.46240650[source]
You may be interested in tools like OpenMemory
54. wahnfrieden ◴[] No.46240653{3}[source]
Codex is king
55. ◴[] No.46240654[source]
56. rjzzleep ◴[] No.46240677{3}[source]
I get that the web versions are free, but if you can afford API access, I always recommend using Msty for everything. It's a much better experience.

https://msty.ai/

57. Duanemclemore ◴[] No.46240726{6}[source]
Oh, it was DUMB. I was dumb. I only have myself to blame here. But we all do dumb things sometimes, owning your mistakes keeps you humble, and you asked. So here goes.

I use a modeling software called Rhino on wine on Linux. In the past, there was an incident where I had to copy an obscure dll that couldn't be delivered by wine or winetricks from a working Windows installation to get something to work. I did so and it worked. (As I recall this was a temporary issue, and was patched in the next release of wine.)

I hate the wine standard file picker, it has always been a persistent issue with Rhino3d. So I keep banging my head on trying to get it to either perform better or make a replacement. Every few months I'll get fed up and have a minute to kill, so I'll see if some new approach works. This time, ChatGPT told me to copy two dll's from a working windows installation to the System folder. Having precedent that this can work, I did.

Anyway, it borked startup completely and it took like an hour to recover. What I didn't consider - and I really, really should have - was that these were dll's that were ALREADY IN the system directory, and I was overwriting the good ones with values already reflecting my system with completely foreign ones.

And that's the critical difference - the obscure dll that made the system work that one time was because of something missing. This time was overwriting extant good ones.

But the fact that the LLM even suggested (without special prompting) to do something that I should have realized was a stupid idea with a low chance of success made me very wary of the harm it could cause.

replies(1): >>46240944 #
58. kaashif ◴[] No.46240848{6}[source]
I don't have my phone (a Pixel) because it frees me from shackles or anything like that. It's just a phone. I use the default everything. Works great. I imagine most people with iPhones are the same.
59. ethbr1 ◴[] No.46240895{5}[source]
As someone who switches between platforms somewhat frequently, iOS perpetually feels like people have Stockholm syndrome.

'Oh, that super annoying issue? Yeah, it's been there for years. We just don't do that.'

Fundamentally though, browsing the web on iOS, even with a custom "browser" with adblocking, feels like going back in time 15 years.

60. jimmaswell ◴[] No.46240934{7}[source]
I've tried them out and not a single thing about it was tangibly better IMO. They have no inherent merit above Android except that some see them as a status symbol (which is absurd as my S25U has a higher MSRP than most iPhone models)
replies(1): >>46241034 #
61. me-vs-cat ◴[] No.46240944{7}[source]
> ...using other models, never touching that product again.

> ...that the LLM even suggested (without special prompting) to do something that I should have realized was a stupid idea with a low chance of success...

Since you're using other models instead, do you believe they cannot give similarly stupid ideas?

replies(1): >>46241371 #
62. hamburglar ◴[] No.46241034{8}[source]
My bottom of the barrel iPhone SE is absolutely not a status symbol. It’s just the phone I like best.

The MSRP of your phone does not matter.

63. seg_lol ◴[] No.46241116[source]
Try using a VPN, my ISP was killing connections and claude would randomly reset. Using a VPN fixed the issue.
64. platevoltage ◴[] No.46241172{5}[source]
It wouldn't be an issue if they didn't pick the worst green on earth. "Which green would you like for the carrier text messages Mr. Jobs?" ... "#00FF00 will be fine."
65. Duanemclemore ◴[] No.46241371{8}[source]
I'm under no misimpression they can't. But I have found ChatGPT to be most confident when it f's up. And to suggest the worst ideas most often.

Until you queried I had forgotten to mention that the same day I was trying to work out a Linux system display issue and it very confidently suggested to remove a package and all its dependencies, which would have removed all my video drivers. On reading the output of the autoremove command I pointed out that it had done this, and the model spat out an "apology" and owned up to ** the damage it would have wreaked.

** It can't "apologize" for or "own up" to anything, it can just output those words. So I hope you'll excuse the anthropomorphization.

replies(1): >>46243093 #
66. olyjohn ◴[] No.46241530{6}[source]
GM has dedicated parking lots for employees with GM vehicles. Everybody else parks further away in the lot of shame.
replies(1): >>46242051 #
67. ssl-3 ◴[] No.46242051{7}[source]
Of course.

And I've parked in the lot of shame at a Ford plant, as an outsider, in my GMC work truck -- way over there.

It wasn't so bad. A bit of a hike to go back and get a tool or something, but it was at least paved...unlike the non-union lot I'm familiar with at a P&G facility, which is a gravel lot that takes crossing a busy road to get to, lacks the active security and visibility from the plant that the union lot has, and which is full of tall weeds. At P&G, I half-expect to come back and find my tires slashed.

Anyway, it wasn't barren over there in the not-Ford lot, but it wasn't nearly so populous as the Ford lot was. The Ford-only lot is bigger, and always relatively packed.

It was very clear to me that the lots (all of the lots, in aggregate) were mostly full of Fords.

To bring this all back 'round: It is clear to me that Ford employees broadly (>50%) drive Fords to work at that plant.

---

It isn't clear to me at all that Google Pixel developers don't broadly drive iPhones. As far as I can tell, that status (which is meme-level in its age at this point) is true, and they aren't broadly making daily use of the systems they build.

(And I, for one, can't imagine spending 40 hours a week developing systems that I refuse to use. I have no appreciation for that level of apparent arrogance, and I hope to never be suaded to be that way. I'd like to think that I'd be better-motivated to improve the system than I would be to avoid using it and choose a competitor instead.

I don't shit where I sleep.)

68. siva7 ◴[] No.46242398{3}[source]
that explains why their ios gemini app is so ridiculously bad. in private they probably use iphones and just chatgpt instead.
69. me-vs-cat ◴[] No.46243093{9}[source]
I feel the same about the obsequious "apologies".