Most active commenters
  • (5)
  • dmd(4)
  • tenpoundhammer(3)
  • mmaunder(3)
  • staticman2(3)
  • ssl-3(3)
  • Duanemclemore(3)
  • brookst(3)

←back to thread

GPT-5.2

(openai.com)
1053 points atgctg | 176 comments | | HN request time: 0.938s | source | bottom
1. tenpoundhammer ◴[] No.46236826[source]
I have been using chatGPT a ton over the last months and paying the subscription. Used it for coding, news, stock analysis, daily problems, and a whatever I could think of. I decided to give Gemini a go when version three came out to great reviews. Gemini handles every single one of my uses cases much better and consistently gives better answers. This is especially true for situations were searching the web for current information is important, makes sense that google would be better. Also OCR is phenomenal chatgpt can't read my bad hand writing but Gemini can easily. Only downsides are in the polish department, there are more app bugs and I usually have to leave the happen or the session terminates. There are bugs with uploading photos. The biggest complaint is that all links get inserted into google search and then I have to manipulate them when they should go directly to the chosen website, this has to be some kind of internal org KPI nonsense. Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.
replies(36): >>46236861 #>>46236896 #>>46236956 #>>46236971 #>>46236980 #>>46237123 #>>46237253 #>>46237258 #>>46237321 #>>46237407 #>>46237452 #>>46237531 #>>46237626 #>>46237654 #>>46237786 #>>46237888 #>>46237927 #>>46238237 #>>46238324 #>>46238527 #>>46238546 #>>46238828 #>>46239189 #>>46239400 #>>46239512 #>>46239719 #>>46239767 #>>46239999 #>>46240382 #>>46240656 #>>46240742 #>>46240760 #>>46240763 #>>46241303 #>>46241326 #>>46241523 #
2. LorenDB ◴[] No.46236861[source]
What is it with the Polish always messing up products?

(yes, /s)

replies(2): >>46237324 #>>46238730 #
3. solarkraft ◴[] No.46236896[source]
> Only downsides are in the polish department

What an understatement. It has me thinking „man, fuck this“ on the daily.

Just today it spontaneously lost an entire 20-30 minutes long thread and it was far from the first time. It basically does it any time you interrupt it in any way. It’s straight up data loss.

It’s kind of a typical Google product in that it feels more like a tech demo than a product.

It has theoretically great tech. I particularly like the idea of voice mode, but it’s noticeably glitchy, breaks spontaneously often and keeps asking annoying questions which you can’t make it stop.

replies(10): >>46237173 #>>46237180 #>>46237398 #>>46237664 #>>46237832 #>>46238374 #>>46238810 #>>46239600 #>>46240229 #>>46240650 #
4. lxgr ◴[] No.46236956[source]
Interesting, I had the opposite experience. 5.0 "Thinking" was better than 5.1, but Gemini 3 Pro seems worse than either for web search use cases. It's hallucinating at pretty alarming rates (including making up sources it never actually accessed) for a late 2025 model.

Opus 4.5 has been a step above both for me, but the usage limits are the worst of the three. I'm seriously considering multiple parallel subscriptions at this point.

replies(2): >>46237542 #>>46237605 #
5. bayarearefugee ◴[] No.46236971[source]
This matches my experience pretty closely when it comes to LLM use for coding assistance.

I still find a lot to be annoyed with when it comes to Gemini's UI and its... continuity, I guess is how I would describe it? It feels like it starts breaking apart at the seams a bit in unexpected ways during peak usages including odd context breaks and just general UI problems.

But outside of UI-related complaints, when it is fully operational it performs so much better than ChatGPT for giving actual practical, working answers without having to be so explicit with the prompting that I might as well have just written the code myself.

6. kccqzy ◴[] No.46236980[source]
> The biggest complaint is that all links get inserted into google search and then I have to manipulate them when they should go directly to the chosen website, this has to be some kind of internal org KPI nonsense.

Oh I know this from my time at Google. The actual purpose is to do a quick check for known malware and phishing. Of course these days such things are better dealt with by the browser itself in a privacy preserving way (and indeed that’s the case), so it’s unnecessary to reveal to Google which links are clicked. It’s totally fine to manipulate them to make them go directly to the website.

replies(2): >>46237222 #>>46240351 #
7. UltraSane ◴[] No.46237123[source]
Google has such a huge advantage in the amount of training data with the Google search database and with YouTube and in terms of FLOPS with their TPUs.
8. mnky9800n ◴[] No.46237173[source]
The colab integration is where it shines the most imo.
9. radicaldreamer ◴[] No.46237180[source]
Google’s standard problem is that they don’t even use their own products. Their Pixel and Android team rocks iPhones on the daily, for example.
replies(6): >>46237220 #>>46237510 #>>46238473 #>>46239140 #>>46239896 #>>46241324 #
10. onethought ◴[] No.46237220{3}[source]
I mean there is benefit to understanding competitor well as well?
replies(3): >>46237469 #>>46238131 #>>46238276 #
11. sundarurfriend ◴[] No.46237222[source]
That's interesting, I just today started getting some "Some sites restrict our ability to check links." dialogue in ChatGPT that wanted me to verify that I really wanted to follow the link, with a Learn More link to this page: https://help.openai.com/en/articles/10984597-chatgpt-generat...

So it seems like ChatGPT does this automatically and internally, instead of using an indirect check like this.

12. NickNaraghi ◴[] No.46237253[source]
Straight up Silicon Valley warfare in the HN comment section.
13. dmd ◴[] No.46237258[source]
I consistently have exactly the opposite experience. ChatGPT seems extremely willing to do a huge number of searches, think about them, and then kick off more searches after that thinking, think about it, etc., etc. whereas it seems like Gemini is extremely reluctant to do more than a couple of searches. ChatGPT also is willing to open up PDFs, screenshot them, OCR them and use that as input, whereas Gemini just ignores them.
replies(5): >>46237338 #>>46237556 #>>46237747 #>>46240627 #>>46241115 #
14. hbarka ◴[] No.46237321[source]
I’ve been putting literally the same inputs into both ChatGPT and Gemini and the intuition in answers from Gemini just fits for me. I’m now unwilling to just rely on ChatGPT.

Google, if you can find a way to export chats into NotebookLM, that would be even better than the Projects feature of ChatGPT.

replies(3): >>46237707 #>>46242434 #>>46242448 #
15. petersumskas ◴[] No.46237324[source]
It’s because their thoughts are Roman while they are always Russian to Finnish things.

Kenya believe it!

Anyway, I’m done here. Abyssinia.

16. nullbound ◴[] No.46237338[source]
I will say that it is wild, if not somewhat problematic that two users have such disparate views of seemingly the same product. I say that, but then I remember my own experience just from few days ago. I don't pay for gemini, but I have paid chatgpt sub. I tested both for the same product with seemingly same prompt and subbed chatgpt subjectively beat gemini in terms of scope, options and links with current decent deals.

It seems ( only seems, because I have not gotten around to test it in any systematic way ) that some variables like context and what the model knows about you may actually influence quality ( or lack thereof ) of the response.

replies(10): >>46237530 #>>46237782 #>>46238005 #>>46238426 #>>46238540 #>>46238609 #>>46238817 #>>46238824 #>>46239808 #>>46240331 #
17. sundarurfriend ◴[] No.46237398[source]
ChatGPT web UI was also like this for the longest time, until a few months ago: all sorts of random UI bugs leading either to data loss or misleading UI state. Interrupting still is very flaky there too. And on the mobile app, if you move away from the app while it's taking time to think, its state would somehow desync from the actual backend thinking state, and get stuck randomly; sometimes restarting the app fixes it, sometimes that chat is that unusable from that point on.

And the UI lack of polish shows up freshly every time a new feature lands too - the "branch in new chat" feature is really finicky still, getting stuck in an unusable state if you twitch your eyebrows at wrong moment.

replies(2): >>46239212 #>>46239320 #
18. bossyTeacher ◴[] No.46237407[source]
A future where Google still dominates, is that a future we want? I feel a future with more players is better than one with just a single one. Competition is valuable for us consumers
19. varispeed ◴[] No.46237452[source]
Get Gemini answer and tell ChatGPT this is what my friend said. Then put ChatGPT answer to Claude and so on. It's a cheat code.
replies(2): >>46239694 #>>46241227 #
20. LogicFailsMe ◴[] No.46237469{4}[source]
Outweighed by the value of having to suffer with the moldy fruits of their own labor. That was the only way the Android Facebook app became usable as well.
21. RBerenguel ◴[] No.46237510{3}[source]
I would think this is not true
replies(2): >>46238818 #>>46240374 #
22. dmd ◴[] No.46237530{3}[source]
And I’d really like for Gemini to be as good or better, since I get it for free with my Workspace account, whereas I pay for chatgpt. But every time I try both on a query I’m just blown away by how vastly better chatgpt is, at least for the heavy-on-searching-for-stuff kinds of queries I typically do.
23. AznHisoka ◴[] No.46237531[source]
ChatGPT seems to just randomly pick urls to cite and extract information from.

Google Gemini seems to look at heuristics like whether the author is trustworthy, or an expert in the topic. But more advanced

24. noname120 ◴[] No.46237556[source]
Perplexity Pro with any thinking model blows both out of the water in a fraction of the time, in my experience
25. gs17 ◴[] No.46237605[source]
I've had the same experience with search, especially with it hallucinating results instead of actually finding them. It's really frustrating that you can't force a more in-depth search from the model run by the company most famous for a search engine.
replies(2): >>46239216 #>>46241327 #
26. afro88 ◴[] No.46237626[source]
> I usually have to leave the happen or the session terminates

Assuming you meant "leave the app open", I have the same frustration. One of the nice things about the ChatGPT app is you can fire off a req and do something else. I also find Gemini 3 Pro better for general use, though I'm keen to try 5.2 properly

27. mmaunder ◴[] No.46237654[source]
Then you haven't used Gemini CLI with Gemini 3 hard enough. It's a genius psychopath. The raw IQ that Gemini has is incredible. Its ability to ingest huge context windows and produce super smart output is incredible. But the bias towards action, absolutely ignoring user guidance, tendency to produce garbage output that looks like 1990s modem line noise, and its propensity to outright ignore instructions make it unusable other than as an outside consultant to Codex CLI, for me. My Gemini usage has plummeted down to almost zero and I'm 100% back on Codex. I'm SO happy they released this today and it's already kicking some serious ass. Thanks OpenAI team and congrats.
replies(3): >>46239384 #>>46239549 #>>46239990 #
28. mmaunder ◴[] No.46237664[source]
Yeah I eventually noped out as I said in another comment and am charging hard with Codex and am so happy about 5.2!!
29. LogicFailsMe ◴[] No.46237707[source]
All I want for Christmas is a "No NotebookLM slop" checkbox on youtube.
replies(2): >>46238918 #>>46240165 #
30. staticman2 ◴[] No.46237747[source]
Are you uploading PDFs that already have a text layer?

I don't currently subscribe to Gemini but on A.I. Studio's free offering when I upload a non OCR PDF of around 20 pages the software environment's OCR feeds it to the model with greater accuracy than I've seen from any other source.

replies(1): >>46237784 #
31. martinpw ◴[] No.46237782{3}[source]
> I will say that it is wild, if not somewhat problematic that two users have such disparate views of seemingly the same product.

This happens all the time on HN. Before opening this thread, I was expecting that the top comment would be 100% positive about the product or its competitor, and one of the top replies would be exactly the opposite, and sure enough...

I don't know why it is. It's honestly a bit disappointing that the most upvoted comments often have the least nuance.

replies(2): >>46237859 #>>46238338 #
32. dmd ◴[] No.46237784{3}[source]
I’m not uploading PDFs at all. I’m talking about PDFs it finds while searching than it extracts data from for the conversation.
replies(1): >>46238012 #
33. billyrnalvo ◴[] No.46237786[source]
Oh my good heavens, gotta tell ya, you wrestled that rascal to the floor with a shit-eating grin! Good times my friend!
34. adamkochanowicz ◴[] No.46237832[source]
I also love that I can leave the microphone on (not in live voice mode) while dictating to ChatGPT and pause and think as much as needed.

With Gemini, it will send as soon as I stop to think. No way to disable that.

replies(1): >>46238611 #
35. block_dagger ◴[] No.46237859{4}[source]
Replace "on HN" with "in the course of human events" and we may have a generally true statement ;)
36. didibus ◴[] No.46237888[source]
> Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.

Depends, even though Gemini 3 is a bit better than GPT5.1, the quality of the ChatGPT apps themselves (mobile, web) have kept me a subscriber to it.

I think Google needs to not-google themselves into a poor app experience here, because the models are very close and will probably continue to just pass each other in lock step. So the overall product quality and UX will start to matter more.

Same reason I am sticking to Claude Code for coding.

replies(1): >>46238281 #
37. luhn ◴[] No.46237927[source]
That's hilarious and right on brand for Google that they spend millions developing cutting-edge technology and fumble the ball making a chat app.
replies(1): >>46239182 #
38. Workaccount2 ◴[] No.46238005{3}[source]
Gemini has tons of people using it free via aistudio

I can't help but feel that google gives free requests the absolute lowest priority, greatest quantization, cheapest thinking budget, etc.

I pay for gemini and chatGPT and have been pretty hooked on Gemini 3 since launch.

39. staticman2 ◴[] No.46238012{4}[source]
I'm surprised to hear anyone finds these models trustworthy for research.

Just today I asked Claude what year over year inflation was and it gave me 2023 to 2024.

I also thought some sites ban A.I. crawling so if they have the best source on a topic, you won't get it.

replies(1): >>46240607 #
40. ssl-3 ◴[] No.46238131{4}[source]
There certainly is.

To posit a scenario: I would expect General Motors to buy some Ford vehicles to test and play around with and use. There's always stuff to learn about what the competition has done (whether right, wrong, or indifferent).

But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

replies(2): >>46238792 #>>46239960 #
41. azan_ ◴[] No.46238237[source]
That's interesting. I've got completely different impression. Every time I use Gemini I'm surprised how bad it is. My main complaint is that Gemini is too lazy.
replies(1): >>46238342 #
42. Forgeties79 ◴[] No.46238276{4}[source]
I wonder how many apple employees walk in to the office with android phones
replies(1): >>46239485 #
43. concinds ◴[] No.46238281[source]
The ChatGPT Mac app especially feels much nicer to use. I like Gemini more due to the context window but I doubt Google will ever create a native Mac app.
44. WheatMillington ◴[] No.46238324[source]
I generate fun images for my kids - turn photos into a new style, create colouring pages from pictures, etc. I lost interest in chatGPT because it throws vague TOS errors constantly. Gemini handles all of this without complaint.
replies(1): >>46239524 #
45. stevage ◴[] No.46238338{4}[source]
How much nuance can one person's experience have? If the top two most visible things are detailed, contrary experiences of the same product, that seems a pretty good outcome?
replies(1): >>46243473 #
46. Nathanba ◴[] No.46238342[source]
Same for me, at this point I'm seriously starting to think that these are ads for and by Google because for me Gemini is the worst.
replies(1): >>46240579 #
47. KronisLV ◴[] No.46238374[source]
> It has me thinking „man, fuck this“ on the daily.

That's sometimes me with the CLI. I can't use the Gemini CLI right now on Windows (in the Terminal app), because trying to copy in multiple lines of text for some reason submits them separately and it just breaks the whole thing. OpenCode had the same issue but even worse, it quite after the first line or something and copied the text line by line into the shell, thank fuck I didn't have some text that mentions rm -rf or something.

More info: https://github.com/google-gemini/gemini-cli/issues/14735#iss...

At the same time, neither Codex CLI, nor Claude Code had that issue (and both even showed shortened representations of copied in text, instead of just dumping the whole thing into the input directly, so I could easily keep writing my prompt).

So right now if I want to use Gemini, I more or less have to use something like KiloCode/RooCode/Cline in VSC which are nice, but might miss out on some more specific tools. Which is a shame, because Gemini is a really nice model, especially when it comes to my language, Latvian, but also your run of the mill software dev tasks.

In comparison, Codex feels quite slow, whereas Claude Code is what I gravitate towards most of the time but even Sonnet 4.5 ends up being expensive when you shuffle around millions of tokens: https://news.ycombinator.com/item?id=46216192 Cerebras Code is nice for quick stuff and the sheer amount of tokens, but in KiloCode/... regularly messes up applying diff based edits.

replies(1): >>46240654 #
48. jhancock ◴[] No.46238426{3}[source]
I can use GPT one day and the next get a different experience with the same problem space. Same with Gemini.
replies(2): >>46238643 #>>46238793 #
49. Der_Einzige ◴[] No.46238473{3}[source]
That’s because they will be bullied out of the dating market if they have a “green bubble”.
replies(2): >>46239184 #>>46240021 #
50. Onewildgamer ◴[] No.46238527[source]
Google AI mode constantly does mistakes and I go back to chatgpt even when I don't like it.
51. blks ◴[] No.46238540{3}[source]
Because neither product has any consistency in its results, no predictive behaviour. One day it performs well, another it hallucinates non existing facts and libraries. Those are stochastic machines
replies(1): >>46238992 #
52. FpUser ◴[] No.46238546[source]
I've read many very positive reviews about Gemini 3. I tried using it including Pro and to me it looks very inferior to ChatGPT. What was very interesting though was when I caught it bullshitting me I called its BS and Gemini expressed very human like behavior. It did try to weasel its way out, degenerated down to "true Scotsman" level but finally admitted that it was full of it. this is kind of impressive / scary.
53. crorella ◴[] No.46238609{3}[source]
It’s like having 3 coins and users preferring one or the other when tossing it because one coin gives consistently more heads (or tails) than the other coin.

What is better is to build a good set of rules and stick to one and then refine those rules over time as you get more experience using the tool or if the tool evolves and digress from the results you expect.

replies(1): >>46238668 #
54. wheelerwj ◴[] No.46238611{3}[source]
How did you do this?
replies(1): >>46238726 #
55. 4ndrewl ◴[] No.46238643{4}[source]
This is by design, given a non-determenitisic application?
replies(1): >>46238690 #
56. nullbound ◴[] No.46238668{4}[source]
<< What is better is to build a good set of rules and

But, unless you are on a local model you control, you literally can't. Otherwise, good rules will work only as long as the next update allows. I will admit that makes me consider some other options, but those probably shouldn't be 'set and iterate' each time something changes.

replies(1): >>46240430 #
57. jhancock ◴[] No.46238690{5}[source]
sure. It may be more than that...possibly due to variable operating params on the servers and current load.

On whole, if I compare my AI assistant to a human worker, I get more variance than I would from a human office worker.

replies(2): >>46238899 #>>46239210 #
58. toomuchtodo ◴[] No.46238726{4}[source]
Record button in the app if you’ve got the feature.
59. labrador ◴[] No.46238730[source]
I like their hotdogs
60. GenerWork ◴[] No.46238792{5}[source]
>But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

I think you'd be surprised about the vehicle makeup at Big 3 design facilities.

replies(1): >>46239474 #
61. sjaramillo ◴[] No.46238793{4}[source]
I guess LLMs have a mood too
replies(1): >>46239460 #
62. arjie ◴[] No.46238810[source]
Any time its safety stuff triggers, Gemini wipes the context. It's unusable because of this because whatever is going on with the safety stuff, it fires too often. I'm trying to figure out some code here, not exactly deporting ICE to Guantanamo or whatever.
replies(2): >>46238852 #>>46238902 #
63. rabf ◴[] No.46238817{3}[source]
Chatgpt is not one model! Unless you manually specify to use a particular model your question can be routed to different models depending on what it guesses would be most appropriate for your question.
replies(1): >>46240466 #
64. renewiltord ◴[] No.46238818{4}[source]
Yeah, I've heard that Sundar Pichai dogfoods the latest Pixel at least once a month and sometimes two or three times.
65. nunez ◴[] No.46238824{3}[source]
Tesla FSD has been more or less the same experience. Some people drive 100s of miles without disengaging while others pull the plug within half a mile from their house. A lot of it depends on what the customer is willing to tolerate.
66. dzhiurgis ◴[] No.46238852{3}[source]
On a flip side chatgpt app now has years of history that sometimes useful (search is pretty ok, but could improve) but otherwise I'd like to remove most of it - good luck doing so.
67. pixl97 ◴[] No.46238899{6}[source]
Thats because you don't 'own' the LLM compute. If you instead bought your office workers by the question I'm sure the variability would increase.
68. rvnx ◴[] No.46238902{3}[source]
The more Gemini and Nano-Banana soften their filters, the more audience it will take from other platforms. The main risk is payment providers banning them, I can't imagine bank card providers to remove payments to Google.
69. ◴[] No.46238918{3}[source]
70. sendes ◴[] No.46238992{4}[source]
I see the hyperbole is the point, but surely what these machines do is to literally predict? The entire prompt engineering endeavour is to get them to predict better and more precisely. Of course, these are not perfect solutions - they are stochastic after all, just not unpredictably.
replies(1): >>46240345 #
71. free652 ◴[] No.46239140{3}[source]
You cant buy an iPhone without a director approval. And it's like 3 gen behind as well. So no, they don't use iPhones.
replies(4): >>46239225 #>>46239229 #>>46239259 #>>46242398 #
72. spwa4 ◴[] No.46239182[source]
Every Google app is a chat app, except maybe search.
replies(1): >>46239980 #
73. abhaynayar ◴[] No.46239189[source]
Gemini voice recognition is trash compared to chatgpt and that is a deal breaker for me. I wonder how many ppl do OCR versus use voice.

And how has chatgpt lost when ure not comparing the chatgpt that just came out to the Gemini that just came out? Gemini is just annoying to use.

and Google just benchmaxxed I didn't see any significant difference (paying for both) and the same benchmaxxing probably happening for chatgpt now as well, so in terms of core capabilities I feel stuff has plateaued. more bout overall experience now where Gemini suxx.

I really don't get how "search integration" is a "strength"?? can you give any examples of places where you searched for current info and chatgpt was worse? even so I really don't get how it's a moat enough to say chatgpt has lost. would've understood if you said something like tpu versus GPU moat.

74. astrange ◴[] No.46239210{6}[source]
They're not really capable of producing varying answers based on load.

But they are capable of producing different answers because they feel like behaving differently if the current date is a holiday, and things like that. They're basically just little guys.

75. gcr ◴[] No.46239212{3}[source]
i basically can't use the ChatGPT app on the subway for these reasons. the moment the websocket connection drops, i have to edit my last message and resubmit it unchanged.

it's like the client, not the server, is responsible for writing to my conversation history or something

replies(2): >>46239585 #>>46240677 #
76. astrange ◴[] No.46239216{3}[source]
Try the same question in deep research mode.
77. dominotw ◴[] No.46239225{4}[source]
you have to get premission from director for your presonal phone? wtf
replies(1): >>46239469 #
78. ummonk ◴[] No.46239229{4}[source]
Google tells its employees what products they're allowed to buy for personal use?
replies(1): >>46239950 #
79. gcr ◴[] No.46239259{4}[source]
lots of googlers use BYOD iPhones and the corp suite for this use case is fairly well-supported
replies(1): >>46240218 #
80. p_ing ◴[] No.46239320{3}[source]
> ChatGPT web UI was also like this for the longest time

Copilot Chat has been perfect in this respect. It's currently GPT 5.0, moving to 5.1 over the next month or so, but at least I've never lost an (even old) conversation since those reside in an Exchange mailbox.

replies(1): >>46239940 #
81. Kim_Bruning ◴[] No.46239384[source]
That bias towards action is a real thing in Gemini and more so in ChatGPT, isn't it?

Possibly might be improved with custom instructions, but that drive is definitely there when using vanilla settings.

replies(1): >>46239926 #
82. razster ◴[] No.46239400[source]
Just a fair warning, it likes to spell Acknowledge as Acknolwedge. And I've run into issues when it's accessing markdown guides, it loses track and hallucinates from time to time which is annoying.
83. dr_dshiv ◴[] No.46239460{5}[source]
Vibes
84. testdelacc1 ◴[] No.46239469{5}[source]
For the work phone.
85. ssl-3 ◴[] No.46239474{6}[source]
Maybe so.

I'm only familiar with Ford production and distribution facilities. Those parking lots are broadly full of Fords, but that doesn't mean that it's like this across the board.

replies(1): >>46241530 #
86. azinman2 ◴[] No.46239485{5}[source]
Effectively zero.

Disclosure: I work at Apple. And when I was at Google I was shocked by how many iPhones there were.

replies(1): >>46239961 #
87. xyzsparetimexyz ◴[] No.46239512[source]
Why do people pay for ai tools? I didn't get that. I feel like I just rotate between them on the free tiers. Unless you're paying for all of them, what's the point?
replies(1): >>46240600 #
88. xyzsparetimexyz ◴[] No.46239524[source]
You feed ai slop to your children? That doesn't seem unhealthy and bad for their development?
replies(2): >>46240738 #>>46241154 #
89. prodigycorp ◴[] No.46239549[source]
Genius psychopath is a good description for Gemini. It’s the most impressive model but post training is not all there.
90. spruce_tips ◴[] No.46239585{4}[source]
it took me a lot of tinkering to get this feeling seamless in my own apps that use the api under the hood. i ended up buffering every token into a redis stream (with a final db save at the end of streaming) and building a mechanism to let clients reconnect to the stream on demand. no websocket necessary.

works great for kicking off a request and closing tab or navigating away to another page in my app to do something.

i dont understand why model providers dont build this resilient token streaming into all of their APIs. would be a great feature

replies(1): >>46240622 #
91. deepGem ◴[] No.46239600[source]
There is no competing product for GPT Voice. Hands down. I have tried Claude, Gemini - they don't even comes close.

But voice is not a huge traffic funnel. Text is. And the verdict is more or less unanimous at this time. Gemini 3.0 has outdone ChatGPT. I unsubscribed from GPT plus today. I was a happy camper until the last month when I started noticing deplorable bugs.

1. The conversation contexts are getting intertwined.Two months ago, I could ask multiple random queries in a conversation and I would get correct responses but the last couple of weeks, it's been a harrowing experience having to start a new chat window for almost any change in thread topic. 2. I had asked ChatGPT to once treat me as a co-founder and hash out some ideas. Now for every query - I get a 'cofounder type' response. Nothing inherently wrong but annoying as hell. I can live with the other end of the spectrum in which Claude doesn't remember most of the context.

Now that Gemini pro is out, yes the UI lacks polish, you can lose conversations, but the benefits of low latency search and a one year near free subscription is a clincher. I am out of ChatGPT for now, 5.2 or otherwise. I wish them well.

replies(3): >>46239866 #>>46240011 #>>46240050 #
92. clhodapp ◴[] No.46239694[source]
A cheat code to what?
replies(1): >>46241206 #
93. Daz912 ◴[] No.46239719[source]
No desktop app, not using it
replies(1): >>46239982 #
94. ◴[] No.46239767[source]
95. Bombthecat ◴[] No.46239808{3}[source]
Could also be a language thing ...
96. esyir ◴[] No.46239866{3}[source]
Just a note, chatGPT does retain a persistent memory of conversations. In the settings menu, there's a section that allows you to tweak/clear this persistent memory
97. sam345 ◴[] No.46239896{3}[source]
That's inexcusable.
98. mmaunder ◴[] No.46239926{3}[source]
Yeah it's a weird mix of issues with the backend model and issues with the CLI client and its prompts. What makes it hard for them is the teams aren't talking to each other. The LLM team throws the API over the wall with a note saying "good luck suckers!".
99. Max-Limelihood ◴[] No.46239940{4}[source]
I lost thousands of conversations I'd had back in the move from "Bing" to "Copilot". Moved straight to Claude and never touched a GPT again.
replies(1): >>46240101 #
100. snypher ◴[] No.46239950{5}[source]
Seems like they meant for a work device.
101. snypher ◴[] No.46239960{5}[source]
The CEO of Ford was driving a competition EV for months;

https://www.caranddriver.com/news/a62694325/ford-ceo-jim-far...

102. jimmaswell ◴[] No.46239961{6}[source]
This is flabbergasting, how could such a large proportion of highly technical people willingly subject themselves to being shackled by iOS? They just happily put up with having one choice of browser, (outside Europe) no third party app stores, and being locked into the Apple ecosystem? I can't think of a single reason I would ever switch from an S22-25+U to an iPhone. I only went from 22U to 25U because my old one got smashed, otherwise the 22U would still be perfectly fine.
replies(3): >>46240161 #>>46240255 #>>46240848 #
103. dieortin ◴[] No.46239980{3}[source]
Is Google Drive a chat app? Is Google Photos a drive app? I don’t know what you mean
replies(2): >>46240458 #>>46242080 #
104. eru ◴[] No.46239982[source]
HN doesn't have a dedicated desktop app either.
replies(1): >>46241553 #
105. tobias2014 ◴[] No.46239990[source]
I guess when you use it for generic "problem solving", brainstorming for solutions, this is great. That's what I use it for, and Gemini is my favorite model. I love when Gemini resists and suggests that I am wrong while explaining why. Either it's true, and I'm happy for that, or I can re-prompt based on the new information which doesn't allow for the mistake Gemini made.

On the other hand, I can also see why Claude is great for coding, for example. By default it is much more "structured". One can probably change these default personalities with some prompting, and many of the complaints found in this thread about either side are based on the assumption that you can use the same prompt for all models.

106. Razengan ◴[] No.46239999[source]
It would be useful to see some examples of the differences and supposed strengths of Gemini so this doesn't come off as Google advertisement snarf.

Also, I would never, ever, trust Google for privacy or sign into a Google account except on YouTube (and clear cookies afterwards to stop them from signing me into fucking Search too).

107. wkat4242 ◴[] No.46240011{3}[source]
What's that near free subscription? I don't see it here
replies(2): >>46240335 #>>46240523 #
108. dkga ◴[] No.46240021{4}[source]
What is a green bubble? iPhone's carbon footprint?
replies(1): >>46240236 #
109. rapind ◴[] No.46240050{3}[source]
I found the gemini cli extremely lacking and even frustrating. Why google would choose node…

Codex is decent and seemed to be improving (being written in rust helps). Claude code is still the king, but my god they have server and throttling issues.

Mixed bag wherever you go. As model progress slows / flatlines (already has?) I’m sure we’ll see a lot more focus and polish on the interfaces.

replies(1): >>46240653 #
110. Duanemclemore ◴[] No.46240101{5}[source]
I downloaded my archive and completely ended my GPT subscription last week based on some bad computer maintenance advice. Same thing here - using other models, never touching that product again.
replies(1): >>46240320 #
111. dumbfounder ◴[] No.46240161{7}[source]
Because it’s better.
replies(2): >>46240498 #>>46240934 #
112. simplify ◴[] No.46240165{3}[source]
Youtube's downvote button has served me quite well for this purpose.
113. brookst ◴[] No.46240218{5}[source]
Which makes tons of sense because iPhone users are higher CLV than Android users. If Google had to choose between major software defects in Android or iOS, they would focus quality on iOS every time.
114. amluto ◴[] No.46240229[source]
Claude regularly computes a reply for me, then reports an error and loses the reply. I wonder what fraction of Anthropic’s compute gets wasted and redone.
replies(1): >>46241116 #
115. brookst ◴[] No.46240236{5}[source]
iMessage renders other iMessage users as blue bubbles, SMS/RCS as green bubbles.

People who can’t understand that many people actually prefer iOS use this green/blue thing to explain the otherwise incomprehensible (to them) phenomenon of high iOS market share. “Nobody really likes iOS, they just get bullied at school if they don’t use it”.

It’s just “wake up sheeple” dressed up in fake morality.

replies(2): >>46240895 #>>46241172 #
116. brookst ◴[] No.46240255{7}[source]
Because many of them just want to use their phone as a tool, not tinker with it.

Same way many professional airplane mechanics fly commercial rather than building their own plane. Just because your job is in tech doesn’t mean you have to be ultra-haxxor with every single device in your life.

117. topato ◴[] No.46240320{6}[source]
now I kind of HAVE to know... what was the aforementioned bad advice was?! So mysterious!
replies(1): >>46240726 #
118. austhrow743 ◴[] No.46240331{3}[source]
We've been having trouble telling if people are using the same product ever since Chat GPT first got popular. The had a free model and a paid model, that was it, no other competitors or naming schemes to worry about, and discussions were still full of people talking about current capabilities without saying what model they were using.

For me, "gemini" currently means using this model in the llm.datasette.io cli tool.

openrouter/google/gemini-3-pro-preview

For what anyone else means? If they're equivalent? If Google does something different when you use "Gemini 3" in their browser app vs their cli app vs plans vs api users vs third party api users? No idea to any of the above.

I hate naming in the llm space.

replies(1): >>46243334 #
119. topato ◴[] No.46240335{4}[source]
yeah, the best Ive seen is like 1.99 for two months, then back to normal pricing....
120. coliveira ◴[] No.46240345{5}[source]
Prompt engineering is voodoo. There's no sure way to determine how well these models will respond to a question. Of course, giving additional information may be helpful, but even that is not guaranteed.
replies(2): >>46241557 #>>46241667 #
121. gjuggler ◴[] No.46240351[source]
I think Gemini is just broken.

Instead of forwarding model-generated links to https://www.google.com/url?q=[URL], which serves the purpose of malware check and user-facing warning about linking to an external site, Gemini forwards links to https://www.google.com/search?q=[URL], which does... a Google search for the URL, which isn't helpful at all.

Example: https://gemini.google.com/share/3c45f1acdc17

NotebookLM by comparison, does the right thing: https://notebooklm.google.com/notebook/7078d629-4b35-4894-bb...

It's kind of impressive how long this obviously-broken link experience has been sitting in the Gemini app used by millions.

122. sib ◴[] No.46240374{4}[source]
You'd be wrong (source - worked in the Android org).
123. melagonster ◴[] No.46240382[source]
It happened at least once; when I asked too many questions, the Gemini web page stopped working because it was occupying too much RAM...
124. crorella ◴[] No.46240430{5}[source]
what I had in mind when I added that comment was for coding, with the use of .md files. For the web version of chats I agree there is little control on how to tailor the way you want the agent to behave, unless you give a initial "setup" prompt.
125. minitoar ◴[] No.46240458{4}[source]
In Google Photos shared albums there is a tab that I can only describe as a chatroom.
126. stingraycharles ◴[] No.46240466{4}[source]
Isn’t that just standard MoE behavior? And isn’t the only choice you have from the UI between “Instant” and “Thinking”?
replies(1): >>46241680 #
127. ◴[] No.46240498{8}[source]
128. deepGem ◴[] No.46240523{4}[source]
They had 9.99 for the first year.
129. WillPostForFood ◴[] No.46240579{3}[source]
My experience is that "AI Mode" Gemini in Chrome is terrible, but AI Studio Gemini is pretty great.
130. Zambyte ◴[] No.46240600[source]
I pay for Kagi and get all of the major ones, a great search engine that I can tune to my liking, and the ability to link any model to my tuned web search.
131. Workaccount2 ◴[] No.46240607{5}[source]
Anytime you use LLMs you should be keenly aware of their knowledge cutoff. Like any other tool, the more you understand it, the better it works.
replies(1): >>46243873 #
132. rishabhaiover ◴[] No.46240622{5}[source]
exactly. they need to bring in spotify level of caching of streaming music that it just works if you're in a subway. Constant availability should be table stakes for them.
133. ghostpepper ◴[] No.46240627[source]
Same, I use chatgpt plus (the entry-level paid option) extensively for personal research projects and coding, and it seems miles ahead of whatever "Gemini Pro" is that I have through work. Twice yesterday, gemini repeated verbatim a previous response as if I hadn't asked another question and told it why the previous response was bad. Gemini feels like chatGPT from two years ago.
134. hexnuts ◴[] No.46240650[source]
You may be interested in tools like OpenMemory
135. wahnfrieden ◴[] No.46240653{4}[source]
Codex is king
136. ◴[] No.46240654{3}[source]
137. a_victorp ◴[] No.46240656[source]
I see a post like this every time there are news about ChatGPT or OpenAI. I'm probably being paranoid but I keep thinking that it looks like bots or paid advertisement for Gemini
replies(2): >>46241014 #>>46241207 #
138. rjzzleep ◴[] No.46240677{4}[source]
I get that the web versions are free, but if you can afford API access, I always recommend using Msty for everything. It's a much better experience.

https://msty.ai/

139. Duanemclemore ◴[] No.46240726{7}[source]
Oh, it was DUMB. I was dumb. I only have myself to blame here. But we all do dumb things sometimes, owning your mistakes keeps you humble, and you asked. So here goes.

I use a modeling software called Rhino on wine on Linux. In the past, there was an incident where I had to copy an obscure dll that couldn't be delivered by wine or winetricks from a working Windows installation to get something to work. I did so and it worked. (As I recall this was a temporary issue, and was patched in the next release of wine.)

I hate the wine standard file picker, it has always been a persistent issue with Rhino3d. So I keep banging my head on trying to get it to either perform better or make a replacement. Every few months I'll get fed up and have a minute to kill, so I'll see if some new approach works. This time, ChatGPT told me to copy two dll's from a working windows installation to the System folder. Having precedent that this can work, I did.

Anyway, it borked startup completely and it took like an hour to recover. What I didn't consider - and I really, really should have - was that these were dll's that were ALREADY IN the system directory, and I was overwriting the good ones with values already reflecting my system with completely foreign ones.

And that's the critical difference - the obscure dll that made the system work that one time was because of something missing. This time was overwriting extant good ones.

But the fact that the LLM even suggested (without special prompting) to do something that I should have realized was a stupid idea with a low chance of success made me very wary of the harm it could cause.

replies(1): >>46240944 #
140. bonesss ◴[] No.46240738{3}[source]
Customized, self-guided, tailor made kids content isn’t slop per se.

Colouring pages autogenerated for small kids is about as dangerous as the crayons involved.

Not slop, not unhealthy, not bad.

141. bckr ◴[] No.46240742[source]
Gemini is good at reading bad handwriting you say? Might need to give it a shot at my 10 years of journals
142. m00dy ◴[] No.46240760[source]
it's true that Gemini-3 pro is very good, I recently used it on deepwalker [0]. Its agentic performance is amazing. Much better than 5.1

[0]: https://deepwalker.xyz

143. citizenpaul ◴[] No.46240763[source]
What?? Am I using the same gemini as everyone else?

>OCR is phenomenal

I literally tried to OCR a TYPED document in Gemini today and it mangled it so bad I just transcribed it myself because it would take less time than futzing around with gemini.

> Gemini handles every single one of my uses cases much better and consistently gives better answers.

>coding

I asked it to update a script by removing some redundant logic yesterday. Instead of removing it it just put == all over the place essentially negating but leaving all the code and also removing the actual output.

>Stocks analysis

lol, now I know where my money comes from.

replies(1): >>46241152 #
144. kaashif ◴[] No.46240848{7}[source]
I don't have my phone (a Pixel) because it frees me from shackles or anything like that. It's just a phone. I use the default everything. Works great. I imagine most people with iPhones are the same.
145. ethbr1 ◴[] No.46240895{6}[source]
As someone who switches between platforms somewhat frequently, iOS perpetually feels like people have Stockholm syndrome.

'Oh, that super annoying issue? Yeah, it's been there for years. We just don't do that.'

Fundamentally though, browsing the web on iOS, even with a custom "browser" with adblocking, feels like going back in time 15 years.

146. jimmaswell ◴[] No.46240934{8}[source]
I've tried them out and not a single thing about it was tangibly better IMO. They have no inherent merit above Android except that some see them as a status symbol (which is absurd as my S25U has a higher MSRP than most iPhone models)
replies(1): >>46241034 #
147. me-vs-cat ◴[] No.46240944{8}[source]
> ...using other models, never touching that product again.

> ...that the LLM even suggested (without special prompting) to do something that I should have realized was a stupid idea with a low chance of success...

Since you're using other models instead, do you believe they cannot give similarly stupid ideas?

replies(1): >>46241371 #
148. jdiff ◴[] No.46241014[source]
The consistent side comments about the interface to Gemini being "half baked" probably doesn't fit into that narrative.
149. hamburglar ◴[] No.46241034{9}[source]
My bottom of the barrel iPhone SE is absolutely not a status symbol. It’s just the phone I like best.

The MSRP of your phone does not matter.

150. whazor ◴[] No.46241115[source]
I agree with you. To me, gemini has much worse search results. Then again, I use kagi for search and I cannot stand the search results from Google anymore. And its clear that gemini uses those.

In contrast, chatgpt has built their own search engine that performs better in my experience. Except for coding, then I opt for Claude opus 4.5.

151. seg_lol ◴[] No.46241116{3}[source]
Try using a VPN, my ISP was killing connections and claude would randomly reset. Using a VPN fixed the issue.
152. aix1 ◴[] No.46241152[source]
Was that with Gemini 3 Pro or a different Gemini model?
153. retsibsi ◴[] No.46241154{3}[source]
What's your specific concern here? I certainly wouldn't want to, e.g., give young kids unmonitored use of an LLM, or replace their books with AI-generated text, or stop directly engaging with their games and stories and outsource that to ChatGPT. But what part of "generate fun images for my kids - turn photos into a new style, create colouring pages from pictures, etc" is likely to be "unhealthy and bad for their development"?
154. platevoltage ◴[] No.46241172{6}[source]
It wouldn't be an issue if they didn't pick the worst green on earth. "Which green would you like for the carrier text messages Mr. Jobs?" ... "#00FF00 will be fine."
155. Iwan-Zotow ◴[] No.46241206{3}[source]
To get a Hitler
156. tenpoundhammer ◴[] No.46241207[source]
I think people like me just enjoying sharing when something is working for them and they have a good experience. It probably gets voted up because people enjoy reading when that happens
157. tenpoundhammer ◴[] No.46241227[source]
I did this today it was amazing. If I would have had time I would try other models as well. Great tip thanks
158. jmstfv ◴[] No.46241303[source]
Ditto but for Claude -- blows GPT out of the water. Much better in coding and solving physics problems from the images (in foreign languages). GPT couldn't even read the image. The only annoying thing is that if you use Opus for coding, your usage will fill up pretty fast.

anyway, cancelled my chatgpt subscription.

159. anonnon ◴[] No.46241326[source]
Could you elaborate on GPT-based stock analysis?
160. Duanemclemore ◴[] No.46241371{9}[source]
I'm under no misimpression they can't. But I have found ChatGPT to be most confident when it f's up. And to suggest the worst ideas most often.

Until you queried I had forgotten to mention that the same day I was trying to work out a Linux system display issue and it very confidently suggested to remove a package and all its dependencies, which would have removed all my video drivers. On reading the output of the autoremove command I pointed out that it had done this, and the model spat out an "apology" and owned up to ** the damage it would have wreaked.

** It can't "apologize" for or "own up" to anything, it can just output those words. So I hope you'll excuse the anthropomorphization.

replies(1): >>46243093 #
161. jnordt ◴[] No.46241523[source]
Can you share some examples of this where it gives better results?

For me both Gemini and ChatGPT (both paid versions Key in Gemini and ChatGPT Plus) give me similiar results in terms of "every day" research. Im sticking with ChatGPT at the moment, as the UI and scaffolding around the model is in my view better at ChatGpt (e.g. you can add more than one picture at once...)

For Software Development, I tested Gemini3 and I was pretty disappointed in comparison to Claude Opus CLI, which is my daily driver.

162. olyjohn ◴[] No.46241530{7}[source]
GM has dedicated parking lots for employees with GM vehicles. Everybody else parks further away in the lot of shame.
replies(1): >>46242051 #
163. Daz912 ◴[] No.46241553{3}[source]
HN isn't part of my daily workflow so I dont care
164. lossyalgo ◴[] No.46241557{6}[source]
Also every model update changes how you have to prompt them to get the answers you want. Setting up pre-prompts can help, but with each new version, you have to figure out through trial and error how to get it to respond to your type of queries.

I can't wait to see how bad my finally sort-of-working ChatGPT 5.1 pre-prompts work with 5.2.

Edit: How to talk to these models is actually documented, but you have to read through huge documents: https://cdn.openai.com/gpt-5-system-card.pdf

165. baq ◴[] No.46241667{6}[source]
It definitely isn’t voodoo, it’s more like forecasting weather. Some forecasts are easier to make, some are harder (it’ll be cold when it’s winter vs the exact location and wind speed of a tornado for an extreme example). The difference is you can try to mix things up in the prompt to maximize the likelihood of getting what you want out and there are feasibility thresholds for use cases, e.g. if you get a good answer 95% of the time it’s qualitatively different than 55%.
replies(1): >>46243972 #
166. baq ◴[] No.46241680{5}[source]
MoE is a single model thing, model routing happens earlier.
167. ssl-3 ◴[] No.46242051{8}[source]
Of course.

And I've parked in the lot of shame at a Ford plant, as an outsider, in my GMC work truck -- way over there.

It wasn't so bad. A bit of a hike to go back and get a tool or something, but it was at least paved...unlike the non-union lot I'm familiar with at a P&G facility, which is a gravel lot that takes crossing a busy road to get to, lacks the active security and visibility from the plant that the union lot has, and which is full of tall weeds. At P&G, I half-expect to come back and find my tires slashed.

Anyway, it wasn't barren over there in the not-Ford lot, but it wasn't nearly so populous as the Ford lot was. The Ford-only lot is bigger, and always relatively packed.

It was very clear to me that the lots (all of the lots, in aggregate) were mostly full of Fords.

To bring this all back 'round: It is clear to me that Ford employees broadly (>50%) drive Fords to work at that plant.

---

It isn't clear to me at all that Google Pixel developers don't broadly drive iPhones. As far as I can tell, that status (which is meme-level in its age at this point) is true, and they aren't broadly making daily use of the systems they build.

(And I, for one, can't imagine spending 40 hours a week developing systems that I refuse to use. I have no appreciation for that level of apparent arrogance, and I hope to never be suaded to be that way. I'd like to think that I'd be better-motivated to improve the system than I would be to avoid using it and choose a competitor instead.

I don't shit where I sleep.)

168. spwa4 ◴[] No.46242080{4}[source]
Once you open a file, it is very much a chat app. Comments and chat work for anything you can preview btw, not just Google Docs stuff.

Not sure how you can access the chat in the directory view.

169. siva7 ◴[] No.46242398{4}[source]
that explains why their ios gemini app is so ridiculously bad. in private they probably use iphones and just chatgpt instead.
170. ◴[] No.46242434[source]
171. siva7 ◴[] No.46242448[source]
notebooklm is heavily biased to only use the sources i added and frame every task around them - even if it is nonsensical - so it is not that useful for novel research. it also tends to hallucinate when lots of data is involved.
172. me-vs-cat ◴[] No.46243093{10}[source]
I feel the same about the obsequious "apologies".
173. dmd ◴[] No.46243334{4}[source]
FWIW i’m always using 5.1 Thinking.
174. AznHisoka ◴[] No.46243473{5}[source]
Also, why introduce nuance for the sake of nuance? For every single use case, Gemini (and Claude) has performed better. I can’t give ChatGPT even the slightest credit when it doesnt deserve any
175. staticman2 ◴[] No.46243873{6}[source]
I'm sorry but I don't see what "knowledge cutoff" has to do with what we were talking about- which is using a LLM find PDFs and other sources for research.
176. coliveira ◴[] No.46243972{7}[source]
No, it's not. Nowadays we know how to predict the weather with great confidence. Prompting may get you different results each time. Moreover, LLMs depend on the context of your prompts (because of their memory), so a single prompt may be close to useless and two different people can get vastly different results.