←back to thread

421 points sohkamyung | 2 comments | | HN request time: 0.416s | source
Show context
Narciss ◴[] No.45670278[source]
> All participating organizations then generated responses to each question from each of the four AI assistants. This time, we used the free/consumer versions of ChatGPT, Copilot, Perplexity and Gemini. Free versions were chosen to replicate the default (and likely most common) experience for users. Responses were generated in late May and early June 2025.

First of all, none of the SOTA models we're currently using were released in May and early June. Gemini 2.5 came out in June 17, GPT 5 & Claude Opus 4.1 at the beginning of August.

On top of that, to use free models for anything like this is absolutely wild. I use the absolute best models, and the research versions of this whenever I do research. Anything less is inviting disaster.

You have to use the right tools for the right job, and any report that is more than a month old is useless in the AI world at this point in time, beyond a snapshot of how things 'used to be'.

replies(5): >>45670334 #>>45670358 #>>45670859 #>>45670920 #>>45672440 #
1. biophysboy ◴[] No.45670358[source]
If they used a paid version, their study would not represent how most people use AI (with the free version)
replies(1): >>45672817 #
2. Narciss ◴[] No.45672817[source]
But they’re using a free version that’s not even out there anymore. This is my problem - it came out already dated.