←back to thread

397 points Anon84 | 7 comments | | HN request time: 0s | source | bottom
Show context
Alifatisk ◴[] No.45126512[source]
I use Qwen Chat as my daily, it’s good and does the job very well. I have never thought about trying out Mistral, is their model good at anything? Any area it excels at? Or is it far behind all the other models?
replies(2): >>45126743 #>>45128654 #
1. epolanski ◴[] No.45126743[source]
Qwen is good for reasoning/feedback/creative work. I use it a lot when reviewing documentation, prose and some code, it's the one I like the most.

But when it comes to researching information is consistently among the worst performers in my comparisons.

Generally the order is Opus 4.1 > Perplexity > Gemini Pro >> GPT 5 >> Qwen.

replies(2): >>45126858 #>>45129746 #
2. powerapple ◴[] No.45126858[source]
By research you mean web search/deep research or only use knowledge embedded in llm? I use ChatGPT most of time, didn't find Claude work better for me, maybe I should switch if there is a big gap in performance.
replies(1): >>45128642 #
3. epolanski ◴[] No.45128642[source]
Deep research, I don't trust LLMs with anything really.
4. Alifatisk ◴[] No.45129746[source]
I got Perplexity Pro one year for free, what makes Opus 4.1 search so good? Is it worth switching?
replies(1): >>45130853 #
5. epolanski ◴[] No.45130853[source]
I don't know whether I can give you a conclusive answer, due to how query-dependent the results are and the lack of determinism.

I really like perplexity and if you get it free it's hard to justify spending 100$/month.

replies(1): >>45131436 #
6. Alifatisk ◴[] No.45131436{3}[source]
I understand, I thought maybe there was a couple of cases you stumbled upon that Opus 4.1 was better at, which made you rank it higher than Perplexity!
replies(1): >>45135881 #
7. epolanski ◴[] No.45135881{4}[source]
Yes this happened.

Few recent examples:

- find me public companies listed in Poland that have the best z-index of dividend yield, payout ratio and earnings growth

- list the most important psychological tendencies and biases that crew resource management tries to address