(x.ai)

82 points meetpateltech | 1 comment | 20 Sep 25 01:55 UTC

mrklol ◴[20 Sep 25 06:51 UTC] No.45311071[source]▶

The pricing is really good for these benchmark scores. Let's see how it holds up once people start testing it.

NitpickLawyer ◴[20 Sep 25 07:04 UTC] No.45311134[source]▶

If this is the sonoma-dusk model that was in preview on OpenRouter, it's pretty cool. I've tested it on some code reverse engineering tasks, and it is at or above gpt5-mini level while being faster. It works well on tasks up to about 110-130k tokens; past that it gets a case of "get-there-itis" and finishes the task even if not all constraints are met (e.g. it will say "I've solved x/400 tests, the rest can be done later").

replies(3): >>45311329 #>>45311522 #>>45311886 #

1. ◴[20 Sep 25 08:27 UTC] No.45311522[source]▶

>>45311134 #

Grok 4 Fast