
504 points Terretta | 1 comments
NitpickLawyer ◴[] No.45066063[source]
Tested this yesterday with Cline. It's fast, works well with agentic flows, and produces decent code. No idea why this thread is so negative (also got flagged while I was typing this?) but it's a decent model. I'd say it's at or above gpt5-mini level, which is awesome in my book (I've been maining gpt5-mini for a few weeks now, does the job on a budget).

Things I noted:

- It's fast. I tested it in EU tz, so ymmv

- It does agentic in an interesting way. Instead of editing a file whole or in many places, it does many small passes.

- Had a feature take ~110k tokens (parsing html w/ bs4). Still finished the task. Didn't notice any problems at high context.

- When things didn't work first try, it created a new file to test, did all the mocking / testing there, and then once it worked edited the main module file. Nice. GPT5-mini would oftentimes edit working files, and then get confused and fail the task.

All in all, not bad. At the price point it's at, I could see it as a daily driver. Even agentic stuff w/ opus + gpt5 high as planners and this thing as an implementer. It's fast enough that it might be worth setting it up in parallel and basically replicate pass@x from research.
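
For anyone unfamiliar with the pass@x idea mentioned above: in code-generation research, pass@k is the chance that at least one of k sampled attempts solves the task. A minimal sketch of the commonly used unbiased estimator follows, assuming you ran n independent attempts and c of them passed your tests. The function name and arguments are illustrative, not part of any tool discussed in this thread.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n attempts, c of which passed.

    Computes 1 - C(n - c, k) / C(n, k): the probability that a random
    subset of k attempts (out of the n that were run) contains at least
    one passing attempt.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k attempts failed,
    # so the estimate is exactly 1.0 in that case.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # Hypothetical numbers: 8 parallel runs, 2 passed, keep the best of 4.
    print(pass_at_k(n=8, c=2, k=4))
```

The practical point for a cheap, fast implementer model: if single-run success is mediocre, running several attempts in parallel and keeping any that pass the tests can raise the effective success rate considerably, at a cost that scales linearly with the number of attempts.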

IMO it's good to have options at every level. Having many providers fight for the market is good, it keeps them on their toes, and brings prices down. GPT5-mini is at 2$/MTok, this is at 1.5$/MTok. This is basically "free", in the grand scheme of things. I don't get the negativity.

replies(10): >>45066728 #>>45067116 #>>45067311 #>>45067436 #>>45067602 #>>45067936 #>>45068543 #>>45068653 #>>45068788 #>>45074597 #
jameshart ◴[] No.45067602[source]
If the Grok brand wasn’t terminally tarnished for you by the ‘mechahitler’ incident, I’m not sure what more it would take.

This is an offering being produced by a company whose idea of responsible AI use involves prompting a chatbot that “You spend a lot of time on 4chan, watching InfoWars videos” - https://www.404media.co/grok-exposes-underlying-prompts-for-...

A lot of people rightly don’t want any such thing anywhere near their code.

replies(16): >>45067741 #>>45067793 #>>45067834 #>>45067845 #>>45067876 #>>45067950 #>>45068178 #>>45068224 #>>45068385 #>>45068645 #>>45068805 #>>45068858 #>>45069087 #>>45069800 #>>45070448 #>>45071147 #
jwr[dead post] ◴[] No.45068178[source]
[flagged]
signatoremo ◴[] No.45070097[source]
What is your stand on using Chinese models? They censor Tiananmen Square protests, they censor Tibet ethnic cleansing, they censor any opinion against China’s role in Khmer Rouge’s mass killings. Do you boycott DeepSeek or Qwen? Or you consider those actions not evil enough compared to Elon’s?
replies(2): >>45074507 #>>45091001 #
1. jascha_eng ◴[] No.45074507[source]
Those censorships are government enforced and don't necessarily allow conclusions about how the company developing the models thinks; they are just following local laws. If Anthropic was a Chinese company, Claude would do the same.

Do I think it's problematic? Yes, but I don't blame the company or their leadership for it. For Grok and xAI you can very much be skeptical about the team behind it for its actions.