Most active commenters

    ←back to thread

    Gemini CLI

    (blog.google)
    1348 points sync | 27 comments | | HN request time: 1.953s | source | bottom
    1. joelm ◴[] No.44379446[source]
    Been using Claude Code (4 Opus) fairly successfully in a large Rust codebase, but sometimes frustrated by it with complex tasks. Tried Gemini CLI today (easy to get working, which was nice) and it was pretty much a failure. It did a notably worse job than Claude at having the Rust code modifications compile successfully.

    However, Gemini at one point output what will probably be the highlight of my day:

    "I have made a complete mess of the code. I will now revert all changes I have made to the codebase and start over."

    What great self-awareness and willingness to scrap the work! :)

    replies(8): >>44379714 #>>44380383 #>>44380768 #>>44380866 #>>44381146 #>>44381754 #>>44383245 #>>44386866 #
    2. ZeroCool2u ◴[] No.44379714[source]
    Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base and because Rust has been very low on uptake internally at Google, especially since they have some really nice C++ tooling, Gemini is comparatively bad at Rust.
    replies(5): >>44380405 #>>44380865 #>>44381697 #>>44382948 #>>44383662 #
    3. joshvm ◴[] No.44380383[source]
    Gemini has some fun failure modes. It gets "frustrated" when changes it makes doesn't work, and replies with oddly human phrases like "Well, that was unexpected" and then happily declares that (I see the issue!) "the final tests will pass" when it's going down a blind alley. It's extremely overconfident by default and much more exclamatory without changing the system prompt. Maybe in training it was taught/figured out that manifesting produces better results?
    replies(1): >>44380662 #
    4. dilap ◴[] No.44380405[source]
    That's interesting. I've tried Gemini 2.5 Pro from time to time because of the rave reviews I've seen, on C# + Unity code, and I've always been disappointed (compared to ChatGPT o3 and o4-high-mini and even Grok). This would support that theory.
    5. jjice ◴[] No.44380662[source]
    It also gets really down on itself, which is pretty funny (and a little scary). Aside from the number of people who've posted online about it wanting to uninstall itself after being filled with shame, I had it get confused on some Node module resolution stuff yesterday and it told me it was deeply sorry for wasting my time and that I didn't deserve to have such a useless assistant.

    Out of curiosity, I told it that I was proud of it for trying and it had a burst of energy again and tried a few more (failing) solution, before going back to it's shameful state.

    Then I just took care of the issue myself.

    replies(1): >>44380849 #
    6. raincole ◴[] No.44380768[source]
    So far I've found Gemini CLI is very good at explaining what existing code does.

    I can't say much about writing new code though.

    7. danielbln ◴[] No.44380849{3}[source]
    After a particular successful Claude Code task I praised it and told it to "let's fucking go!" to which it replied that loved the energy and proceeded to only output energetic caps lock with fire emojis. I know it's all smoke and mirrors (most likely), but I still get a chuckle out of this stuff.
    8. danielbln ◴[] No.44380865[source]
    Interesting, Gemini must be a monster when it comes to Go code then. I gotta try it for that
    replies(2): >>44381448 #>>44384886 #
    9. fpgaminer ◴[] No.44380866[source]
    Claude will do the same start over if things get too bad. At least I've seen it when its edits went haywire and trashed everything.
    10. eknkc ◴[] No.44381146[source]
    Same here. Tried to implement a new feature on one of our apps to test it. It completely screwed things up. Used undefined functions and stuff. After a couple of iterations of error reporting and fixing I gave up.

    Claude did it fine but I was not happy with the code. What Gemini came up with was much better but it could not tie things together at the end.

    replies(1): >>44382163 #
    11. Unroasted6154 ◴[] No.44381448{3}[source]
    There is way more Java and C++ than Go at Google.
    12. thimabi ◴[] No.44381697[source]
    > Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base

    But does Google actually train its models on its internal codebase? Considering that there’s always the risk of the models leaking proprietary information and security architecture details, I hardly believe they would run that risk.

    replies(1): >>44381746 #
    13. kridsdale3 ◴[] No.44381746{3}[source]
    Googler here.

    We have a second, isolated model that has trained on internal code. The public Gemini AFAIK has never seen that content. The lawyers would explode.

    replies(2): >>44381786 #>>44385758 #
    14. skerit ◴[] No.44381754[source]
    I tried it too, it was so bad. I got the same "revert" behaviour after only 15 minutes.
    15. thimabi ◴[] No.44381786{4}[source]
    Oh, you’re right, there are the legal issues as well.

    Just out of curiosity, do you see much difference in quality between the isolated model and the public-facing ones?

    replies(1): >>44381810 #
    16. kridsdale3 ◴[] No.44381810{5}[source]
    We actually only got the “2.5” version of the internal one a few days ago so I don’t have an opinion yet.

    But when I had to choose between “2.0 with Google internal knowledge” and “2.5 that knows nothing” the latter was always superior.

    The bitter lesson indeed.

    replies(1): >>44390334 #
    17. taberiand ◴[] No.44382163[source]
    Sounds like you can use gemini to create the initial code, then have claude review and finalise what gemini comes up with
    18. leoh ◴[] No.44382948[source]
    >Personally my theory is that Gemini benefits from being able to train on Googles massive internal code base and because Rust has been very low on uptake internally at Google, especially since they have some really nice C++ tooling, Gemini is comparatively bad at Rust.

    Were they to train it on their C++ codebase, it would not be effective on account of the fact that they don't use boost or cmake or any major stuff that C++ in the wider world use. It would also suggest that the user make use of all kinds of non-available C++ libraries. So no, they are not training on their own C++ corpus nor would it be particularly useful.

    replies(1): >>44385355 #
    19. noisy_boy ◴[] No.44383245[source]
    I asked it to do a comparatively pedestrian task: write a script to show top 5 google searches.

    First it did the search itself and then added "echo" for each of them - cute

    Then it tried to use pytrends which didn't go anywhere

    Then it tried some other paid service which also didn't go anywhere

    Then it tried some other stuff which also didn't go anywhere

    Finally it gave up and declared failure.

    It will probably be useful as it can do the modify/run loop itself with all the power of Gemini but so far, underwhelming.

    20. data-ottawa ◴[] No.44383662[source]
    Tangental, but I worry that LLMs will cause a great stagnation in programming language evolution, and possibly a bunch of tech.

    I've tried using a few new languages and the LLMs would all swap the code for syntactically similar languages, even after telling them to read the doc pages.

    Whether that's for better or worse I don't know, but it does feel like new languages are genuinely solving hard problems as their raison d'etre.

    replies(1): >>44385738 #
    21. jordanbeiber ◴[] No.44384886{3}[source]
    As go feels like a straight-jacket compared to many other popular languages, it’s probably very suitable for an LLM in general.

    Thinking about it - was this not the idea of go from the start? Nothing fancy to keep non-rocket scientist away from foot-guns, and have everyone produce code that everyone else can understand.

    Diving in to a go project you almost always know what to expect, which is a great thing for a business.

    22. leoh ◴[] No.44385355{3}[source]
    Excuse me why was this downvoted so aggressively??
    replies(1): >>44388969 #
    23. breakingcups ◴[] No.44385738{3}[source]
    Not just that, I think this will happen on multiple levels too. Think de-facto ossified libraries, tools, etc.

    LLMs thrive because they had a wealth of high-quality corpus in the form os Stack Overflow, Github, etc. and ironically their uptake is causing a strangulation of that source of training data.

    24. blurrybird ◴[] No.44385758{4}[source]
    What model do your lawyers run on?
    25. fcoury ◴[] No.44386866[source]
    This was also my exact experience. I was pretty excited because I usually use Gemini Pro 2.5 when Claude Code gets stuck by pasting the whole code and asking questions and it was able to get me out of a few pickles a couple of times.

    Unfortunately the CLI version wasn't able to create coherent code or fix some issues I had in my Rust codebase as well.

    Here's hope that it eventually becomes great.

    26. simianwords ◴[] No.44388969{4}[source]
    How can they train on internal codebase without leaking specifics?
    27. ◴[] No.44390334{6}[source]