
397 points Anon84 | 2 comments
barrell ◴[] No.45126116[source]
I recently upgraded a large portion of my pipeline from gpt-4.1-mini to gpt-5-mini. The performance was horrible - after some research I decided to move everything to mistral-medium-0525.

Same price, but dramatically better results, far more reliable, and 10x faster. The only downside is that when it does fail, it fails much harder. Where gpt-5-mini would disregard the formatting in the prompt 70% of the time, mistral-medium follows it 99% of the time; the other 1%, it inserts random characters (usually backticks, for whatever reason), which then cause their own formatting issues.

Still, very happy with Mistral so far!

replies(11): >>45126199 #>>45126266 #>>45126479 #>>45126528 #>>45126707 #>>45126741 #>>45126840 #>>45127790 #>>45129028 #>>45130298 #>>45136002 #
mark_l_watson ◴[] No.45126266[source]
It is such a common pattern for LLMs to surround generated JSON with ```json … ``` that I check for this at the application level and fix it. Ten years ago I would do the same sort of sanity checks on formatting when I used LSTMs to generate synthetic data.
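That application-level fix can be a small sanity-check helper. A minimal sketch (the function name and regex are illustrative, not from any particular library) that strips a surrounding Markdown fence before parsing:

```python
import json
import re

def strip_code_fence(text: str) -> str:
    """Remove a surrounding ```json ... ``` (or bare ```) fence, if present."""
    match = re.match(r"^\s*```(?:json)?\s*\n?(.*?)\n?\s*```\s*$", text, re.DOTALL)
    return match.group(1) if match else text.strip()

# Model output wrapped in a fence still parses cleanly:
raw = '```json\n{"name": "widget", "qty": 3}\n```'
data = json.loads(strip_code_fence(raw))
```

Unfenced output passes through unchanged, so the helper is safe to apply to every response.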
replies(9): >>45126463 #>>45126482 #>>45126489 #>>45126578 #>>45127374 #>>45127884 #>>45127900 #>>45128015 #>>45128042 #
fumeux_fume ◴[] No.45127884[source]
Very common struggle, but a good way to prevent it is prefilling the assistant response with "{", or with as much of the JSON output as you know ahead of time, e.g. '{"response": ['.
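The mechanics of prefilling can be sketched as follows. Assuming an API that lets you seed the assistant turn (e.g. Anthropic's Messages API supports a trailing assistant message), the final text is the prefill plus the model's continuation; `fake_model_completion` below is a stand-in for the real LLM call:

```python
import json

PREFILL = '{"response": ['

def fake_model_completion(prompt: str, prefill: str) -> str:
    # Stand-in for an LLM call. A prefill-capable API continues
    # generation from `prefill`, so it returns only the remainder.
    return '"item one", "item two"]}'

def query_json(prompt: str) -> dict:
    continuation = fake_model_completion(prompt, PREFILL)
    # Concatenating guarantees the output starts with the JSON skeleton,
    # so the model cannot prepend prose or a ```json fence.
    return json.loads(PREFILL + continuation)

result = query_json("List two items.")
```

Because the response is forced to begin with the prefill, the model has no opportunity to wrap the JSON in a code fence or preamble.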
replies(2): >>45128284 #>>45128591 #
1. psadri ◴[] No.45128284[source]
Haven’t tried this. Does it mix well with tool calls? Or does it force a response where you might have expected a tool call?
replies(1): >>45129068 #
2. fumeux_fume ◴[] No.45129068[source]
It'll force a response that begins with the opening brace. So if you might need a tool call, or any response that doesn't start with "{", it might not fit your workflow.