
397 points by Anon84 | 2 comments
barrell No.45126116
I recently upgraded a large portion of my pipeline from gpt-4.1-mini to gpt-5-mini. The performance was horrible, so after some research I decided to move everything to mistral-medium-0525.

Same price, but dramatically better results, way more reliable, and 10x faster. The only downside is that when it does fail, it seems to fail much harder. Where gpt-5-mini would disregard the formatting in the prompt 70% of the time, mistral-medium follows it 99% of the time, but the other 1% of the time it inserts random characters (for whatever reason, normally backticks... which then causes its own formatting issues).

Still, very happy with Mistral so far!
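
A minimal sketch of the kind of post-processing that can guard against the stray backticks described above, assuming plain-text output where backticks are never expected; the function name strip_stray_backticks and the all-or-nothing rule are illustrative, not barrell's actual fix:

```python
def strip_stray_backticks(text: str) -> str:
    """Drop backticks from model output when they cannot form balanced pairs.

    A crude guard for the failure mode above: rather than guessing which
    backtick is the stray one, remove them all when the count is odd.
    """
    if text.count("`") % 2 != 0:
        return text.replace("`", "")
    return text


# A lone backtick would break downstream Markdown rendering, so it is removed.
print(strip_stray_backticks("step 1: run the ` command"))
# -> "step 1: run the  command"
```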

mark_l_watson No.45126266
It is such a common pattern for LLMs to surround generated JSON with ```json … ``` that I check for this at the application level and fix it. Ten years ago I would do the same sort of sanity checks on formatting when I used LSTMs to generate synthetic data.
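
A minimal sketch of that application-level check, assuming Python and JSON output; the parse_llm_json name and the fence-matching regex are illustrative rather than taken from the comment:

```python
import json
import re


def parse_llm_json(raw: str):
    """Parse JSON from an LLM response, tolerating a ```json ... ``` wrapper."""
    cleaned = raw.strip()
    # Unwrap a leading ``` or ```json fence and the matching trailing fence.
    fenced = re.match(r"^```(?:json)?\s*(.*?)\s*```$", cleaned, flags=re.DOTALL)
    if fenced:
        cleaned = fenced.group(1)
    return json.loads(cleaned)


# Both bare and fenced responses parse to the same object.
assert parse_llm_json('{"ok": true}') == {"ok": True}
assert parse_llm_json('```json\n{"ok": true}\n```') == {"ok": True}
```
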
1. Alifatisk No.45126463
I think this is the first time I've stumbled upon someone who actually mentions LSTMs in a practical context rather than just theory. Cool!

Could you elaborate on what the experience was like? What was your approach for using it? How did you generate the synthetic data, and how did it perform?

2. p1esk No.45127590
10 years ago I used LSTMs for music generation. Worked pretty well for short MIDI snippets (30-60 seconds).