(arxiv.org)

568 points PaulHoule | 1 comments | 07 Jul 25 12:31 UTC | HN request time: 0.206s | source

1. ahmedhawas123 ◴[07 Jul 25 18:51 UTC] No.44493493[source]▶

Reinforcement learning really helped Transformer based LLMs evolve in terms of quality and reasoning which we saw as DeepSeek was launched. I am curious if what this is is equivalent to an early GPT 4o that has not yet reaped the benefits of add-on technologies that helped improve the quality?

↑

Mercury: Ultra-fast language models based on diffusion