Small language models are the future of agentic AI

(arxiv.org)

112 points favoboa | 2 comments | 01 Jul 25 03:33 UTC | HN request time: 0.475s | source

Show context

iagooar ◴[01 Jul 25 09:46 UTC] No.44432199[source]▶

I think that part of the beauty of LLMs is their versatility in so many different scenarios. When I build my agentic pipeline, I can plug in any of the major LLMs, add a prompt to it, and have it go off to do its job.

Specialized, fine-tuned models sit somewhere in between LLMs and traditional procedural code. The fine-tuning process takes time and is a risk if it goes wrong. In the meantime, the LLMs by major providers get smarter every day.

Sure enough, latency and cost are a thing. But unless you have a very specific task performed at a huge scale, you might be better off using an off-the-shelf LLM.

replies(1): >>44433140 #

1. incrudible ◴[01 Jul 25 12:18 UTC] No.44433140[source]▶

>>44432199 #

> In the meantime, the LLMs by major providers get smarter every day.

Are they though? Or are they just getting better at gaming benchmarks?

Subjectively, there has been modest progress in the past year, but I'm curious to hear other anecdotes from people that aren't firmly invested in the hype.

replies(1): >>44433807 #

2. iagooar ◴[01 Jul 25 13:37 UTC] No.44433807[source]▶

>>44433140 (TP) #

If you have used Sonnet 3.5, 3.7 and 4 in the last few months, you know how much the model has improved. I am achieving 3-5x complexity with latest Sonnet as compared to what was possible with the earlier versions.

They are getting much much better.

↑