Ask HN: Is anyone doing anything cool with tiny language models?

684 points prettyblocks | 1 comments | 21 Jan 25 19:39 UTC

I mean anything in the 0.5B-3B range that's available on Ollama (for example). Have you built any cool tooling that uses these models as part of your workflow?

kaspermarstal ◴[22 Jan 25 07:48 UTC] No.42790190[source]▶

>>42784365 (OP) #

I built an Excel Add-In that allows my girlfriend to quickly filter 7000 paper titles and abstracts for a review paper that she is writing [1]. It uses Gemma 2 2b, which is a wonderful little model that can run on her laptop CPU. It works surprisingly well for this kind of binary classification task.

The nice thing is that she can copy/paste the titles and abstracts into two columns, write e.g. "=PROMPT(A1:B1, "If the paper studies diabetic neuropathy and stroke, return 'Include', otherwise return 'Exclude'")", and then drag the formula down across 7000 rows to bulk process the data on her own, because it's just Excel. There is a GIF in the README of the GitHub repo that shows it.

[1] https://github.com/getcellm/cellm
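
For readers who want to try the same row-by-row idea outside Excel, here is a minimal Python sketch. It is not how the add-in is implemented; it only mirrors the pattern described above (one prompt per title/abstract pair, one label back). The helper names (build_prompt, normalize_label, classify_rows) are made up for illustration, and the model tag "gemma2:2b" and a local Ollama server on its default port are assumptions.

```python
import json
import urllib.request
from typing import Callable, Iterable, List, Tuple


def build_prompt(title: str, abstract: str, instruction: str) -> str:
    """Combine the instruction with one row's title and abstract (hypothetical layout)."""
    return f"{instruction}\n\nTitle: {title}\nAbstract: {abstract}\n\nAnswer:"


def normalize_label(raw: str) -> str:
    """Map free-form model output to 'Include', 'Exclude', or 'Unclear'."""
    text = raw.strip(" \n\t'\"*.").lower()
    if text.startswith("include"):
        return "Include"
    if text.startswith("exclude"):
        return "Exclude"
    return "Unclear"  # anything unexpected is flagged for manual review


def ollama_generate(prompt: str, model: str = "gemma2:2b",
                    url: str = "http://localhost:11434/api/generate") -> str:
    """Send one non-streaming request to a local Ollama server (assumed to be running)."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    request = urllib.request.Request(url, data=payload,
                                     headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


def classify_rows(rows: Iterable[Tuple[str, str]], instruction: str,
                  generate: Callable[[str], str]) -> List[str]:
    """Apply the same instruction to every (title, abstract) row, like dragging a formula down."""
    return [normalize_label(generate(build_prompt(title, abstract, instruction)))
            for title, abstract in rows]
```

Passing generate=ollama_generate runs it against a local model; passing any other callable makes it easy to swap models or to test the parsing without a server.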

replies(12): >>42790265 #>>42790359 #>>42790494 #>>42790901 #>>42791645 #>>42791646 #>>42793924 #>>42795545 #>>42796501 #>>42805657 #>>42812155 #>>42813125 #

afro88 ◴[22 Jan 25 08:38 UTC] No.42790494[source]▶

>>42790190 #

How accurate are the classifications?

replies(1): >>42790670 #

kaspermarstal ◴[22 Jan 25 09:02 UTC] No.42790670[source]▶

>>42790494 #

I don't know. This paper [1] reports accuracies in the 97-98% range on a similar task with more powerful models. With Gemma 2 2b the accuracy will certainly be lower.

[1] https://www.medrxiv.org/content/10.1101/2024.10.01.24314702v...

replies(2): >>42791570 #>>42792014 #

beernet ◴[22 Jan 25 12:26 UTC] No.42792014[source]▶

>>42790670 #

> I don't know.

HN in a nutshell: I've built some cool tech but have no idea if it is helpful or even counterproductive...

replies(4): >>42792550 #>>42793068 #>>42794731 #>>42794935 #

kaspermarstal ◴[22 Jan 25 17:04 UTC] No.42794935[source]▶

>>42792014 #

I am not going to claim or report any kind of accuracy, especially with such a small model and such a specific, context-dependent use case. It is the user’s responsibility to cross-validate whether it’s accurate enough for their use case, and to upgrade the model or use another approach if not.
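
One way a user could do that check, sketched in Python: hand-label a random sample of rows, compare against the model's labels, and look at the accuracy together with an uncertainty range. The Wilson score interval used here is a choice made for this illustration, not something the comment prescribes, and the function name is hypothetical.

```python
import math
from typing import Sequence, Tuple


def accuracy_with_wilson_interval(predicted: Sequence[str], expected: Sequence[str],
                                  z: float = 1.96) -> Tuple[float, float, float]:
    """Return (accuracy, lower, upper) for model labels versus hand labels.

    z = 1.96 corresponds to an approximate 95% interval.
    """
    if len(predicted) != len(expected) or not expected:
        raise ValueError("need two non-empty label lists of equal length")

    n = len(expected)
    correct = sum(p == e for p, e in zip(predicted, expected))
    p_hat = correct / n

    # Wilson score interval for a binomial proportion.
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom

    return p_hat, max(0.0, center - half_width), min(1.0, center + half_width)
```

With a small labelled sample the interval is wide, which is itself useful: it shows how many rows need to be checked by hand before deciding whether the small model is good enough or a larger one is needed.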

replies(2): >>42795373 #>>42811207 #

1. dzamo_norton ◴[24 Jan 25 07:23 UTC] No.42811207[source]▶

>>42794935 #

Offer a 100% money back guarantee if the user finds that the software is not fit for purpose :)