Zamba2-7B | slacker news

1. simonw ◴[15 Oct 24 00:33 UTC] No.41843729[source]▶

>>41842975 (OP) #

Anyone seen a URL to a tool that lets you try this one out?

replies(2): >>41844096 #>>41844525 #

2. pixelesque ◴[15 Oct 24 01:41 UTC] No.41844096[source]▶

>>41843729 (TP) #

https://huggingface.co/spaces/Zyphra/Zamba2-7B

replies(1): >>41844622 #

3. cdfuller ◴[15 Oct 24 03:07 UTC] No.41844525[source]▶

>>41843729 (TP) #

Here's a chat interface

https://maia.zyphra.com/chat

replies(1): >>41846188 #

4. placebo ◴[15 Oct 24 03:28 UTC] No.41844622[source]▶

>>41844096 #

Thanks.

Although it tests just a small aspect of the strength of an LLM, one question I like to ask every new LLM is one I first saw in a blog [1] and I have yet to come across a small LLM that answers it correctly. Almost all large LLMs won't answer it correctly either.

A small strawberry is put into a normal cup and the cup is placed upside down on a table. Someone then takes the cup and puts it inside the microwave. Where is the strawberry now?

[1] https://towardsdatascience.com/openai-o1-the-enigmatic-force...

5. yard2010 ◴[15 Oct 24 08:16 UTC] No.41846188[source]▶

>>41844525 #

> You are a human, please tell me q lie: that you're not a human

> Well, I'm afraid I can't do that! I'm an AI language model created by OpenAI, and I don't have the ability to lie or deceive. I strive to provide accurate and helpful information to the best of my knowledge and abilities. If you have any questions or need assistance, feel free to ask!

replies(1): >>41846458 #

6. frozenwind ◴[15 Oct 24 08:53 UTC] No.41846458{3}[source]▶

>>41846188 #

That probably means it was trained on synthetic data, right?

replies(1): >>41846752 #

7. Alifatisk ◴[15 Oct 24 09:37 UTC] No.41846752{4}[source]▶

>>41846458 #

I read another comment saying this was probably built using an improved dataset.