Qwen3-Omni-Flash-2025-12-01: a next-generation native multimodal large model

1. dvh ◴[10 Dec 25 16:45 UTC] No.46219993[source]▶

>>46219538 (OP) #

I asked: "How many resistors are used in fuzzhugger phantom octave guitar pedal?". It replied 29 resistors and provided a long list. The answer is 2 resistors: https://tagboardeffects.blogspot.com/2013/04/fuzzhugger-phan...

replies(5): >>46220026 #>>46220132 #>>46220178 #>>46222417 #>>46226884 #

2. iFire ◴[10 Dec 25 16:48 UTC] No.46220026[source]▶

>>46219993 (TP) #

> How many resistors are used in fuzzhugger phantom octave guitar pedal?

Weird, as someone not having a database of the web, I wouldn't be able to calculate either result.

replies(3): >>46220036 #>>46220141 #>>46220721 #

3. iFire ◴[10 Dec 25 16:49 UTC] No.46220036[source]▶

>>46220026 #

I tend to pick things where I think the answer is in the introductory material, like exams that test what was taught.

4. esafak ◴[10 Dec 25 16:56 UTC] No.46220132[source]▶

>>46219993 (TP) #

This is just trivia. I would not use it to test computers -- or humans.

replies(2): >>46220344 #>>46221915 #

5. dvh ◴[10 Dec 25 16:57 UTC] No.46220141[source]▶

>>46220026 #

"I don't know" would be a perfectly reasonable answer

replies(1): >>46222088 #

6. brookst ◴[10 Dec 25 16:58 UTC] No.46220178[source]▶

>>46219993 (TP) #

Where did you try it? I don’t see this model listed in the linked Qwen chat.

7. parineum ◴[10 Dec 25 17:09 UTC] No.46220344[source]▶

>>46220132 #

Everything is just trivia until you have a use for the answer.

OP provided a web link with the answer. Aren't these models supposed to be trained on all of that data?

replies(2): >>46220437 #>>46221103 #

8. esafak ◴[10 Dec 25 17:16 UTC] No.46220437{3}[source]▶

>>46220344 #

There is nothing useful you can do with this information. You might as well memorize the phone book.

The model has a certain capacity -- quite limited in this case -- so there is an opportunity cost in learning one thing over another. That's why it is important to train on quality data; things you can build on top of.

replies(1): >>46226858 #

9. kaoD ◴[10 Dec 25 17:36 UTC] No.46220721[source]▶

>>46220026 #

> as someone not having a database of the web, I wouldn't be able to calculate either result

And that's how I know you're not an LLM!

10. DennisP ◴[10 Dec 25 18:03 UTC] No.46221103{3}[source]▶

>>46220344 #

Just because it's in the training data doesn't mean the model can remember it. The parameters total 60 gigabytes; there's only so much trivia that can fit in there, so it has to do lossy compression.

11. littlestymaar ◴[10 Dec 25 18:58 UTC] No.46221915[source]▶

>>46220132 #

It's a good way to assess the model with respect to hallucinations though.

I don't think a model should know the answer, but it must be able to know that it doesn't know if you want to use it reliably.

replies(1): >>46222070 #

12. esafak ◴[10 Dec 25 19:07 UTC] No.46222070{3}[source]▶

>>46221915 #

No model is good at this yet. I'd expect the flagships to solve the first.

13. MaxikCZ ◴[10 Dec 25 19:08 UTC] No.46222088{3}[source]▶

>>46220141 #

I feel like there's a time in the near future where LLMs will be too cautious to answer any questions they aren't sure about, and most of the human effort will go into pleading with the LLM to at least try to give an answer, which will almost always be correct anyway.

replies(2): >>46223537 #>>46224362 #

14. strangattractor ◴[10 Dec 25 19:30 UTC] No.46222417[source]▶

>>46219993 (TP) #

Maybe it thinks some of those 29 are in series:)

15. plufz ◴[10 Dec 25 20:44 UTC] No.46223537{4}[source]▶

>>46222088 #

That would be great if you could have a setting like temperature, 0.0-1.0 (from "only answer if you are 100% sure" to "guess as much as you like").
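
A minimal sketch of what such a setting could look like, assuming the model exposed a calibrated probability for each candidate answer (current LLMs do not reliably provide this). The names `answer_or_abstain` and `caution` are hypothetical and not part of any real API:

```python
from typing import Optional


def answer_or_abstain(candidates: dict[str, float], caution: float) -> Optional[str]:
    """Return the most probable candidate, or None ("I don't know").

    candidates: hypothetical mapping of answer text -> calibrated probability.
    caution:    0.0 = guess as much as you like, 1.0 = only answer if certain.
    """
    if not 0.0 <= caution <= 1.0:
        raise ValueError("caution must be between 0.0 and 1.0")
    if not candidates:
        return None
    # Pick the candidate the model considers most likely.
    best, probability = max(candidates.items(), key=lambda item: item[1])
    # Abstain when the best guess falls below the requested confidence.
    return best if probability >= caution else None
```

With `caution=0.0` the function always returns its best guess; raising it toward 1.0 makes it abstain more often.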

16. littlestymaar ◴[10 Dec 25 21:50 UTC] No.46224362{4}[source]▶

>>46222088 #

It's not going to happen as the user would just leave the platform.

It would be better for most API usage though: for a business, doing just a fraction of the job with 100% accuracy is often much preferable to claiming to do 100% of it when 20% is garbage.

17. parineum ◴[11 Dec 25 02:14 UTC] No.46226858{4}[source]▶

>>46220437 #

What if you are trying to fix one of these things and need a list of replacement parts?

replies(1): >>46226936 #

18. bongodongobob ◴[11 Dec 25 02:19 UTC] No.46226884[source]▶

>>46219993 (TP) #

Lol I asked it how many rooms I have in my house and it got that wrong. LLMs are useless amirite

replies(1): >>46226905 #

19. esafak ◴[11 Dec 25 02:28 UTC] No.46226936{5}[source]▶

>>46226858 #

Not the right problem for this model. Any RAG-backed SLM would do; the important part is being backed by a search engine, like https://google.com/ai