Adaptive LLM routing under budget constraints

1. fny ◴[01 Sep 25 17:46 UTC] No.45094906[source]▶

Is there a reason human preference data is even needed? Don't LLMs already have a strong enough notion of question complexity to build a dataset for routing?

replies(3): >>45094974 #>>45095189 #>>45101110 #

2. delichon ◴[01 Sep 25 17:54 UTC] No.45094974[source]▶

>>45094906 (TP) #

> a strong enough notion of question complexity

Aka Wisdom. No, LLMs don't have that. Me neither, I usually have to step in the rabbit holes in order to detect them.

replies(1): >>45095394 #

3. jibal ◴[01 Sep 25 18:17 UTC] No.45095189[source]▶

>>45094906 (TP) #

LLMs don't have notions ... they are pattern matchers against a vast database of human text.

replies(2): >>45095298 #>>45096345 #

4. mhh__ ◴[01 Sep 25 18:29 UTC] No.45095298[source]▶

>>45095189 #

Please do a SELECT * from this database

replies(1): >>45096051 #

5. fny ◴[01 Sep 25 18:39 UTC] No.45095394[source]▶

>>45094974 #

"Do you think you need to do high/medium/low amount of thinking to answer X?" seems well within an LLMs wheelhouse if the goal is to build an optimized routing engine.

replies(1): >>45095871 #

6. nutjob2 ◴[01 Sep 25 19:34 UTC] No.45095871{3}[source]▶

>>45095394 #

How do you think that an LLM could come by that information? Do you think that LLM vendors are logging performance and feeding that back into the model or some other mechanism?

replies(3): >>45096007 #>>45096296 #>>45096334 #

7. ◴[01 Sep 25 19:50 UTC] No.45096007{4}[source]▶

>>45095871 #

8. ashirviskas ◴[01 Sep 25 19:56 UTC] No.45096051{3}[source]▶

>>45095298 #

What was the name of the rocket that brought the first humans into space?

9. carlhjerpe ◴[01 Sep 25 20:22 UTC] No.45096296{4}[source]▶

>>45095871 #

Yes, that's why they keep getting better and why Anthropic is switching privacy policy defaults to eat my data please.

10. adtac ◴[01 Sep 25 20:27 UTC] No.45096334{4}[source]▶

>>45095871 #

Why not something dumb like this: https://chatgpt.com/share/68b60199-b6ac-8009-b50d-3e7cfff1d7... (gpt-4o)

11. imtringued ◴[02 Sep 25 10:18 UTC] No.45101110[source]▶

>>45094906 (TP) #

This is like asking someone to make you a sandwich and expect them to read your mind to determine what kind of sandwich you want.