
169 points constantinum | 2 comments
StrauXX No.40714872
Did I understand the documentation for many of these libraries correctly in that they reprompt until they receive valid JSON? If so, I don't understand why one would do that when token masking is a deterministically verifiable way to get structured output of any kind (as done by Guidance and LMQL, for instance). This is not meant to be snarky, I really am curious: is there an upside to reprompting, aside from easier implementation?
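
For concreteness, the masking approach looks roughly like the sketch below. It is a minimal example written against Hugging Face transformers rather than Guidance or LMQL themselves, and the digit-only allowed set is a toy stand-in for a real JSON grammar that would recompute the allowed tokens at every step:

    # Minimal sketch of decode-time token masking (the Guidance/LMQL-style idea),
    # written against Hugging Face transformers rather than either library.
    # The "grammar" is a toy one: only digit tokens (plus EOS) are allowed,
    # standing in for a real JSON grammar that would compute the allowed set
    # at each step.
    import torch
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              LogitsProcessor, LogitsProcessorList)

    class AllowedTokensOnly(LogitsProcessor):
        def __init__(self, allowed_token_ids):
            self.allowed = torch.tensor(sorted(allowed_token_ids))

        def __call__(self, input_ids, scores):
            # Mask every logit except the allowed ones; sampling can then only
            # produce tokens the grammar accepts, so the output is valid by
            # construction and no reprompting is needed.
            mask = torch.full_like(scores, float("-inf"))
            mask[:, self.allowed] = 0.0
            return scores + mask

    model_name = "gpt2"  # any local causal LM works for the sketch
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)

    # Toy allowed set: every token that decodes to digits, plus EOS so it can stop.
    allowed = [i for i in range(len(tok)) if tok.decode([i]).strip().isdigit()]
    allowed.append(tok.eos_token_id)

    inputs = tok("The answer is", return_tensors="pt")
    out = model.generate(
        **inputs,
        max_new_tokens=8,
        do_sample=True,
        logits_processor=LogitsProcessorList([AllowedTokensOnly(allowed)]),
        pad_token_id=tok.eos_token_id,
    )
    print(tok.decode(out[0], skip_special_tokens=True))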
frabcus No.40714988
My experience with models, even about a year ago, is that the model has firmly decided what it thinks by the time of the last layer, so the alternative probabilities at that layer aren't very useful.

You either get the same (in this case wrong) thing differently worded, or, worse, you get effectively noise when the second-highest probability is very much lower than the largest one.

My guess is that applies here too. Better to let all the layers rethink the tokens than to force hallucination of, e.g., a random letter when you don't expect an angle bracket.

(Edit: the above assumes using logprobs and/or logit_bias with the OpenAI API, not some other masking technique.)
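
Roughly what that looks like in practice, as a hedged sketch against the Chat Completions API; the model name and the token IDs are assumptions for illustration, and note that logit_bias applies uniformly to the whole completion rather than per step:

    # Hedged sketch of the logprobs/logit_bias route. logit_bias maps token IDs
    # to a bias in [-100, 100]: -100 effectively bans a token, positive values
    # make it more likely. The bias applies at every sampling step, which is why
    # it is a much blunter tool than per-step grammar masking.
    import tiktoken
    from openai import OpenAI

    client = OpenAI()
    enc = tiktoken.get_encoding("cl100k_base")  # assumption: matches the chosen model

    open_brace = enc.encode("{")[0]   # nudge the reply toward starting JSON
    fence = enc.encode("```")[0]      # first token of a markdown code fence

    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",  # assumption: any chat model that accepts logit_bias
        messages=[{"role": "user", "content": "Return name=Ada, age=36 as a JSON object."}],
        logit_bias={str(open_brace): 5, str(fence): -100},
        logprobs=True,
        top_logprobs=5,
        max_tokens=60,
    )

    print(resp.choices[0].message.content)
    # The per-token top_logprobs show how committed the model already is at each
    # position; the runner-up candidates are often rewordings or noise.
    for tok in resp.choices[0].logprobs.content[:5]:
        print(tok.token, [(t.token, round(t.logprob, 2)) for t in tok.top_logprobs])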

1. HeatrayEnjoyer No.40715094
Why not apply it at an earlier layer then?
2. anonymoushn No.40716700
At earlier layers the activations don't correspond as cleanly to tokens, and I expect that vendor APIs for proprietary LLMs wouldn't let you do anything like this.