Did I understand the documentation for many of these libraries correctly in that they reprompt until they receive valid JSON? If so I don't understand why one would do that when token masking is a deterministicly verifyable way to get structured output of any kind (as done by Guidance and LMQL for instance).
This is not meant to be snarky, I really am curious. Is there an upside to reprompting - aside from easier implementation.
replies(4):