Man googles offerings are so inconsistent,
batch processing has been available on vertex for a while now,
I dont really get why they have two different offering in vertex and gemini, both are equally inaccessible
replies(2):
[1]: http://web.archive.org/web/20240517173258/https://cloud.goog..., "By default Google caches a customer's inputs and outputs for Gemini models to accelerate responses to subsequent prompts from the customer. Cached contents are stored for up to 24 hours."