
204 points tdchaitanya | 1 comment
hackathonguy No.45098651
I'm very curious whether a) anecdotally, anyone has encountered a real enterprise cost-cutting effort focused on LLM APIs, and b) empirically, anyone has done research on price elasticity across LLMs of different performance tiers.

So far, my experience has been that it's just too early for most people/applications to worry about cost; at most, I've seen AI account for about 10% of cloud spend. But I'm very curious whether others have had different experiences.

replies(2): >>45099654, >>45101476
1. dahcryn No.45101476
LLMs are far from our highest AI-related cost, so we basically don't bother optimizing LLM spend.

Obviously we don't use the super-expensive models like GPT-4.5, but we also don't really bother with the mini models, because GPT-4.1 and the like are cheap enough.

Stuff like speech-to-text is still far more expensive, and yes, there we do focus on cost optimization. We have no large-scale image generation use cases (yet).
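
For a back-of-the-envelope feel for why speech-to-text can dominate, here is a minimal sketch comparing per-request costs. All prices and token/minute figures are made-up placeholders for illustration, not any vendor's actual rates; swap in your provider's current pricing before drawing conclusions.

```python
# Rough per-request cost comparison: LLM tokens vs. speech-to-text minutes.
# Placeholder prices only -- replace with your provider's real rates.

LLM_PRICE_PER_M_INPUT = 2.00    # $ per 1M input tokens (placeholder)
LLM_PRICE_PER_M_OUTPUT = 8.00   # $ per 1M output tokens (placeholder)
STT_PRICE_PER_MINUTE = 0.006    # $ per audio minute transcribed (placeholder)

def llm_request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost of one chat completion at the placeholder token prices."""
    return (input_tokens / 1_000_000) * LLM_PRICE_PER_M_INPUT \
         + (output_tokens / 1_000_000) * LLM_PRICE_PER_M_OUTPUT

def stt_request_cost(audio_minutes: float) -> float:
    """Cost of transcribing one clip at the placeholder per-minute price."""
    return audio_minutes * STT_PRICE_PER_MINUTE

if __name__ == "__main__":
    # Hypothetical workflow: transcribe a 10-minute call, then summarize it.
    transcription = stt_request_cost(audio_minutes=10)
    summary = llm_request_cost(input_tokens=4_000, output_tokens=500)
    print(f"STT:  ${transcription:.4f} per call")
    print(f"LLM:  ${summary:.4f} per call")
    print(f"STT is {transcription / summary:.1f}x the LLM cost here")
```

With these placeholder numbers the transcription step costs several times the summarization step, which is the shape of comparison that makes speech-to-text the first target for cost optimization rather than the LLM calls.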