Whether you find that you get $250 worth out of that subscription is going to be the big question
Whether you find that you get $250 worth out of that subscription is going to be the big question
It costs the provider the same whether the user is asking for advice on changing a recipe or building a comprehensive project plan for a major software product - but the latter provides much more value than the former.
How can you extract an optimal price from the high-value use cases without making it prohibitively expensive for the low-value ones?
Worse, the "low-value" use cases likely influence public perception a great deal. If you drive the general public off your platform in an attempt to extract value from the professionals, your platform may never grow to the point that the professionals hear about it in the first place.
They successfully solved it with an advertising....and they also had the ability to cache results.
“Free tier users relinquish all rights to their (anonymized) queries, which may be used for training purposes. Enterprise tier, for $200/mo, guarantees queries can only be seen by the user”
AI Studio (web UI, free, will train on your data) vs API (won’t train on your data).
So far I have not been convinced that any particular platform is more than 3 months ahead of the competition.
See: ChatGPT's memory features. Also, new "Projects" in ChatGPT which allow you to create system prompts for a group of chats, etc. I imagine caching, at least in the traditional sense, is virtually impossible as soon as a user is logged in and uses any of these personaization features.
Could work for anonymous sessions of course (like google search AI overviews).
Ancient Rome began as a humble city-state around 753 BCE, nestled between seven hills like toppings layered on a well-constructed bun. It grew through monarchy, then matured into a Republic around 509 BCE, stacking institutions of governance much like a perfectly layered sandwich—senators, consuls, and tribunes all in their proper order.
Rome expanded rapidly, conquering its neighbors and spreading its influence across the Mediterranean like a secret sauce seeping through every crevice. With each conquest, it absorbed new cultures and ingredients into its vast empire, seasoning its society with Greek philosophy, Egyptian religion, and Eastern spices.
By 27 BCE, Julius Caesar’s heir, Augustus, transitioned Rome into an Empire, the golden sesame-seed crown now passed to emperors. Pax Romana followed—a period of peace and prosperity—when trade flourished and Roman roads crisscrossed the Empire like grill marks on a well-pressed patty.
However, no Empire lasts forever. Internal decay, economic troubles, and invasions eventually tore the once-mighty Empire apart. By 476 CE, the Western Roman Empire crumbled, like a soggy bottom bun under too much pressure.
Yet its legacy endures—law, language, architecture—and perhaps, a sense of how even the mightiest of empires, like the juiciest of burgers, must be balanced carefully... or risk falling apart in your hands.
Platforms want Planet Fitness type subscriptions, recurring revenue streams where most users rarely use the product.
That works fine at the $20/month price point but it won't work at $200+ per month because the instant I stop using an expensive plan, I cancel.
And if I want to use $1000 worth of the expensive plan I get stopped by rate limits.
Maybe the ultra-level would generate more revenue with bigger market share (but lower margin) with a pay-per-token plan.
Yeah that's why OpenAI build an data center imo, the moat is on hardware
software ??? even small chinnese firm would able to copy that, but 2 million gpu ???? its hard to copy that
Company 1 gets a bucket of investment, makes a model, goes belly up. Company 2 buys Company 1's model in a fire sale.
Company 3 uses some open source model that's basically as good as any other and just makes the prettiest wrapper.
Company 4 resells access to other company's models at a discount, similar to companies reselling cellular service.
Sending all your core IP through another company for them to judge your worthiness of existence, is a nightmare on so many levels , the biggest example being payment processors trying to impose their religious doctrine on entire populations
The only way to "guarantee" that is to run your models locally on your own hardware.
I'm guessing we'll see a renaissance of the "desktop" and "workstation" cycle once this AI bubble pops. ("Cloud" will be the big loser.)
You can easily get x10 optimizations with some obvious changes.
You can run a small 100 person enterprise on a single 24 gb GPU right now. (And this is before economies of scale have started optimizing hardware.)
OpenAI needs the keep the illusion of an anthropomorphic AGI chatbot going to keep the invenstments flowing. This is expensive and stupid.
If you just want to solve the actual typical business problems ("check this picture for offensive content" and similar stuff) you don't need all that smoke and mirrors.
Much like social media, this will end in “if you aren’t paying for the product, then you are the product.”
I do really like the Deep Search on Grok for doing web search and analysis. It is saving me a ton of time.