Andrej Karpathy: Software in the era of AI [video]

(www.youtube.com)

1479 points sandslash | 1 comments | 19 Jun 25 00:33 UTC


gchamonlive ◴[19 Jun 25 01:31 UTC] No.44314670[source]▶

I think it's interesting to juxtapose traditional coding, neural network weights and prompts because in many areas -- like the example of the self driving module having code being replaced by neural networks tuned to the target dataset representing the domain -- this will be quite useful.

However I think it's important to make it clear that given the hardware constraints of many environments the applicability of what's being called software 2.0 and 3.0 will be severely limited.

So instead of being replacements, these paradigms are more like extra tools in the tool belt. Code and prompts will live side by side, each used when convenient, but neither is a panacea.

replies(4): >>44315052 #>>44316337 #>>44322007 #>>44323973 #

karpathy ◴[19 Jun 25 02:58 UTC] No.44315052[source]▶

>>44314670 #

I kind of say it in words (agreeing with you), but I agree the versioning is a bit of a confusing analogy, because it usually also implies some kind of improvement, when I’m just trying to distinguish them as very different software categories.

replies(5): >>44315296 #>>44319138 #>>44319445 #>>44320206 #>>44320915 #

miki123211 ◴[19 Jun 25 03:57 UTC] No.44315296[source]▶

>>44315052 #

What do you think about structured outputs / JSON mode / constrained decoding / whatever you wish to call it?

To me, it's a criminally underused tool. While "raw" LLMs are cool, they're annoying to use as anything but chatbots, as their output is unpredictable and basically impossible to parse programmatically.

Structured outputs solve that problem neatly. In a way, they're "neural networks without the training". They can be used to solve the same kinds of problems as traditional neural networks, things like image classification or extracting information from messy text, but all they require is a Zod or Pydantic type definition and a prompt. No renting GPUs, labeling data, or tuning hyperparameters necessary.

They often also improve LLM performance significantly. Imagine you're trying to extract calories per 100g of product, but some products give you calories per serving and a serving size, others calories per pound, etc. The naive way to do this is a prompt like "give me calories per 100g", but that forces the LLM to do arithmetic, and LLMs are bad at arithmetic. With structured outputs, you just give it the fifteen different formats that you expect to see as alternatives, and use some simple Python to turn them all into calories per 100g on the backend side.
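A minimal sketch of the calories example, assuming only three of the formats and using stdlib dataclasses as a stand-in for the Zod or Pydantic definition. All names here (`Per100g`, `PerServing`, `PerPound`, the `format` tag, `parse_calorie_info`, `kcal_per_100g`) are hypothetical, and the JSON string stands in for whatever the model returns in structured-output mode. The point is that the model only reports what is printed on the label, and the arithmetic happens in ordinary code.

```python
import json
from dataclasses import dataclass
from typing import Union

GRAMS_PER_POUND = 453.59237  # exact definition of the avoirdupois pound


@dataclass
class Per100g:
    kcal: float


@dataclass
class PerServing:
    kcal: float
    serving_size_g: float


@dataclass
class PerPound:
    kcal: float


CalorieInfo = Union[Per100g, PerServing, PerPound]

# Hypothetical "format" tag the model is asked to emit, mapped to its type.
FORMATS = {
    "per_100g": Per100g,
    "per_serving": PerServing,
    "per_pound": PerPound,
}


def parse_calorie_info(raw: str) -> CalorieInfo:
    """Parse the model's JSON output into one of the expected alternatives."""
    data = json.loads(raw)
    fmt = data.pop("format")  # KeyError if the tag is missing
    return FORMATS[fmt](**data)  # TypeError if fields do not match the type


def kcal_per_100g(info: CalorieInfo) -> float:
    """Normalize any supported format to kcal per 100 g, in plain Python."""
    if isinstance(info, Per100g):
        return info.kcal
    if isinstance(info, PerServing):
        return info.kcal * 100.0 / info.serving_size_g
    if isinstance(info, PerPound):
        return info.kcal * 100.0 / GRAMS_PER_POUND
    raise TypeError(f"unsupported calorie format: {type(info).__name__}")


# Stand-in for a structured model response: 150 kcal per 30 g serving.
raw = '{"format": "per_serving", "kcal": 150, "serving_size_g": 30}'
print(kcal_per_100g(parse_calorie_info(raw)))  # 500.0
```

With Pydantic the three classes would typically become models in a tagged union, and the library would do the validation that `parse_calorie_info` does by hand here.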

replies(3): >>44316175 #>>44319590 #>>44322019 #

1. coderatlarge ◴[19 Jun 25 20:00 UTC] No.44322019[source]▶

>>44315296 #

Note that the "per 100g" prompt might lead the LLM to reach for the part of its training distribution that is actually written in terms of the 100g standard, so the result may just be a different recall rather than a suboptimal calculation based on training examples that are not standardized to per 100g.
