Most active commenters

jackphilson(3)

NanoChat – The best ChatGPT that $100 can buy

(github.com)

https://x.com/karpathy/status/1977755427569111362

1. daft_pink ◴[13 Oct 25 15:43 UTC] No.45569633[source]▶

>>45569350 (OP) #

Wow, how do we sign up for the Eurekalabs course and how much does it cost?

replies(2): >>45569747 #>>45571160 #

2. huseyinkeles ◴[13 Oct 25 15:52 UTC] No.45569747[source]▶

>>45569633 #

Karpathy says nanochat will become the capstone project of the course LLM101n being developed by Eureka Labs.

I guess it’s still a work in progress? Couldn’t find any other information elsewhere.

replies(1): >>45570589 #

3. karimf ◴[13 Oct 25 16:02 UTC] No.45569878[source]▶

>>45569350 (OP) #

I've always thought about the best way to contribute to humanity: number of people you help x how much you help them. I think what Karpathy is doing is one of the highest leverage ways to achieve that.

Our current world is build on top of open source projects. This is possible because there are a lot of free resources to learn to code so anyone from anywhere in the world can learn and make a great piece of software.

I just hope the same will happen with the AI/LLM wave.

replies(1): >>45570834 #

4. Schiphol ◴[13 Oct 25 16:53 UTC] No.45570589{3}[source]▶

>>45569747 #

A bit more info [here](https://github.com/karpathy/LLM101n)

5. ◴[13 Oct 25 17:12 UTC] No.45570814[source]▶

>>45569350 (OP) #

6. flakiness ◴[13 Oct 25 17:13 UTC] No.45570831[source]▶

>>45569350 (OP) #

Eureka Labs: https://github.com/EurekaLabsAI

What a prolific person Andrej is. It's been more than amazing to follow along!

7. jackphilson ◴[13 Oct 25 17:14 UTC] No.45570834[source]▶

>>45569878 #

[flagged]

replies(1): >>45570884 #

8. nsriv ◴[13 Oct 25 17:17 UTC] No.45570884{3}[source]▶

>>45570834 #

Controlling culture, yes but wild pivot to mention that criminal alongside Karpathy.

replies(2): >>45570907 #>>45571202 #

9. jackphilson ◴[13 Oct 25 17:19 UTC] No.45570907{4}[source]▶

>>45570884 #

I mean just an example. He obviously wasn't the most ethical person. Depends how you do it

replies(1): >>45571161 #

10. ◴[13 Oct 25 17:26 UTC] No.45570994[source]▶

>>45569350 (OP) #

11. TheAceOfHearts ◴[13 Oct 25 17:32 UTC] No.45571062[source]▶

>>45569350 (OP) #

Here's the announcement post [0] from Karpathy, which provides a bit of additional context.

[0] https://x.com/karpathy/status/1977755427569111362

replies(1): >>45571613 #

12. swyx ◴[13 Oct 25 17:34 UTC] No.45571096[source]▶

>>45569350 (OP) #

> Thank you to chief LLM whisperer Alec Radford for advice/guidance.

oh man an Alec x Andrej podcast would BREAK THE INTERNET... just saying... going from glory days of GPT1 to now building GPT3? in 4 hours

replies(1): >>45571175 #

13. karpathy ◴[13 Oct 25 17:39 UTC] No.45571160[source]▶

>>45569633 #

Still under development, remaining work includes tuning nanochat (current state being solid v0.1) and finalizing the in-between projects so that students can "unlock" all complexity that hides underneath: `torch.Tensor`, `torch.dist`, `.backward()`, '.compile()`, etc. And then the more ops heavy aspects.

14. IOT_Apprentice ◴[13 Oct 25 17:39 UTC] No.45571161{5}[source]▶

>>45570907 #

Neither are Stalin, Netanyahu, Pol Pot, Hitler, Charles Manson et al.

Way to derail the conversation. Focus on the positive people and their legacy of time, sharing, positive energy and contributions to society

replies(1): >>45571245 #

15. codybontecou ◴[13 Oct 25 17:40 UTC] No.45571175[source]▶

>>45571096 #

Please oh please. This would be perfect.

16. cultofmetatron ◴[13 Oct 25 17:42 UTC] No.45571202{4}[source]▶

>>45570884 #

not a particularly ethical guy and I wouldn't hold him up as a example of morality but the guy hasn't actually been found guilty YET. Multiple courts have tried. You'd think that for a guy under as much scrutiny as him that they would have SOMETHING to pin him on by now.

Innocent until PROVEN guilty is a foundational legal precedent for a reason.

replies(1): >>45571325 #

17. mhitza ◴[13 Oct 25 17:43 UTC] No.45571218[source]▶

>>45569350 (OP) #

Should be "that you can train for $100"

Curios to try it someday on a set of specialized documents. Though as I understand the cost of running this is whatever GPU you can rent with 80GB of VRAM. Which kind of leaves hobbyists and students out. Unless some cloud is donating gpu compute capacity.

replies(2): >>45571268 #>>45571369 #

18. jackphilson ◴[13 Oct 25 17:46 UTC] No.45571245{6}[source]▶

>>45571161 #

not derailing, just pointing out effective ways of producing good which is what i was responding to. i think its good for people to be aware of this. those people are all examples of people who have influenced culture for bad. you can do it for good: bryan johnson, civil rights leaders, leftist streamers. andrew tate was just the most effective, recent, and obvious one which is why I pointed him out.

19. portaouflop ◴[13 Oct 25 17:49 UTC] No.45571268[source]▶

>>45571218 #

If I have let’s say 40gb RAM does it not work at all or just take twice as long to train?

replies(1): >>45571442 #

20. portaouflop ◴[13 Oct 25 17:53 UTC] No.45571325{5}[source]▶

>>45571202 #

He is definitely guilty of being a waste of human life, a massive asshole and a general detriment to society worldwide. Don’t need a court to prove that.

There are 6 criminal cases against him in several countries, let’s see how they pan out - but regardless he is not an innocent person.

21. Onavo ◴[13 Oct 25 17:56 UTC] No.45571369[source]▶

>>45571218 #

A GPU with 80GB VRAM costs around $1-3 USD an hour on commodity clouds (i.e. the non-Big 3 bare metal providers e.g. https://getdeploying.com/reference/cloud-gpu/nvidia-h100). I think it's accessible to most middle class users in first world countries.

22. typpilol ◴[13 Oct 25 18:02 UTC] No.45571442{3}[source]▶

>>45571268 #

Won't work at all. Or if it does it'll be so slow since it'll have to go to the disk for every single calculation so it won't ever finish.

23. Havoc ◴[13 Oct 25 18:09 UTC] No.45571541[source]▶

>>45569350 (OP) #

>If your GPU(s) have less than 80GB, you'll have to tune some of the hyperparameters or you will OOM / run out of VRAM. Look for --device_batch_size in the scripts and reduce it until things fit. E.g. from 32 (default) to 16, 8, 4, 2, or even 1.

That sounds like it could run on a 24gb GPU. Batch size of 8 would imply 20gb mem, no?

...presumably just takes forever

24. dang ◴[13 Oct 25 18:16 UTC] No.45571613[source]▶

>>45571062 #

Thanks - we'll put that in the toptext as well

↑