←back to thread

135 points lnyan | 1 comments | | HN request time: 0.215s | source
Show context
llm_trw ◴[] No.42731112[source]
>The estimated training time for the end-to-end model on an 8×H100 machine is 2.6 days.

That's a $250,000 machine for the micro budget. Or if you don't want to do it locally ~$2,000 to do it on someone else's machine for the one model.

replies(4): >>42731300 #>>42731535 #>>42732654 #>>42738646 #
1. corysama ◴[] No.42732654[source]
So, a kilo-budget.