←back to thread

365 points kashifr | 1 comments | | HN request time: 0.544s | source
1. gdiamos ◴[] No.44502342[source]
Nice work anton et al.

I hope you continue the 50-100M parameter models.

I think there is a case for models that finish fast on CPUs in solve by llm test cases.