
577 points simonw | 1 comment
1. sneak No.44724415
What is the SOTA for benchmarking all of the models you can run on your local machine vs a test suite?

Surely this must exist, no? I want to generate a local leaderboard and perhaps write new test cases.