←back to thread

1311 points msoad | 1 comments | | HN request time: 0.212s | source
1. dvt ◴[] No.35395606[source]
This seems suspiciously like a bug (either in inference or in mmap reporting), as these models are not sparse enough for the savings to come from anywhere viable.