
152 points | fzliu | 1 comment
bob1029 No.43562889
So, we're proposing a multiplicative increase of something that already scales quadratically with the context size?

I think we've already got a bit of a bottleneck in terms of memory bandwidth utilization.

1. kadushka No.43563390
If you have a bottleneck in terms of memory bandwidth utilization, this method is great - it would utilize the idle compute.
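
To make that reasoning concrete, here is a rough roofline-style sketch. It assumes the extra work multiplies FLOPs without adding memory traffic, and that compute and memory transfers overlap perfectly; neither assumption is established in this thread. The hardware numbers and the `step_time` helper are hypothetical, chosen only for illustration.

```python
def step_time(flops, bytes_moved, peak_flops, mem_bandwidth):
    """Roofline-style estimate: the slower of compute and memory sets the time.

    Assumes compute and memory transfers overlap perfectly (an idealization).
    """
    compute_time = flops / peak_flops
    memory_time = bytes_moved / mem_bandwidth
    return max(compute_time, memory_time)


# Hypothetical accelerator, not a real device.
PEAK_FLOPS = 100e12      # 100 TFLOP/s
MEM_BANDWIDTH = 1e12     # 1 TB/s

# Hypothetical memory-bound workload: 10 FLOPs per byte moved.
BASE_FLOPS = 1e12
BYTES_MOVED = 1e11

baseline = step_time(BASE_FLOPS, BYTES_MOVED, PEAK_FLOPS, MEM_BANDWIDTH)

# Multiply the compute by k while keeping memory traffic fixed (the assumption).
for k in (1, 4, 20):
    t = step_time(k * BASE_FLOPS, BYTES_MOVED, PEAK_FLOPS, MEM_BANDWIDTH)
    print(f"k={k:>2}: {t:.3f} s ({t / baseline:.1f}x baseline)")
```

Under these assumptions the step time stays flat until the added compute reaches the point where compute time exceeds memory time; beyond that the multiplier shows up in full. If the extra work also increases memory traffic, the "free" region shrinks accordingly.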