2 points badmonster | 2 comments
1. badmonster No.44535221
A subtle but powerful insight: large multimodal models like CLIP don’t just learn individual concepts in isolation. Their behavior also depends heavily on how often those concepts appear together in the training data.
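To make "how often concepts appear together" concrete, here is a rough sketch that counts concept co-occurrence over a handful of captions and scores each pair with pointwise mutual information (PMI). The captions, the concept list, and the helper names `cooccurrence_stats` and `pmi` are all made up for illustration, and PMI is just one common association measure swapped in here; nothing above says this is how CLIP's training data was actually analyzed.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_stats(captions, concepts):
    """Count captions containing each concept and each unordered concept pair."""
    single = Counter()
    pair = Counter()
    for caption in captions:
        words = set(caption.lower().split())
        # Sort so every pair is stored under one canonical key, e.g. ("dog", "grass").
        present = sorted(c for c in concepts if c in words)
        single.update(present)
        pair.update(combinations(present, 2))
    return single, pair


def pmi(single, pair, n_captions, a, b):
    """PMI in bits: log2( P(a, b) / (P(a) * P(b)) ), estimated from caption counts."""
    key = tuple(sorted((a, b)))
    if pair[key] == 0:
        return float("-inf")  # the pair never appears together
    return math.log2(pair[key] * n_captions / (single[a] * single[b]))


# Hypothetical toy captions, standing in for a real image-text dataset.
captions = [
    "a dog on the grass",
    "a dog running on grass",
    "a cat on a sofa",
    "a dog on a sofa",
]
concepts = ["dog", "cat", "grass", "sofa"]

single, pair = cooccurrence_stats(captions, concepts)
for a, b in combinations(sorted(concepts), 2):
    print(a, b, pair[(a, b)], pmi(single, pair, len(captions), a, b))
```

In this toy data "cat" and "grass" never share a caption, so a model trained on it would see each concept individually but never the combination, which is the kind of gap the comment is pointing at.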