This is a cool way to look at multimodal embeddings. They look at performance as the the percentage of inputs slides from one modality to another:
https://i0.wp.com/blog.voyageai.com/wp-content/uploads/2024/...
replies(1):
https://i0.wp.com/blog.voyageai.com/wp-content/uploads/2024/...
why does it pop up at the end?