←back to thread

213 points Philpax | 1 comments | | HN request time: 0.21s | source
1. breadislove ◴[] No.42174176[source]
There is this really interesting blog post about making rope (by the main author of the paper) multimodal as used by qwen2 vl. it's in chinese but google translate does a pretty good job: https://spaces.ac.cn/archives/10040