Points I thought were interesting:
> Apple’s model is built on top of Qwen2.5‑7B, an open-source foundation model from Alibaba. Alibaba first fine-tuned that model for better code generation (as Qwen2.5‑Coder‑7B), then Apple took it and made its own adjustments.

> it still doesn’t quite reach the level of GPT-4 or Gemini Diffusion.