What this mostly seems to demonstrate is that hip-hop is endlessly derivative. That might be a consequence of their data source:
> To build this project, we used the dataset of hundreds of thousands of songs on Genius.com accessible through their API, over 200,000 of which were “connected” in some way by sample, interpolation, cover, or remix.
Genres where sampling is openly and explicitly acknowledged are going to be massively over-represented. It would be cool to build a relationship network using feature extraction on the actual audio.
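To make that last idea a bit more concrete, here is a rough sketch of the graph-building half only. It assumes each track has already been reduced to a fixed-length feature vector (say, averaged chroma or MFCC values from some audio library); the track names, the vectors, and the 0.95 threshold are all made up for illustration, and cosine similarity is just one plausible choice of metric, not anything the project actually does.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Treat an all-zero vector as similar to nothing.
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_similarity_graph(features, threshold=0.95):
    """Link every pair of tracks whose similarity meets the threshold.

    features: dict mapping a track id to its feature vector (assumed
    to be precomputed from the audio). Returns an adjacency dict.
    """
    graph = {track: set() for track in features}
    for a, b in combinations(features, 2):
        if cosine(features[a], features[b]) >= threshold:
            graph[a].add(b)
            graph[b].add(a)
    return graph


# Hypothetical toy vectors standing in for real audio features.
features = {
    "track_a": [1.0, 0.0, 0.5],
    "track_b": [0.9, 0.1, 0.5],
    "track_c": [0.0, 1.0, 0.0],
}
graph = build_similarity_graph(features)
```

The all-pairs loop is quadratic, so at the scale of hundreds of thousands of songs you would need some kind of approximate nearest-neighbour search instead. Whole-track averages would also probably miss short samples, which is the hard part of doing this from audio.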