Bonobos use a kind of syntax once thought to be unique to humans

> They recorded over 300 of these observations, including what the caller was doing at the time, what was happening in the environment and the behaviour of the caller and audience after the vocalisation.

> To reveal the meaning of each call, they used a technique from linguistics to create a cloud of utterance types, placing vocalisations that occurred in similar circumstances closer together. “We kind of established this dictionary,” says Berthlet. “We have one vocalisation and one meaning.”

This is lots of manual effort, could the recent advancement in language models help decode animal languages more easily? I guess it will need lots 24/7 capture of physical movement/action and sound data and train a model (that already understands vocal English too) perhaps.