Did you try this recently with GPT-4? Here's my attempt to replicate your test:
---Prompt---
I show you a set of four magisterial unicorns in a cute pink stable, each of them has a letter painted on one side of their body and a shape painted on the other. Right now you see all four unicorns, two of which are showing the shape side of their body and two of which show the letter side. You see that the unicorns read: 'A', 'S', Triangle, Square
Which unicorns do you need to spin around to test the validity of the following statement:
"if a unicorn shows a consonant on one side of its body, then the opposite side of its body is a triangle?"
---GPT-4---
To test the validity of the statement "if a unicorn shows a consonant on one side of its body, then the opposite side of its body is a triangle," you need to look for potential counterexamples to the rule. The two cases to check are:
* Unicorns with a consonant showing.
* Unicorns with a triangle showing.
You do not need to check unicorns with a vowel (like 'A') or a different shape (like Square), as they don't violate the statement.
So, you need to spin the following unicorns:
The unicorn with the letter 'S' (to check if it has a triangle on the other side)
The unicorn with the Triangle shape (to check if it has a consonant on the other side)
---
So it got the answer wrong, but in a way that I think a lot of humans would: you need to spin the 'S' and the Square unicorns, not the Triangle, since only a consonant paired with a non-triangle can falsify the rule. Still seriously impressive in my book, and definitely not falling into incoherence.
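For anyone who wants to sanity-check the expected answer, here is a minimal brute-force sketch in Python. It is my own illustration, not anything GPT-4 produced: the names `violates_rule` and `needs_spin` are made up for this example, and I am assuming one vowel and one consonant, plus one triangle and one non-triangle, are enough to stand in for every possible hidden side.

```python
# Hypothetical stand-ins for every possible hidden side (an assumption for this sketch).
LETTERS = ["A", "S"]              # one vowel, one consonant
SHAPES = ["Triangle", "Square"]   # the triangle and one non-triangle
VOWELS = set("AEIOU")


def violates_rule(letter, shape):
    """True if a unicorn with this letter and shape is a counterexample
    to 'consonant on one side implies triangle on the other'."""
    return letter not in VOWELS and shape != "Triangle"


def needs_spin(visible):
    """A unicorn is worth spinning only if some possible hidden side
    would turn it into a counterexample."""
    if visible in SHAPES:
        # Hidden side is a letter: try each candidate letter.
        return any(violates_rule(letter, visible) for letter in LETTERS)
    # Hidden side is a shape: try each candidate shape.
    return any(violates_rule(visible, shape) for shape in SHAPES)


visible_sides = ["A", "S", "Triangle", "Square"]
to_spin = [side for side in visible_sides if needs_spin(side)]
print(to_spin)  # ['S', 'Square']
```

The Triangle unicorn drops out because nothing on its hidden side can break the rule: the statement says consonants imply triangles, not that triangles imply consonants.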