A study in Nature Human Behaviour compares the theory of mind capabilities of GPT-3.5, GPT-4, and LLaMA2-70B against humans, finding that AI models show varying degrees of success across different tasks.
A study in Nature Human Behaviour compares the theory of mind capabilities of GPT-3.5, GPT-4, and LLaMA2-70B against humans, finding that AI models show varying degrees of success across different tasks.