An Entire Company Was Staffed With AI Agents and You’ll Never Guess What Happened
Researchers at Carnegie Mellon University staffed a fake software company with AI agents to test their capabilities. The results were chaotic: the best-performing model completed only 24% of tasks, and at a high cost. The study highlights AI’s limitations in common sense, social skills, and internet navigation, suggesting it is not ready to replace human workers in complex roles.