45% of answers are wrong: the study proving that AI hasn’t (yet) learned rigor

📊 A study that cools the hypeA large-scale analysis conducted by the BBC, the European Broadcasting Union, and several European public media outlets has revealed a troubling fact: nearly one out of every two answers generated by large language models (LLMs) — ChatGPT, Gemini, Copilot, Perplexity — contains a significant error. Out of 3,000 answers tested […]