Before the installation of the new AI chatbots or other agentic AI, they need profound testing. Wise statistics are quoted with the conviction: it is all about testing, testing, testing. Any systems that build on statistical reasoning (LLMs or machine learning) will behave erratically on what is known as an area with stronger impacts of, for example, statistical outliers. On both ends of the “normal distribution” of events or reasoning the statistical models and algorithms used in AI will produce “spurious” errors or have larger error margins on such topics a bit off the 95% of usual cases.
This means, testing, testing and testing again for the programmers of such AI systems before the release to the public or enterprise specific solutions. The tendency to keep costs of testing phases low compared to developing costs bears obvious risks to the “precautionary principle” applied in the European Union. Testing is most important to check the WEIRD bias of the most basic AI systems. In this sense AI development has become a sociological exercise as they have to deal with “selection bias” of many kinds that could have very expensive legal consequences.
(Image: Extract from Bassano, Jacopo: Abduction of Europa by Zeus, Odessa Museum treasures at exhibition in Berlin Gemäldegalerie 2025-5).