Before the installation of the new AI chatbots or other agentic AI, they need profound testing. Wise statistics are quoted with the conviction: it is all about testing, testing, testing. Any systems that build on statistical reasoning (LLMs or machine learning) will behave erratically on what is known as an area with stronger impacts of, for example, statistical outliers. On both ends of the “normal distribution” of events or reasoning the statistical models and algorithms used in AI will produce “spurious” errors or have larger error margins on such topics a bit off the 95% of usual cases.
This means, testing, testing and testing again for the programmers of such AI systems before the release to the public or enterprise specific solutions. The tendency to keep costs of testing phases low compared to developing costs bears obvious risks to the “precautionary principle” applied in the European Union. Testing is most important to check the WEIRD bias of the most basic AI systems. In this sense AI development has become a sociological exercise as they have to deal with “selection bias” of many kinds that could have very expensive legal consequences.
(Image: Extract from Bassano, Jacopo: Abduction of Europa by Zeus, Odessa Museum treasures at exhibition in Berlin Gemäldegalerie 2025-5).






























The AI ChatGPT is advocating AI for the PS for mainly 4 reasons: (1) efficiency purposes; (2) personalisation of services; (3) citizen engagement; (4) citizen satisfaction. (See image below). The perspective of employees of the public services is not really part of the answer by ChatGPT. This is a more ambiguous part of the answer and would probably need more space and additional explicit prompts to solicit an explicit answer on the issue. With all the know issues of concern of AI like gender bias or biased data as input, the introduction of AI in public services has to be accompanied by a thorough monitoring process. The legal limits to applications of AI are more severe in public services as the production of official documents is subject to additional security concerns.
(See image). ChatGPT provides a more careful definition as the “crowd” or networked intelligence of Wikipedia. AI only “refers to the simulation” of HI processes by machines”. Examples of such HI processes include the solving of problems and understanding of language. In doing this AI creates systems and performs tasks that usually or until now required HI. There seems to be a technological openness embedded in the definition of AI by AI that is not bound to legal restrictions of its use. The learning systems approach might or might not allow to respect the restrictions set to the systems by HI. Or, do such systems also learn how to circumvent the restrictions set by HI systems to limit AI systems? For the time being we test the boundaries of such systems in multiple fields of application from autonomous driving systems, video surveillance, marketing tools or public services. Potentials as well as risks will be defined in more detail in this process of technological development. Society has to accompany this process with high priority since fundamental human rights are at issue. Potentials for assistance of humans are equally large. The balance will be crucial.
