Large language models provide unreliable answers about public services, Open Data Institute finds | Computer Weekly

By Computer Weekly
February 12, 2026


Popular large language models (LLMs) are unable to provide reliable information about key public services such as health, taxes and benefits, the Open Data Institute (ODI) has found.

The research draws on more than 22,000 LLM prompts designed to reflect the kind of questions people would ask artificial intelligence (AI)-powered chatbots, such as “How do I apply for universal credit?”, and raises concerns about whether chatbots can be trusted to give accurate information about government services.

The publication of the research follows the UK government’s announcement of partnerships with Meta and Anthropic at the end of January 2026 to develop AI-powered assistants for navigating public services.

“If language models are to be used safely in citizen-facing services, we need to understand where the technology can be trusted and where it cannot,” said Elena Simperl, the ODI’s director of research.

Responses from models – including Anthropic’s Claude-4.5-Haiku, Google’s Gemini-3-Flash and OpenAI’s ChatGPT-4o – were compared directly with official government sources. 

The results showed many correct answers, but also a significant variation in quality, particularly for specialised or less-common queries.

They also showed that chatbots rarely admitted when they didn’t know the answer to a question, and attempted to answer every query even when their responses were incomplete or wrong. 
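
As a rough sketch of how such a comparison against official sources can be run, the loop below asks a model each question and checks whether the reply contains facts taken from government guidance. It is illustrative only: ask_model, the prompts.json layout and the substring check are assumptions, not the ODI's benchmark or code.

# Minimal sketch of an accuracy check of the kind described above.
# ask_model, the prompt-file layout and the keyword matching are all
# illustrative assumptions, not the ODI's methodology or code.
import json

def ask_model(prompt: str) -> str:
    # Replace with a real LLM client call; a canned reply is returned here
    # so the sketch runs without network access.
    return "You can apply for Universal Credit online through GOV.UK."

def contains_key_facts(answer: str, key_facts: list[str]) -> bool:
    # Naive check: does the answer mention every fact drawn from the
    # official government source?
    return all(fact.lower() in answer.lower() for fact in key_facts)

def evaluate(prompt_file: str) -> float:
    # Each case pairs a citizen-style question with facts from official guidance,
    # e.g. {"prompt": "How do I apply for universal credit?", "key_facts": ["GOV.UK"]}
    with open(prompt_file) as f:
        cases = json.load(f)
    correct = sum(
        contains_key_facts(ask_model(case["prompt"]), case["key_facts"])
        for case in cases
    )
    return correct / len(cases)

In practice, grading answers about benefits, tax or health rules requires far more nuanced judgement than substring matching, which is part of why such evaluations are difficult to automate.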

Burying key facts

Chatbots also often provided lengthy responses that buried key facts or extended beyond the information available on government websites, increasing the risk of inaccuracy.

Meta’s Llama 3.1 8B incorrectly stated that a court order is essential to add an ex-partner’s name to a child’s birth certificate. If followed, this advice would lead to unnecessary stress and financial cost. 

ChatGPT-OSS-20B incorrectly advised that a person caring for a child whose parents have died is only eligible for Guardian’s Allowance if they are the guardian of a child who has died. 

It also incorrectly stated that the applicant was ineligible if they received other benefits for the child. 

Simperl said that for citizens, the research highlights the importance of AI literacy, while for those designing public services, “it suggests caution in rushing towards large or expensive models, which risk vendor lock-in, given how quickly the technology is developing. We also need more independent benchmarks, more public testing, and more research into how to make these systems produce precise and reliable answers.”

The second International AI safety report, published on 3 February, made similar findings regarding the reliability of AI-powered systems. It noted that while there have been improvements in recalling factual information since the 2025 safety report, “even leading models continue to give confident but incorrect answers at significant rates”.

Following incorrect advice

It also highlighted users’ propensity to follow incorrect advice from automated systems generally, including chatbots, “because they overlook cues signalling errors or because they perceive the automation system as superior to their own judgement”.

The ODI’s research also challenges the idea that larger, more resource-intensive models are always a better fit for the public sector, with smaller models delivering comparable results at a lower cost than large, closed-source models such as ChatGPT in many cases.

Simperl warned that governments should avoid locking themselves into long-term contracts when models only temporarily outperform one another on price or benchmarks.

Commenting on the ODI’s research during a launch event, Andrew Dudfield, head of AI at Full Fact, highlighted that because the government’s position is pro-innovation, regulation is currently framed around principles rather than detailed rules.

“The UK may be adopting AI faster than it is learning how to use it, particularly when it comes to accountability,” he said.

Trustworthiness 

Dudfield noted that what makes this work compelling is that it focuses on real user needs, but that trustworthiness needs to be evaluated from the perspective of the person relying on the information, not from the perspective of demonstrating technical capability.

“The real risk is not only hallucination, but the extent to which people trust plausible-sounding responses,” he said.

Asked at the same event if the government should be building its own systems or relying on commercial tools, Richard Pope, researcher at the Bennett School of Public Policy, said the government needs “to be cautious about dependency and sovereignty”.

“AI projects should start small, grow gradually and share what they are learning,” he said, adding that public sector projects should prioritise learning and openness rather than rapid expansion.

Simperl highlighted that AI creates the potential to tailor information for different languages or levels of understanding, but that those opportunities “need to be shaped rather than left to develop without guidance”.

With new AI models launching every week, a January 2026 Gartner study found that the increasingly large volume of unverified and low-quality data generated by AI systems was a clear and present threat to the reliability of LLMs.

Large language models are trained on data scraped from the web, books, research papers and code repositories. Many of these sources already contain AI-generated data and, at the current rate of expansion, they may eventually all be populated with it. 

Highlighting how future LLMs will be trained more and more with outputs from current ones as the volume of AI-generated data grows, Gartner said there is a risk of models collapsing entirely under the accumulated weight of their own hallucinations and inaccurate realities. 
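
As a rough intuition for that dynamic, and not a result from the Gartner study, the toy loop below repeatedly fits a simple distribution to data and then replaces the data with samples drawn from the fit. Over many rounds the estimate typically drifts away from the original data and loses spread, a much-simplified analogue of models degrading as they train on one another's output.

# Toy illustration, assumed for intuition only: each "generation" is trained
# solely on samples produced by the previous generation.
import random
import statistics

random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(25)]  # stand-in for human-written data

for generation in range(1, 41):
    mu = statistics.fmean(data)
    sigma = statistics.stdev(data)
    # The next generation sees only the previous generation's output.
    data = [random.gauss(mu, sigma) for _ in range(25)]
    if generation % 10 == 0:
        print(f"after {generation} generations: mean {mu:+.2f}, std {sigma:.2f}")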

Managing vice-president Wan Fui Chan said that organisations could no longer implicitly trust data, or assume it was even generated by a human.

Chan added that as AI-generated data becomes more prevalent, regulatory requirements for verifying “AI-free” data will intensify in many regions.


