Daily Guardian UAEDaily Guardian UAE
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
What's On

A Simple 5-minute Test Could Help Identify Heart Attack Risk Early, Say RAK Hospital Cardiologists

January 22, 2026

This AI creativity study says you still beat it, if you’re top tier

January 22, 2026

Arada sees sales triple in 2025 to pass AED17 billion, with over 5,000 units sold in the UAE

January 22, 2026

Your next budget workstation GPU may be Intel Arc Pro B70

January 22, 2026

HONOR EMPOWERS EVERYDAY CREATORS THROUGH “MASTER THE LIGHT” PHOTOGRAPHY MASTERCLASS AT DUBAI MALL

January 22, 2026
Facebook X (Twitter) Instagram
Finance Pro
Facebook X (Twitter) Instagram
Daily Guardian UAE
Subscribe
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
Daily Guardian UAEDaily Guardian UAE
Home » Google finds AI chatbots are only 69% accurate… at best
Technology

Google finds AI chatbots are only 69% accurate… at best

By dailyguardian.aeDecember 16, 20252 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Google has published a blunt assessment of how reliable today’s AI chatbots really are, and the numbers are not flattering. Using its newly introduced FACTS Benchmark Suite, the company found that even the best AI models struggle to break past a 70% factual accuracy rate. The top performer, Gemini 3 Pro, reached 69% overall accuracy, while other leading systems from OpenAI, Anthropic, and xAI scored even lower. The takeaway is simple and uncomfortable. These chatbots still get roughly one out of every three answers wrong, even when they sound confident doing it.

The benchmark matters because most existing AI tests focus on whether a model can complete a task, not whether the information it produces is actually true. For industries like finance, healthcare, and law, that gap can be costly. A fluent response that sounds confident but contains errors can do real damage, especially when users assume the chatbot knows what it is talking about.

What Google’s accuracy test reveals

The FACTS Benchmark Suite was built by Google’s FACTS team with Kaggle to directly test factual accuracy across four real-world use. One test measures parametric knowledge, which checks whether a model can answer fact-based questions using only what it learned during training. Another evaluates search performance, testing how well models use web tools to retrieve accurate information. A third focuses on grounding, meaning whether the model sticks to a provided document without adding false details. The fourth examines multimodal understanding, such as reading charts, diagrams, and images correctly.

ai-accuracy-rankings-by-facts-google

The results show sharp differences between models. Gemini 3 Pro led the leaderboard with a 69% FACTS score, followed by Gemini 2.5 Pro and OpenAI’s ChatGPT-5 nearly at 62% percent. Claude 4.5 Opus landed at ~51% percent, while Grok 4 scored ~54%. Multimodal tasks were the weakest area across the board, with accuracy often below 50%. This matters because these tasks involve reading charts, diagrams, or images, where a chatbot could confidently misread a sales graph or pull the wrong number from a document, leading to mistakes that are easy to miss but hard to undo.

The takeaway isn’t that chatbots are useless, but blind trust is risky. Google’s own data suggests AI is improving, yet it still needs verification, guardrails, and human oversight before it can be treated as a reliable source of truth.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Keep Reading

This AI creativity study says you still beat it, if you’re top tier

Your next budget workstation GPU may be Intel Arc Pro B70

AT&T’s new Turbo Live service aims to keep your phone usable at crowded events

The iPhone Air 2 could arrive this fall, but don’t expect big changes

Apple might launch an even more powerful AirPods Pro version this year

If your workload eats memory, this MacBook Pro is the smart configuration

Rokid’s AI glasses offer a more affordable route to wearables than Meta Ray-Ban

Apple plans to turn Siri into a full AI chatbot to take on ChatGPT and Gemini

You can now turn PDFs into podcasts and slides with Adobe’s new AI feature

Editors Picks

This AI creativity study says you still beat it, if you’re top tier

January 22, 2026

Arada sees sales triple in 2025 to pass AED17 billion, with over 5,000 units sold in the UAE

January 22, 2026

Your next budget workstation GPU may be Intel Arc Pro B70

January 22, 2026

HONOR EMPOWERS EVERYDAY CREATORS THROUGH “MASTER THE LIGHT” PHOTOGRAPHY MASTERCLASS AT DUBAI MALL

January 22, 2026

Subscribe to News

Get the latest UAE news and updates directly to your inbox.

Latest Posts

AT&T’s new Turbo Live service aims to keep your phone usable at crowded events

January 22, 2026

NEOPAY partners with Nymbl to enable Nymbl QX, a next-generation QR ordering and pay-at- table solution

January 22, 2026

The iPhone Air 2 could arrive this fall, but don’t expect big changes

January 22, 2026
Facebook X (Twitter) Pinterest TikTok Instagram
© 2026 Daily Guardian UAE. All Rights Reserved.
  • Privacy Policy
  • Terms
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.