Daily Guardian UAEDaily Guardian UAE
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
What's On

THE BENCH RESTRUCTURES LEADERSHIP TO SUPPORT GLOBAL GROWTH AND INVESTOR ENGAGEMENT

January 14, 2026

Your Galaxy lock screen just got hundreds of Samsung unlock animations

January 14, 2026

Leak says your entry Galaxy S26 won’t be stuck at 25W

January 14, 2026

Your Android 17 Quick Settings could get two big upgrades

January 14, 2026

Google’s Veo just got better at making videos for your social feeds

January 14, 2026
Facebook X (Twitter) Instagram
Finance Pro
Facebook X (Twitter) Instagram
Daily Guardian UAE
Subscribe
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
Daily Guardian UAEDaily Guardian UAE
Home » ChatGPT’s latest model may be a regression in performance
Technology

ChatGPT’s latest model may be a regression in performance

By dailyguardian.aeNovember 22, 20242 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

According to a new report from Artificial Analysis, OpenAI’s flagship large language model for ChatGPT, GPT-4o, has significantly regressed in recent weeks, putting the state-of-the-art model’s performance on par with the far smaller, and notably less capable, GPT-4o-mini model.

This analysis comes less than 24 hours after the company announced an upgrade for the GPT-4o model. “The model’s creative writing ability has leveled up–more natural, engaging, and tailored writing to improve relevance & readability,” OpenAI wrote on X. “It’s also better at working with uploaded files, providing deeper insights & more thorough responses.” Whether those claims continue to hold up is now being cast in doubt.

“We have completed running our independent evals on OpenAI’s GPT-4o release yesterday and are consistently measuring materially lower eval scores than the August release of GPT-4o,” the Artificial Analysis announced via an X post on Thursday, noting that the model’s Artificial Analysis Quality Index decreased from 77 to 71 (and is now equal to that of GPT-4o mini).

What’s more, GPT-4o’s performance on the GPQA Diamond benchmark decreased from 51% to 39% while its MATH benchmarks decreased from 78% to 69%.

Simultaneously, the researchers discovered more than a doubling in the speed increase of the model’s responses, accelerating from around 80 output tokens per second to roughly 180 tokens/s. “We have generally observed significantly faster speeds on launch day for OpenAI models (likely due to OpenAI provisioning capacity ahead of adoption), but previously have not seen a 2x speed difference,” the researchers wrote.

Wait – is the new GPT-4o a smaller and less intelligent model?

We have completed running our independent evals on OpenAI’s GPT-4o release yesterday and are consistently measuring materially lower eval scores than the August release of GPT-4o.

GPT-4o (Nov) vs GPT-4o (Aug):
➤… pic.twitter.com/gjY2pBFuUv

— Artificial Analysis (@ArtificialAnlys) November 21, 2024

“Based on this data, we conclude that it is likely that OpenAI’s Nov 20th GPT-4o model is a smaller model than the August release,” they continued. “Given that OpenAI has not cut prices for the Nov 20th version, we recommend that developers do not shift workloads away from the August version without careful testing.”

GPT-4o was first released in May 2024 to surpass the existing GPT-3.5 and GPT-4 models. GPT-4o offers state-of-the-art benchmark results in voice, multilingual, and vision tasks, according to OpenAI, making it ideal for advanced applications like real-time translation and conversational AI.











Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Keep Reading

Your Galaxy lock screen just got hundreds of Samsung unlock animations

Leak says your entry Galaxy S26 won’t be stuck at 25W

Your Android 17 Quick Settings could get two big upgrades

Google’s Veo just got better at making videos for your social feeds

You might get an OpenAI ear wearable, and it’s not just earbuds

Your first NVIDIA N1X laptop could come from Dell

Piece by piece, SpaceX preps first Starship flight from Space Coast

Scientists are teaching OLED screens how to shine smarter

AI chatbots still struggle with news accuracy, study finds

Editors Picks

Your Galaxy lock screen just got hundreds of Samsung unlock animations

January 14, 2026

Leak says your entry Galaxy S26 won’t be stuck at 25W

January 14, 2026

Your Android 17 Quick Settings could get two big upgrades

January 14, 2026

Google’s Veo just got better at making videos for your social feeds

January 14, 2026

Subscribe to News

Get the latest UAE news and updates directly to your inbox.

Latest Posts

Standard Chartered Global Market Outlook 2026

January 14, 2026

Second Edition of the UAE Libraries Forum Kicks Off

January 14, 2026

You might get an OpenAI ear wearable, and it’s not just earbuds

January 14, 2026
Facebook X (Twitter) Pinterest TikTok Instagram
© 2026 Daily Guardian UAE. All Rights Reserved.
  • Privacy Policy
  • Terms
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.