Daily Guardian UAEDaily Guardian UAE
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
What's On

Google will let some Chromebooks transition into a Googlebook experience soon

May 13, 2026

Zoho Report: Cybersecurity Challenges and Zero Trust in UAE

May 13, 2026

The Portable Chargers We’d Actually Buy This Spring

May 13, 2026

MAAIA Accelerates Construction Progress Across La Clé and La Vue, Reaffirms Q2 2027 Handover

May 13, 2026

Disney+ just confirmed the VisionQuest release date, finally moving the WandaVision trilogy toward its conclusion

May 13, 2026
Facebook X (Twitter) Instagram
Finance Pro
Facebook X (Twitter) Instagram
Daily Guardian UAE
Subscribe
  • Home
  • UAE
  • What’s On
  • Business
  • World
  • Entertainment
  • Lifestyle
  • Sports
  • Technology
  • Travel
  • Web Stories
  • More
    • Editor’s Picks
    • Press Release
Daily Guardian UAEDaily Guardian UAE
Home » Salesforce Announces the World’s First LLM Benchmark for CRM
What's On

Salesforce Announces the World’s First LLM Benchmark for CRM

By dailyguardian.aeJune 22, 20245 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

New benchmark and leaderboard give businesses the guidance they need to make smart decisions when evaluating generative AI models for their CRM systems

UAE, 21 June 2024 – Salesforce announced the world’s first LLM benchmark for CRM to help businesses evaluate the rapidly growing number of large language models (LLMs) for use in their customer relationship management (CRM) systems. 

The new benchmark is a comprehensive evaluation framework that measures the performance of LLMs against four key measures: accuracy, cost, speed, and trust and safety. It’s been specifically designed to evaluate common sales and service use cases, including prospecting, lead nurturing, as well as sales opportunity and service case summaries. The benchmark also includes a public leaderboard to help professionals decide which LLM is best for their CRM needs. Salesforce will continue to incorporate new use case scenarios into the benchmark and enhance its evaluation of LLMs, which will soon include fine-tuned LLMs. 

“As AI continues to evolve, enterprise leaders are saying it’s important to find the right mix of performance, accuracy, responsibility, and cost to unlock the full potential of generative AI to drive business growth,” said Silvio Savarese, EVP & Chief Scientist, Salesforce AI Research. “Salesforce’s new LLM Benchmark for CRM is a significant step forward in the way businesses assess their AI strategy within the industry. It not only provides clarity on next-generation AI deployment but also can accelerate time to value for CRM-specific use cases. Our commitment is to continuously evolve this benchmark to keep pace with technological advancements, ensuring it remains relevant and valuable.” 

Why it matters: Existing LLM benchmarks have been limited to academic and consumer use cases, with very little business relevance. They also lack adequate expert human evaluations and fail to address accuracy, speed, cost, and trust considerations. These deficiencies have left CRM customers lacking a reliable way to gauge the effectiveness of generative AI-powered CRM solutions. Without a clear sense of how LLMs perform across those metrics for specific use cases, businesses are left to make decisions in the dark. 

Dive deeper: Developed by Salesforce AI Research, the benchmark uniquely uses real-world CRM data, and also uniquely makes use of expert human evaluations by practitioners. This enables businesses to use the benchmark to make more strategic decisions about how to incorporate generative AI into their CRM systems, with specific attention to: 

  1. Accuracy: This metric comprises four subcategories: factuality, completeness, conciseness, and instruction-following. The more accurate the predictions or recommendations, the more valuable the results are to teams across the organization. And the more valuable the results, the better the actions they can take to improve customer experience. If a model is accurate enough for a use case, it’s also important to consider the other metrics. Even if the model isn’t accurate enough, techniques like prompt engineering and fine-tuning can improve it. 
  2. Cost: The cost metric is categorized as high, medium, and low, based on percentiles. It’s the estimated operational cost that varies by CRM use case. Customers can evaluate the cost-effectiveness of different LLMs to ensure they align with their budget and resource allocation strategies.
  3. Speed: This metric assesses the LLM’s responsiveness and efficiency in processing and delivering information. Faster response times enhance the user experience, reduce wait times for customers, and enable sales and service teams to address inquiries and issues promptly.
  4. Trust and Safety: This metric measures the LLM’s capability to shield sensitive customer data, adhere to data privacy regulations, secure information, and refrain from bias and toxicity for CRM use cases. By assessing the reliability of LLMs for CRM, this benchmark gives organizations a sense of transparency regarding trust and safety.

Organizations can use this benchmark to compare LLMs, identify the best solution, and make more informed decisions that will deliver customer success and propel their business forward.  

And, with Salesforce’s Einstein 1 Platform, customers can choose from existing LLMs or bring their own models to meet their unique business needs. By selecting models for their CRM use cases using the benchmark, businesses can deploy more effective and efficient generative AI solutions. 

“Business organizations are looking to utilize AI to drive growth, cut costs, and deliver personalized customer experiences, not to plan a kid’s birthday party or summarize Othello,” said Clara Shih, CEO of Salesforce AI. “Our customers have been asking for a purpose-built way to evaluate and select from among the proliferation of new AI models, and we are thrilled to introduce the world’s first LLM benchmark for CRM to help them navigate the complex landscape of models. This benchmark is not just a measure; it’s a comprehensive, dynamically evolving framework that empowers companies to make informed decisions, balancing accuracy, cost, speed, and trust.”

Learn more:

“The information provided in this press release does not, and is not intended to, constitute an endorsement of any particular LLM; instead, all information, content, and materials available are for general informational purposes only. Readers should make their own determinations based on their needs. Opinions of the referenced presenters and/or author are their own and do not necessarily reflect the official position of Salesforce.”

                                               – End –

About Salesforce

Salesforce empowers companies of every size and industry to connect with their customers in a whole new way through the power of AI + data + CRM.

For more information about Salesforce (NYSE: CRM), visit: www.salesforce.com.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Keep Reading

Zoho Report: Cybersecurity Challenges and Zero Trust in UAE

MAAIA Accelerates Construction Progress Across La Clé and La Vue, Reaffirms Q2 2027 Handover

Whitewill Q1 2026 report records AED 139.2 billion in Dubai real estate transactions

NCCCL Expands in UAE’s Booming Construction Market

UAE Economy Resilience: SIB’s AED 2.59 Billion Rights Issue Success

SAP Connect UAE Highlights Role of Core Business Systemsin Scaling Enterprise AI

Thumbay Group Launches First Private Psychiatric Hospital in Sharjah

ANAROCK Expands in Middle East with Key Leadership Changes

EGA and ADNOC L&S Forge Strategic Logistics Partnership

Editors Picks

Zoho Report: Cybersecurity Challenges and Zero Trust in UAE

May 13, 2026

The Portable Chargers We’d Actually Buy This Spring

May 13, 2026

MAAIA Accelerates Construction Progress Across La Clé and La Vue, Reaffirms Q2 2027 Handover

May 13, 2026

Disney+ just confirmed the VisionQuest release date, finally moving the WandaVision trilogy toward its conclusion

May 13, 2026

Subscribe to News

Get the latest UAE news and updates directly to your inbox.

Latest Posts

Whitewill Q1 2026 report records AED 139.2 billion in Dubai real estate transactions

May 13, 2026

Samsung’s next Galaxy Z foldables will give you plenty of AI love with Gemini Intelligence

May 13, 2026

NCCCL Expands in UAE’s Booming Construction Market

May 13, 2026
Facebook X (Twitter) Pinterest TikTok Instagram
© 2026 Daily Guardian UAE. All Rights Reserved.
  • Privacy Policy
  • Terms
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.