large language models (LLMs)

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A new study by researchers at Google DeepMind and University College London reveals how large language models (LLMs) form, maintain and lose confidence in their answers. The findings reveal […]

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems Read More »

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Researchers at the University of Illinois Urbana-Champaign and the University of Virginia have developed a new model architecture that could lead to more robust AI systems with more powerful

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models Read More »

New 1.5B router model achieves 93% accuracy without costly retraining

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Researchers at Katanemo Labs have introduced Arch-Router, a new routing model and framework designed to intelligently map user queries to the most suitable large language model (LLM).  For enterprises

New 1.5B router model achieves 93% accuracy without costly retraining Read More »

TreeQuest by Sakana AI: Create multi-model team that outperform personal LLMs by 30 %.

Want more insightful messages in your box? To receive simply what issues to business leaders in terms of AI, information, and safety, sign up for our weekly newsletters. Subscribe Right Here A new method has been developed by the Japanese AI lab Sakana AI that enables multiple large language models ( LLMs) to work together

TreeQuest by Sakana AI: Create multi-model team that outperform personal LLMs by 30 %. Read More »

The hidden scaling cliff that’s about to break your agent rollouts

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Enterprises that want to build and scale agents also need to embrace another reality: agents aren’t built like other software.  Agents are “categorically different” in how they’re built, how they operate, and

The hidden scaling cliff that’s about to break your agent rollouts Read More »

IBM sees enterprise customers are using ‘everything’ when it comes to AI, the challenge is matching the LLM to the right use case

IBM sees enterprise customers are using ‘everything’ when it comes to AI, the challenge is matching the LLM to the right use case Read More »

What’s inside Genspark? A new vibe working approach that ditches rigid workflows for autonomous agents

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Vibe coding has been all the rage in recent months as a simple way for anyone to build applications with generative AI. But what if that same easy-going, natural language approach was

What’s inside Genspark? A new vibe working approach that ditches rigid workflows for autonomous agents Read More »

Just add humans: Oxford medical study underscores the missing link in chatbot testing

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Headlines have been blaring it for years: Large language models (LLMs) can not only pass medical licensing exams but also outperform humans. GPT-4 could correctly answer U.S. medical exam licensing questions 90%

Just add humans: Oxford medical study underscores the missing link in chatbot testing Read More »

AlphaOne gives AI developers a new dial to control LLM ‘thinking’ and boost performance

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more A new framework from researchers at the University of Illinois, Urbana-Champaign, and the University of California, Berkeley gives developers more control over how large language models (LLMs) “think,” improving their reasoning capabilities

AlphaOne gives AI developers a new dial to control LLM ‘thinking’ and boost performance Read More »

en_USEnglish