AI research

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems

A new study by researchers at Google DeepMind and University College London reveals how large language models (LLMs) form, maintain and lose confidence in their answers. The findings reveal […]

OpenAI, Google DeepMind and Anthropic sound alarm: ‘We may be losing the ability to understand AI’

Scientists from OpenAI, Google DeepMind, Anthropic and Meta have abandoned their fierce corporate rivalry to issue a joint warning about artificial intelligence safety. More than 40 researchers across these competing companies published a research paper today arguing that a brief window to monitor AI reasoning could close forever — and soon. The unusual cooperation comes

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

Researchers at the University of Illinois Urbana-Champaign and the University of Virginia have developed a new model architecture that could lead to more robust AI systems with more powerful

New 1.5B router model achieves 93% accuracy without costly retraining

Researchers at Katanemo Labs have introduced Arch-Router, a new routing model and framework designed to intelligently map user queries to the most suitable large language model (LLM). For enterprises

TreeQuest by Sakana AI: Create multi-model teams that outperform individual LLMs by 30%

Japanese AI lab Sakana AI has developed a new method that enables multiple large language models (LLMs) to work together

Outset raises $17M to replace human interviewers with AI agents for enterprise research

Outset, a San Francisco startup that uses artificial intelligence to conduct market research interviews, has raised $17 million in Series A funding to accelerate adoption of its AI-moderated research platform among Fortune

AlphaOne gives AI developers a new dial to control LLM ‘thinking’ and boost performance

A new framework from researchers at the University of Illinois, Urbana-Champaign, and the University of California, Berkeley gives developers more control over how large language models (LLMs) “think,” improving their reasoning capabilities

Meet AlphaEvolve, the Google AI that writes its own code—and just saved millions in computing costs
