benchmarks

Mistral just updated its open source Small model from 3.1 to 3.2: here’s why

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more French AI darling Mistral is keeping the new releases coming this summer. Just days after announcing its own domestic AI-optimized cloud service Mistral Compute, the well-funded company has released an update to […]

Mistral just updated its open source Small model from 3.1 to 3.2: here’s why Read More »

Just add humans: Oxford medical study underscores the missing link in chatbot testing

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Headlines have been blaring it for years: Large language models (LLMs) can not only pass medical licensing exams but also outperform humans. GPT-4 could correctly answer U.S. medical exam licensing questions 90%

Just add humans: Oxford medical study underscores the missing link in chatbot testing Read More »

en_USEnglish