OpenAI releases GDPval, a benchmark to test AI performance on “economically valuable, real-world tasks”, and says Claude Opus 4.1 was the best-performing model (Maxwell Zeff/TechCrunch)

    Snarful Solutions Group, LLC.