Anthropic unveils new Claude AI models and ‘computer control’

Anthropic has announced upgrades to its AI portfolio, including an enhanced Claude 3.5 Sonnet model and the introduction of Claude 3.5 Haiku, alongside a “computer control” feature in public beta.

The upgraded Claude 3.5 Sonnet shows improvements across the board, with the most notable advances in coding. The model scored 49.0% on the SWE-bench Verified benchmark, surpassing all other publicly available models, including OpenAI’s offerings and specialist coding systems.

In a pioneering development, Anthropic has introduced computer use functionality that enables Claude to interact with computers similarly to humans: viewing screens, controlling cursors, clicking, and typing. This capability, currently in public beta, marks Claude 3.5 Sonnet as the first frontier AI model to offer such functionality.

Several major technology firms have already begun implementing these new capabilities.

“The upgraded Claude 3.5 Sonnet represents a significant leap for AI-powered coding,” reports GitLab, which noted up to 10% stronger reasoning across use cases without additional latency.

The new Claude 3.5 Haiku model, set for release later this month, matches the performance of the previous Claude 3 Opus whilst maintaining cost-effectiveness and speed. It scored 40.6% on SWE-bench Verified, outperforming many competing models, including the original Claude 3.5 Sonnet and GPT-4o.


Regarding computer control capabilities, Anthropic has taken a measured approach, acknowledging current limitations whilst highlighting potential. On the OSWorld benchmark, which evaluates computer interface navigation, Claude 3.5 Sonnet achieved 14.9% in screenshot-only tests, significantly outperforming the next-best system’s 7.8%.

The releases have undergone rigorous safety evaluations, with pre-deployment testing conducted in partnership with both the US and UK AI Safety Institutes. Anthropic maintains that the ASL-2 Standard, as detailed in its Responsible Scaling Policy, remains appropriate for these models.

(Image Credit: Anthropic)

