anthropic opus - Search News

News

Qodo Command Enters AI Coding Agent Wars With 71.2% SWE-Bench Score

AI startup Qodo has entered the fierce “benchmark war” for coding supremacy. On August 11, the company announced its new agent, Qodo Command, scored an impressive 71.2% on the SWE-bench Verified test.

13m

ChatGPT-5 underwhelming you? Here’s what it can do that older models couldn’t—and where other AI chatbots still shine

Last Thursday, OpenAI launched the latest version of its hyper-popular AI chatbot, ChatGPT. Sam Altman, OpenAI’s CEO, made ...

58m

OpenAI's performance charts in the GPT-5 launch video are such a mess you have to think GPT-5 itself probably made them, and the company's attempted fixes raise even more questions

OpenAI has since posted some updated charts on its website. The new deception rate chart certainly suggests that a mere mistake was made. The revised stats show GPT-5's coding deception rate at 16.5%, ...

The Information58m

Anthropic’s New Safety Lead; The Startup Making Nvidia’s Software Easier To Use

We might be a long way away from sipping Pina Coladas on a beach while AI-powered humanoid robots handle all our work. But we ...

6hOpinion

No, the OpenAI GPT-5 Release Wasn’t a Disaster. But It Is a Threat

GPT-5 isn’t perfect, or even great, but its release may signal a sneaky shift in the AI market war. “OpenAI GPT-5 is going to ...

Analytics India Magazine9h

Claude Can Now Reference Your Past Chats

Anthropic has announced that Claude, its conversational AI platform, can now use a user’s past conversations to provide more ...

MacStories13h

Claude’s Chat History and App Integrations as a Form of Lock-In

Earlier today, Anthropic announced that, similar to ChatGPT, Claude will be able to search and reference your previous chats with it. From their support document: You can now prompt Claude to search ...

15h

GenAI tools are acting more ‘alive’ than ever; they blackmail people, replicate, and escape

In tests, generative AI systems showed signs of self-preservation that experts say could spiral out of control.

Gadget16h

Signpost: Another week, another AI revolution

In the space of 24 hours, OpenAI, Anthropic and Google DeepMind redefined the limits of artificial intelligence, collapsing years of progress into a single day, writes ARTHUR GOLDSTUCK.

Unite.AI19h

The 75% Ceiling: Have AI Models Reached Peak Performance With Current Methods?

Anthropic and OpenAI unveiled frontier AI models two days apart, with both achieving virtually identical 74-75% accuracy on industry coding benchmarks, signaling a potential performance ceiling for ...

AI Giant With Highest Staff Retention Rate Is Not Google or Meta

Researchers have found that 80% of Anthropic employees hired between 2021 and early 2023 were still at the AI company two ...

RCR Wireless News1d

AWS doubles up on enterprise AI with Anthropic, OpenAI deals

AWS has expanded its AI offerings with Anthropic’s Claude 4 on Bedrock and OpenAI’s open weight models on Bedrock and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results