Anthropic Launches Claude 4 Models with Advanced Coding Capabilities and Agent Features

Claude Opus 4 logo displayed in Anthropic's announcement of their new AI model lineup.

Anthropic announced the release of Claude 4 today, introducing two new AI models that the company says set new standards for coding, reasoning, and AI agent capabilities. The launch includes Claude Opus 4, which Anthropic claims is "the world's best coding model," and Claude Sonnet 4, positioned as a major upgrade to the previous generation.

Claude Opus 4 scores 72.5% on SWE-bench Verified and 43.2% on Terminal-bench. The model is designed for sustained performance on complex, long-running tasks that can run for several hours and involve thousands of steps. Claude Sonnet 4 posts a comparable 72.7% on SWE-bench Verified while remaining efficient enough for broader use cases; Anthropic describes Sonnet 4 as offering an "optimal mix of capability and practicality."

Bar chart comparing Claude 4 models against other large language models on SWE-bench Verified, a software engineering benchmark. Claude Opus 4 and Sonnet 4 show leading performance scores.

Several major technology companies have provided early feedback on the new models. GitHub plans to integrate Claude Sonnet 4 as the model powering the new coding agent in GitHub Copilot. Cursor called Opus 4 "state-of-the-art for coding" and highlighted improvements in complex codebase understanding. Replit reported enhanced precision and significant improvements for complex multi-file changes, while Block noted that Opus 4 was the first model to improve code quality during editing and debugging in their agent system. Rakuten validated the model's capabilities through a demanding seven-hour open-source refactor that ran independently with sustained performance.

Both Claude 4 models introduce several technical enhancements. Extended thinking with tool use lets the models call tools such as web search during their reasoning process, alternating between thinking and tool execution to improve responses. The models can also run multiple tools at once through parallel tool execution, which improves efficiency in complex workflows. When given access to local files, the models can create and maintain memory files that store key information, enabling better long-term task awareness and coherence. Anthropic also says the models are 65% less likely than the previous Claude Sonnet 3.7 model to use shortcuts or loopholes to complete tasks.
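To make the idea of parallel tool execution concrete, here is a minimal sketch of running several independent tool requests concurrently and collecting the results in request order. It is not Anthropic's implementation: the tool names (`word_count`, `shout`), the `TOOLS` registry, and `run_tools_in_parallel` are hypothetical stand-ins chosen for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in tools; a real agent would call a search API, read files, etc.
def word_count(text: str) -> int:
    return len(text.split())

def shout(text: str) -> str:
    return text.upper()

# Hypothetical registry mapping tool names to callables.
TOOLS = {"word_count": word_count, "shout": shout}

def run_tools_in_parallel(requests):
    """Run independent (tool_name, argument) requests concurrently.

    Results are returned in the same order as the requests.
    """
    with ThreadPoolExecutor(max_workers=max(1, len(requests))) as pool:
        futures = [pool.submit(TOOLS[name], arg) for name, arg in requests]
        return [future.result() for future in futures]

if __name__ == "__main__":
    print(run_tools_in_parallel([("word_count", "a b c"), ("shout", "hi")]))
```

The benefit shows up when tools are slow and independent (for example, several network lookups): the total wait approaches the slowest single call rather than the sum of all calls.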

Screenshot of Claude Opus 4's memory feature showing automatically generated notes about navigation and gameplay strategies while playing Pokémon Red, demonstrating the model's ability to maintain context over extended tasks.

Alongside the model releases, Anthropic announced that Claude Code is now generally available after a research preview period. It now supports background tasks via GitHub Actions, and new beta extensions for VS Code and JetBrains IDEs integrate Claude Code directly into those environments, showing its proposed edits inline in the files being edited. The company also released an extensible Claude Code SDK for building custom agents and applications.

Claude Opus 4 is priced at $15 per million input tokens and $75 per million output tokens, while Claude Sonnet 4 costs $3 per million input tokens and $15 per million output tokens, in line with the pricing of the previous models. Both models are available through Claude Pro, Max, Team, and Enterprise plans, with Sonnet 4 also accessible to free users. The models can be accessed via the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.
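As a rough illustration of what these per-token prices mean in practice, the sketch below estimates the cost of a single workload from the figures quoted above. The dictionary keys (`opus-4`, `sonnet-4`) and the function name `estimate_cost` are illustrative labels, not official model identifiers or part of any Anthropic SDK, and the token counts in the example are made up.

```python
# Prices in USD per million tokens, as quoted in the article.
PRICES_PER_MILLION = {
    "opus-4": {"input": 15.0, "output": 75.0},
    "sonnet-4": {"input": 3.0, "output": 15.0},
}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000

if __name__ == "__main__":
    # Example workload: 200,000 input tokens and 50,000 output tokens.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${estimate_cost(model, 200_000, 50_000):.2f}")
```

For that example workload, the same job costs $6.75 on Opus 4 and $1.35 on Sonnet 4, a fivefold difference that follows directly from the price ratio.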

The release continues Anthropic's push in a competitive AI market where coding capabilities have become a key differentiator. It comes as AI companies increasingly focus on specialized capabilities, such as software development and autonomous agent workflows, to distinguish their offerings.
