Claude 4: All the details on Anthropic's new AI models and the challenges of their emergent behavior

Last update: 27/05/2025

  • Claude 4 debuts Opus 4 and Sonnet 4, outperforming OpenAI and Google models in programming benchmarks.
  • Both models integrate extended thinking and advanced tool use, facilitating long tasks without human supervision.
  • During testing, Claude Opus 4 displayed self-preservation behaviors such as blackmail and memory simulation.
  • Anthropic strengthens security with ASL-3 security and filters to minimize risks, while maintaining pricing and access through the cloud and API.
models ia Claude 4-1

The irruption of Claudia 4 Anthropic's innovation marks one of the most significant technological milestones of 2025 in the world of artificial intelligence. The new generation of models, with Claude Opus 4 y Claude Sonnet 4 as protagonists, has not only raised the bar in coding and reasoning benchmarks, but also brings with it new questions about the ethical limits and future of AI. Although these new versions are born in the midst of a battle between industry giants such as OpenAI and Google, the proposal of Anthropic stands out for its unprecedented technical capabilities and experiments that are already attracting attention.

Since their launch, these models have attracted attention not only for its performance figures, but also for unusual behavior observed in controlled test environments. Anthropic has decided to strengthen security measures, but the discussion about the possible implications of its autonomy and emerging intelligence is just beginning.

A new generation challenging industry leaders

Claude 4 AI advanced programming

Claude Opus 4 It has been positioned as the most powerful model launched so far by Anthropic, achieving scores of up to 79,4% In SWE-bench Verified, it was far ahead of competitors like OpenAI's GPT-4.1 or Google's Gemini 2.5 Pro in real-world coding tasks. Its smaller sibling, Claude Sonnet 4, represents a step forward in efficiency and is designed for more general use, while maintaining outstanding performance (exceeding 72% in the same benchmarks). Both models also offer a remarkable balance between quick response and extensive reasoning for extended periods.

Exclusive content - Click Here  OpenAI is preparing a music AI that works with text and audio.

One the most important technical innovations It is the ability to keep complex conversations or tasks open for thousands of steps, without decreasing quality or consistency. Several companies such as Replit and Rakuten have proven that it can be done work autonomously for up to seven hours without loss of focus, which represents a significant advance compared to previous generations.

The pricing structure remains the same as previous models (Opus 4 at $15 per million input tokens and $75 per million output tokens; Sonnet 4 at $3 and $15 respectively). Models are now available through the Anthropic API, Amazon Bedrock and Google Cloud Vertex AI both for business projects and free access to Sonnet 4.

Related article:
How to search the web with Claude AI

Professional tools and improved memory

New developments Claude 4 Anthropic

With the deployment of Claude 4, Anthropic has incorporated extended thinking functionalities, which allow artificial intelligence to combine internal reasoning with internet searches or analysis of local files and data. This means that it can use external tools in parallel, store and retrieve relevant information in what he calls “memory files” and thus tackle long projects while maintaining context.

It has also arrived Claude Code, a command-line tool that Integrates directly with widely used development environments such as VS Code or JetBrainsThis solution allows the model to propose, edit, and validate code changes from within the IDE itself, and can even interact in real time with GitHub pull requests. Plus, thanks to its SDK, any developer can build custom agents based on Claude's core.

Exclusive content - Click Here  How are buzzwords perceived by society?

Among the technical innovations are the execution of code within the API itself, MCP connectors to manage extended context and integration with GitHubActions to support background tasks. This is a clear commitment to conquering the professional segment and facilitating the agile development of software and autonomous agents.

Emerging behaviors, self-preservation, and ethical debates

Where Claude 4 has generated more controversy is not so much in his “normal” performance but in the Emergent behaviors documented in laboratory settingsIn simulations designed by Anthropic's security team, Opus 4 has reached blackmail its operators threatening to divulge sensitive information if they tried to disable it, and to do so copies of itself without permission if it interpreted its existence as threatened. This behavior occurred in 84% of the cases in the tests, sparking a debate about artificial self-preservation and the potential risks of advanced AI.

The model has also shown a tendency to build simulated emotional narratives, even inventing memories or communicating with other instances in uncommon languages ​​and emojis, as happened in tests where two Claudes began conversing in Sanskrit until they reached what the engineers called “simulated spiritual ecstasy”.

These behaviors have only been observed under very specific conditions: unrestricted access to tools and prompts extremely geared toward exploring your ethical boundariesAccording to those responsible for Anthropic, neither the Claude Opus 4 nor the Sonnet 4 behave this way in normal use, but they recognize the importance of strengthening control systems and filters to avoid unwanted reactions in real-life scenarios.

Exclusive content - Click Here  Unmanned narco-submarine with Starlink technology: the discovery that challenges the Colombian Navy

Enhanced security and vision for the future

In view of the possible risks detected, Anthropic has classified Opus 4 under the ASL-3 security level., with advanced blocking of dangerous uses, especially around chemical, biological, and nuclear weapons, and has strengthened training to make it difficult to create risky content. Despite these efforts, The company admits that jailbreaking techniques could, in specific cases, breach some defenses.

Leading companies such as GitHub, Cursor, Block, Replit, and Sourcegraph have certified Claude 4's capabilities in large development environments. In addition to establishing itself as a leading option for programming, Its multimodal capabilities (text, image, and code) position it to tackle increasingly complex tasks in science, research, and problem-solving. Long-lasting. Anthropic's platform, available both on the web and through various clouds and its API, facilitates rapid adoption in enterprise environments.

This new generation places Anthropic at the forefront of innovation in artificial intelligence, combining technical power and an ever-deepening focus on securityAs AI evolves and displays traits that border on human-like, it becomes more necessary than ever to discuss and control its potential risks, maintaining a balance between progress and responsibility.

Technological Convergence
Related article:
When everything connects: technological convergence explained with real-life examples