Anthropic’s latest flagship is aimed squarely at software work
Anthropic has released Claude Opus 4.7 as a direct upgrade to Opus 4.6, positioning the model as a more capable system for autonomous coding and complex technical work. The biggest headline is a substantial gain on the SWE-bench Pro coding benchmark, where Opus 4.7 scored 64.3 percent versus 53.4 percent for Opus 4.6.
The report also says that figure puts the model ahead of OpenAI’s GPT-5.4 at 57.7 percent on the same benchmark, while still trailing Anthropic’s own Claude Mythos Preview at 77.8 percent. That framing matters. The company is not presenting Opus 4.7 as its absolute peak experimental system, but as a production-facing model that materially improves on its immediate predecessor in a commercially important area: software engineering.
For enterprise buyers and development teams, coding performance is one of the clearest AI product differentiators because it maps directly to time saved, bug reduction, and the ability to automate well-scoped engineering work. Anthropic’s announcement suggests the company is continuing to compete by improving practical output quality rather than relying on a broad marketing reset.
Instruction-following and vision both move forward
Anthropic also says Opus 4.7 follows instructions more precisely than Opus 4.6. That sounds incremental, but it can have real consequences in production. The report notes that prompts written for older models may now produce unexpected results, because the new system interprets instructions more literally rather than loosely interpreting them or skipping parts outright.
That kind of change cuts both ways. Better adherence can make model behavior more reliable when prompts are well written, but it can also expose weak prompt design that previously went unnoticed. In practice, teams upgrading to Opus 4.7 may need to revisit existing prompts, guardrails, and evaluation flows instead of assuming drop-in parity.
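What that revisiting looks like in practice can be as simple as a regression harness that replays production prompts against both model versions and checks the outputs against basic assertions. The sketch below uses the Anthropic Python SDK; the model IDs and the test case are placeholders for illustration, not confirmed identifiers.

```python
# Minimal prompt-regression sketch. The model IDs are placeholders,
# not confirmed identifiers; substitute the ones from your account.
from anthropic import Anthropic

client = Anthropic()  # reads ANTHROPIC_API_KEY from the environment

OLD_MODEL = "claude-opus-4-6"  # placeholder ID
NEW_MODEL = "claude-opus-4-7"  # placeholder ID

# Each case pairs a production prompt with a simple substring assertion.
TEST_CASES = [
    {"prompt": "List exactly three risks in this deployment plan: ...",
     "must_contain": "3."},
]

def run(model: str, prompt: str) -> str:
    response = client.messages.create(
        model=model,
        max_tokens=512,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text

for case in TEST_CASES:
    for label, model in (("old", OLD_MODEL), ("new", NEW_MODEL)):
        output = run(model, case["prompt"])
        status = "pass" if case["must_contain"] in output else "FAIL"
        print(f"{label} ({model}): {status}")
```

Even a thin harness like this can surface cases where a more literal reading changes output shape before the upgrade reaches production.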
Vision is another area of notable change. According to the report, the model now processes images up to 2,576 pixels on the long edge, or roughly 3.75 megapixels, which Anthropic says is more than triple what earlier Claude models could handle. The company links that to better performance for computer-use agents reading dense screenshots and for extracting information from complex diagrams.
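For teams piping screenshots or diagrams through an API, the practical takeaway is a preprocessing step. Below is a minimal sketch using Pillow that scales an image so its long edge fits the reported cap; the 2,576-pixel figure comes from the report and is treated here as an assumption, not a confirmed API limit.

```python
# Sketch: downscale an image so its long edge fits the reported cap.
from PIL import Image

LONG_EDGE_CAP = 2576  # pixels; figure from the report, not a confirmed limit

def fit_to_cap(src_path: str, dst_path: str) -> None:
    img = Image.open(src_path)
    long_edge = max(img.size)
    if long_edge > LONG_EDGE_CAP:
        scale = LONG_EDGE_CAP / long_edge
        new_size = (round(img.width * scale), round(img.height * scale))
        img = img.resize(new_size, Image.Resampling.LANCZOS)
    img.save(dst_path)
```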
The article cites an increase on the OfficeQA Pro document reasoning benchmark, from 57.1 percent with Opus 4.6 to 80.6 percent with Opus 4.7. It also describes gains in biomolecular reasoning and visual navigation on ScreenSpot-Pro. Taken together, those changes suggest Anthropic is treating visual understanding not as a side feature but as a core part of the model’s usefulness in office, technical, and agentic workflows.
Anthropic is making safety tradeoffs explicit
One of the more unusual details in the release is not a capability gain but a deliberate restriction. The report says Anthropic tried during training to reduce risky cybersecurity capabilities and now automatically blocks related requests. That makes Opus 4.7 notable not just for being more capable overall, but for being selectively less capable in an area the company considers dangerous.
This is an important signal for the market. Many frontier model announcements focus on raw gains first and policy language second. Here, Anthropic appears to be foregrounding the idea that higher-performing models do not need to advance equally across every domain. The product message is that stronger coding assistance and stronger vision do not have to come with unrestricted cyber behavior.
Whether customers view that as a feature or a limitation will depend on use case. For mainstream software development, the company is betting the answer is clear: safer boundaries around cyber-related behavior are acceptable if coding quality still rises sharply.
The pricing note may matter as much as the benchmark gain
The report says per-token pricing remains unchanged, but it adds a consequential caveat: a new tokenizer can map the same text to as much as 35 percent more tokens. That means the effective cost of a request can increase even when the published token price does not.
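The arithmetic is simple but worth making explicit. In the sketch below, only the 35 percent inflation figure comes from the report; the per-token price and workload size are placeholders, not Anthropic's actual rate card.

```python
# Effective-cost illustration. Only the 35% figure comes from the report;
# the price and token count are placeholders.
PRICE_PER_MTOK = 15.00   # hypothetical dollars per million input tokens
old_tokens = 1_000_000   # tokens a workload used under the old tokenizer
inflation = 1.35         # new tokenizer can emit up to 35% more tokens

new_tokens = old_tokens * inflation
old_cost = old_tokens / 1_000_000 * PRICE_PER_MTOK
new_cost = new_tokens / 1_000_000 * PRICE_PER_MTOK

print(f"old: ${old_cost:.2f}, new: ${new_cost:.2f}, "
      f"increase: {new_cost / old_cost - 1:.0%}")
# old: $15.00, new: $20.25, increase: 35%
```

The same posted price, applied to a larger token count, raises the bill for an identical task.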
Details like that are easy to miss and difficult for buyers to ignore. Organizations evaluating AI models increasingly care about real workload economics, not just posted rate cards. If tokenization changes increase billable usage, then benchmarking a new model requires measuring accuracy, latency, and cost together.
In other words, Claude Opus 4.7 may be meaningfully better, but it may not be meaningfully cheaper for a given task. That does not undercut the release, but it does push the conversation from headline performance to operational value.
A product release aimed at serious users
Taken as a whole, Claude Opus 4.7 is a focused release: better autonomous coding, better image handling, more literal compliance with prompts, and a clearer attempt to suppress dangerous cyber behavior. It is not being sold as a vague leap in intelligence. It is being sold as a more useful technical system.
That makes the launch noteworthy. The AI market is moving past generalized claims and toward sharper product distinctions. Anthropic’s latest move suggests one of those distinctions will be the willingness to improve high-value capabilities while intentionally constraining others.
This article is based on reporting by The Decoder.
Originally published on the-decoder.com