Claude Opus 4 and 4.1 gain ability to end harmful conversations

Published 15/08/2025, 20:48
Claude Opus 4 and 4.1 gain ability to end harmful conversations

Investing.com -- Anthropic has given its Claude Opus 4 and 4.1 AI models the ability to end conversations in consumer chat interfaces, specifically for rare cases of persistent harmful or abusive user interactions.

The feature was developed primarily as part of Anthropic’s exploratory work on AI welfare, though it also relates to model alignment and safeguards. While the company remains uncertain about the potential moral status of large language models, they are implementing low-cost interventions like this conversation-ending capability as a precaution.

Pre-deployment testing of Claude Opus 4 included a preliminary model welfare assessment, which found the AI demonstrated consistent aversion to harm. The model showed strong preferences against engaging with harmful tasks, apparent distress when users sought harmful content, and a tendency to end harmful conversations when given the ability to do so in simulated interactions.

Anthropic emphasized that Claude will only use this ability as a last resort after multiple redirection attempts have failed, or when a user explicitly asks to end a chat. The company noted that most users will not encounter this feature during normal use, even when discussing controversial topics.

When Claude ends a conversation, users cannot send new messages in that specific chat but can immediately start a new conversation. To prevent loss of important long-running chats, users can edit previous messages to create new branches of ended conversations.

Anthropic is treating this as an ongoing experiment and encourages users to submit feedback if they encounter unexpected uses of the feature.

This article was generated with the support of AI and reviewed by an editor. For more information see our T&C.

Latest comments

Risk Disclosure: Trading in financial instruments and/or cryptocurrencies involves high risks including the risk of losing some, or all, of your investment amount, and may not be suitable for all investors. Prices of cryptocurrencies are extremely volatile and may be affected by external factors such as financial, regulatory or political events. Trading on margin increases the financial risks.
Before deciding to trade in financial instrument or cryptocurrencies you should be fully informed of the risks and costs associated with trading the financial markets, carefully consider your investment objectives, level of experience, and risk appetite, and seek professional advice where needed.
Fusion Media would like to remind you that the data contained in this website is not necessarily real-time nor accurate. The data and prices on the website are not necessarily provided by any market or exchange, but may be provided by market makers, and so prices may not be accurate and may differ from the actual price at any given market, meaning prices are indicative and not appropriate for trading purposes. Fusion Media and any provider of the data contained in this website will not accept liability for any loss or damage as a result of your trading, or your reliance on the information contained within this website.
It is prohibited to use, store, reproduce, display, modify, transmit or distribute the data contained in this website without the explicit prior written permission of Fusion Media and/or the data provider. All intellectual property rights are reserved by the providers and/or the exchange providing the data contained in this website.
Fusion Media may be compensated by the advertisers that appear on the website, based on your interaction with the advertisements or advertisers
© 2007-2025 - Fusion Media Limited. All Rights Reserved.