AI theft AI News & Updates
Anthropic Exposes Massive Chinese AI Model Distillation Campaign Targeting Claude
Anthropic has accused three Chinese AI companies (DeepSeek, Moonshot AI, and MiniMax) of creating over 24,000 fake accounts to conduct distillation attacks on Claude, generating 16 million exchanges to copy its capabilities in reasoning, coding, and tool use. The accusations emerge amid debates over US AI chip export controls to China, with Anthropic arguing that such attacks require advanced chips and justify stricter export restrictions. The incident raises concerns about AI model theft, national security risks from models stripped of safety guardrails, and the effectiveness of current export control policies.
Skynet Chance (+0.04%): The distillation attacks stripped safety guardrails from advanced AI models and proliferated dangerous capabilities to actors who may deploy them for offensive cyber operations, disinformation, and surveillance, increasing risks of misaligned AI deployment. Open-sourcing models without safety protections amplifies the risk of uncontrolled AI systems being used by malicious actors.
Skynet Date (-1 days): The successful large-scale theft and rapid advancement of Chinese AI capabilities through distillation accelerates the global proliferation of frontier AI capabilities to actors with fewer safety constraints. This compressed timeline for widespread advanced AI deployment increases near-term risks.
AGI Progress (+0.03%): The incident demonstrates that distillation can rapidly transfer advanced capabilities like agentic reasoning, tool use, and coding across models, effectively democratizing frontier capabilities and accelerating global progress toward AGI-relevant skills. DeepSeek's upcoming V4 model reportedly outperforms Claude and ChatGPT in coding, showing successful capability extraction.
AGI Date (-1 days): Distillation techniques enable rapid capability transfer at fraction of original development cost, significantly accelerating the pace at which multiple labs can achieve frontier performance levels. The fact that Chinese labs achieved near-parity with US frontier models through these methods suggests AGI-relevant capabilities will spread faster than anticipated through traditional development timelines.