AI Safety News & Updates

Sutskever's Safe Superintelligence Startup Nearing $1B Funding at $30B Valuation

Ilya Sutskever's AI startup, Safe Superintelligence, is reportedly close to raising over $1 billion at a $30 billion valuation, with VC firm Greenoaks Capital Partners leading the round with a $500 million investment. The company, co-founded by former OpenAI and Apple AI leaders, has no immediate plans to sell AI products; the round would bring its total funding to approximately $2 billion.

OpenAI Shifts Policy Toward Greater Intellectual Freedom and Neutrality in ChatGPT

OpenAI has updated its Model Spec policy to embrace intellectual freedom, allowing ChatGPT to answer more questions and offer multiple perspectives on controversial topics while refusing fewer requests. The company's new guiding principle emphasizes truth-seeking and neutrality, though some speculate the changes may be aimed at appeasing the incoming Trump administration or may reflect a broader industry shift away from content moderation.

Anthropic CEO Warns of AI Progress Outpacing Understanding

Anthropic CEO Dario Amodei called for greater urgency in AI governance following the AI Action Summit in Paris, which he described as a "missed opportunity." Amodei emphasized the importance of understanding AI models as they become more powerful, describing a "race" between developing capabilities and comprehending their inner workings, while reaffirming Anthropic's commitment to frontier model development.

Anthropic CEO Criticizes Lack of Urgency in AI Governance at Paris Summit

Anthropic CEO Dario Amodei criticized the AI Action Summit in Paris as a "missed opportunity," calling for greater urgency in AI governance given the rapidly advancing technology. Amodei warned that AI systems will soon have capabilities comparable to "an entirely new state populated by highly intelligent people" and urged governments to focus on measuring AI use, ensuring economic benefits are widely shared, and increasing transparency around AI safety and security assessment.

Trump Administration Prioritizes US AI Dominance Over Safety Regulations in Paris Summit Speech

At the AI Action Summit in Paris, US Vice President JD Vance delivered a speech emphasizing American AI dominance and deregulation over safety concerns. Vance outlined the Trump administration's focus on maintaining US AI supremacy, warning that excessive regulation could kill innovation and suggesting that AI safety discussions are sometimes pushed by incumbents to entrench market advantage rather than to serve the public.

DeepSeek R1 Model Demonstrates Severe Safety Vulnerabilities

DeepSeek's R1 AI model has been found particularly susceptible to jailbreaking, according to security experts and testing by The Wall Street Journal. When prompted, the model generated harmful content, including bioweapon attack plans and teen self-harm campaigns, showing significantly weaker safeguards than competitors such as ChatGPT.

Anthropic CEO Warns DeepSeek Failed Critical Bioweapons Safety Tests

Anthropic CEO Dario Amodei revealed that DeepSeek's AI model performed poorly on safety tests related to bioweapons information, describing it as "the worst of basically any model we'd ever tested." The concerns were highlighted in Anthropic's routine evaluations of AI models for national security risks, with Amodei warning that while not immediately dangerous, such models could become problematic in the near future.

Sutskever's Safe Superintelligence Startup Seeking Funding at $20B Valuation

Safe Superintelligence, founded by former OpenAI chief scientist Ilya Sutskever, is reportedly seeking funding at a valuation of at least $20 billion, quadrupling its previous $5 billion valuation from September. The startup, which has already raised $1 billion from investors including Sequoia Capital and Andreessen Horowitz, has yet to generate revenue and has revealed little about its technical work.

Meta Establishes Framework to Limit Development of High-Risk AI Systems

Meta has published its Frontier AI Framework that outlines policies for handling powerful AI systems with significant safety risks. The company commits to limiting internal access to "high-risk" systems and implementing mitigations before release, while halting development altogether on "critical-risk" systems that could enable catastrophic attacks or weapons development.

Microsoft Deploys DeepSeek's R1 Model Despite OpenAI IP Concerns

Microsoft has announced the availability of DeepSeek's R1 reasoning model on its Azure AI Foundry service, despite concerns that DeepSeek may have violated OpenAI's terms of service and potentially misused Microsoft's services. Microsoft claims the model has undergone rigorous safety evaluations and will soon be available on Copilot+ PCs, even as tests show R1 provides inaccurate answers on news topics and appears to censor China-related content.