Model Distillation AI News & Updates

Chinese AI Lab DeepSeek Allegedly Used Google's Gemini Data for Model Training

Chinese AI lab DeepSeek is suspected of training its latest R1-0528 reasoning model on outputs from Google's Gemini, based on linguistic similarities and behavioral patterns observed by researchers. This follows earlier accusations that DeepSeek trained on data from rival models, including ChatGPT, with OpenAI saying it had found evidence of distillation. In response, AI companies are implementing stronger security measures to prevent this kind of unauthorized data extraction and model distillation.
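For context, "distillation" here means training a smaller or cheaper "student" model to imitate a stronger "teacher" model. In the alleged API-only setting, that usually means fine-tuning on the teacher's generated text, since rivals do not expose internal logits. As a rough illustration of the classic logit-based technique only, and not any lab's actual pipeline, here is a minimal PyTorch sketch of a standard distillation loss:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Classic soft-label distillation: blend a KL term against the
    teacher's temperature-softened distribution with an ordinary
    cross-entropy term on the hard labels."""
    # Soften both distributions with the temperature.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)

    # KL(teacher || student); scale by T^2 to keep gradients comparable.
    kd = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean")
    kd = kd * temperature ** 2

    # Standard supervised loss on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)

    return alpha * kd + (1 - alpha) * ce

# Dummy example: batch of 4 examples, vocabulary of 10 classes.
student = torch.randn(4, 10, requires_grad=True)
teacher = torch.randn(4, 10)
labels = torch.randint(0, 10, (4,))
loss = distillation_loss(student, teacher, labels)
loss.backward()
```

The temperature and blending weight are illustrative defaults; in text-only distillation the KL term is replaced by plain next-token cross-entropy on the teacher's generated outputs.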

DeepSeek Releases Efficient R1 Distilled Model That Runs on Single GPU

DeepSeek released a smaller, distilled version of its updated R1 reasoning model, DeepSeek-R1-0528-Qwen3-8B, that can run on a single GPU while remaining competitive on math benchmarks. Built by fine-tuning Alibaba's Qwen3-8B on text generated by the full R1-0528 model, it outperforms Google's Gemini 2.5 Flash on certain tests and nearly matches Microsoft's Phi 4, while requiring far less compute than the full R1 model. It is available under the MIT license for both academic and commercial use.
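For a sense of what "runs on a single GPU" means in practice, an 8B-parameter model in bf16 needs roughly 16 GB of VRAM and can be loaded with the Hugging Face transformers library. A minimal sketch follows; the repo id is an assumption derived from the published model name, so check the actual model card for the exact identifier and recommended generation settings:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hugging Face repo id, derived from the published model name;
# verify against the real model card before use.
model_id = "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~16 GB of VRAM for 8B params in bf16
    device_map="auto",           # place the weights on the available GPU
)

messages = [{"role": "user", "content": "What is 27 * 14?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Reasoning models of this kind typically emit a long chain of thought before the final answer, so generous max_new_tokens budgets are common.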