Researchers Replicated DeepSeek’s R1-Zero Model for Just $30

Researchers have replicated DeepSeek's R1-Zero model for $30 in a project called TinyZero, which applies reinforcement learning to a 3-billion-parameter language model on countdown and multiplication tasks. The open-source project shows the model developing stronger reasoning capabilities on its own through that training. Details of the methodology are available on Weights & Biases, and a formal research paper is forthcoming. The project aims to promote cost-effective AI research.

Source: cybersecuritynews.com
