Monday, January 27, 2025

How DeepSeek outpaced OpenAI at 3% of the cost: open-source approach, pure reinforcement learning, not supervised fine-tuning, and building on DeepSeek-R1-Zero (Matt Marshall/VentureBeat)

Matt Marshall / VentureBeat:
How DeepSeek outpaced OpenAI at 3% of the cost: open-source approach, pure reinforcement learning, not supervised fine-tuning, and building on DeepSeek-R1-Zero  —  DeepSeek R1's Monday release has sent shockwaves through the AI community, disrupting assumptions about what's required to achieve cutting-edge AI performance.



No comments:

Post a Comment

DeepSeek says it used Nvidia H800 chips, available in China until October 2023, to train R1, suggesting future models could be hampered by US export controls (Bloomberg)

Bloomberg : DeepSeek says it used Nvidia H800 chips, available in China until October 2023, to train R1, suggesting future models could b...