The day Donald Trump assumed office for his second term, a Chinese startup called DeepSeek quietly released an AI model called ๐๐ฒ๐ฒ๐ฝ๐ฆ๐ฒ๐ฒ๐ธโ๐ฅ๐ญ. While Trumpโs subsequent shake-ups were expected, little did one know that DeepSeek-R1 would send tremors in the AI industry.
Benchmark tests have proved DeepSeek-R1 bests the performance of OpenAIโs best model yet – the ๐ข๐ฝ๐ฒ๐ป๐๐ ๐ผ๐ญ and accomplishes this at a fraction of the cost o1 and the other similar AI models incur. DeepSeek was founded in 2023 with efficiency and cost as its primary drivers. The company claims it has spent less than $6 million in computing resources for training its model. In comparison, it is estimated that this cost for GPT4 was hundreds of millions of dollars!
Now comes the interesting part. For the users of the DeepSeek-R1 model the ๐ฐ๐ผ๐๐ ๐ผ๐ณ ๐ถ๐ป๐ฝ๐๐ ๐ฎ๐ป๐ฑ ๐ผ๐๐๐ฝ๐๐ ๐๐ผ๐ธ๐ฒ๐ป๐ ๐ถ๐ ๐ฎ๐ฏ๐ผ๐๐ ๐ฏ๐ฌ ๐ง๐๐ ๐๐ฆ ๐น๐ฒ๐๐๐ฒ๐ฟ than in OpenAI o1! ย Total game changer! DeepSeek has accomplished this feat despite the imposition of constraints and sanctions by the US on China. Incredible story. Reminds one of the Wright brothers’ story. Samuel Pierpont Langley, an American physicist was heavily funded by the U.S. government in his attempts to build a powered, heavier-than-air aircraft. However, through their innovative approach and meticulous experimentation, the Wright brothers succeeded in achieving this with a lower budget. DeepSeek has proved yet again loads of money and resources are not necessary for innovation to thrive. The team at DeepSeek dared to think differently, challenged the basic assumptions in other AI models and the rest is history.
This morning a colleague pointed out that the model has constraints – it does not answer some sensitive questions. Not unexpected, I said to myself. Since DeepSeek has open sourced DeepSeek-R1 (๐ธ๐ฉ๐ช๐ญ๐ฆ ๐๐ฑ๐ฆ๐ฏ๐๐ ๐ฐ1 ๐ช๐ด ๐ฑ๐ณ๐ฐ๐ฑ๐ณ๐ช๐ฆ๐ต๐ข๐ณ๐บ) a simple hack is to download it and use it with, say, Perplexity, and the constraints vanish! No wonder the AI developer community is rejoicing. DeepSeek has also blasted the ๐ฆ๐ฐ๐ฎ๐น๐ถ๐ป๐ด ๐๐๐ฝ๐ผ๐๐ต๐ฒ๐๐ถ๐ – (that you need more compute and resources to churn out better and more intelligent models) by accomplishing similar results deploying just a fraction of the resources. This has sent shivers down the spine of Nvidia, OpenAI and others who have invested billions of dollars to scale up compute. On Monday the 27th of Jan, NVIDIA shares tumbled and eroded $600 billion in its market value.
Just till a few days back, the world was staring at a massive energy shortfall in the future due to the projected needs of energy-guzzling AI infrastructure. With DeepSeek R-1 consuming much lesser energy, things will look much better on the energy front and climate activists will heave a big sigh of relief.
AI stands democratized.
Thanks to drastically reduced costs we will witness an exponential rise in number of AI-based applications (and of course start ups ) with accelerated adoption of AI in business. We have heard this refrain from CXOs โ I see a lot of AI Proofs-of-Concept being done. Show me one big AI projectโ. This will now change as the cost of implementing full size projects nosedives. We are stepping into an exciting era of AI Transformation.
Lower infrastructure and running costs for AI will mean more wings to government sponsored learning and skilling initiatives, and other programmes targeted at the masses. I havenโt seen a more significant disruption yet, and am sanguine this will eventually lead to a state of abundance and overall improvement in the happiness, quality of life and health of all of humankind – a realisation of the universal prayer :
๐ฐ๐ฎ ๐ด๐ข๐ณ๐ท๐ฆ ๐ฃ๐ฉ๐ข๐ท๐ข๐ฏ๐ต๐ถ ๐ด๐ถ๐ฌ๐ฉ๐ช๐ฏ๐ข๐ฉ
๐ด๐ข๐ณ๐ท๐ฆ ๐ด๐ข๐ฏ๐ต๐ถ ๐ฏ๐ช๐ณ๐ข๐ฎ๐ข๐บ๐ข๐ฉ …
