DeepSeek confirmed us that scaling isn’t all it’s worthwhile to resolve AI compute

February 24, 2025

2

Returning nearer to the current day, we discover industrial improvement of AI beholden to “The Bitter Lesson.” After Nvidia’s CUDA enabled environment friendly tensor operations on GPUs and deep networks like AlexNet drove unprecedented progress in diverse fields, the beforehand numerous strategies competing for dominance in machine studying benchmarks homogenized to solely throwing extra compute at deep studying.

There’s maybe no higher instance of the bitter lesson than massive language fashions, which displayed unimaginable emergent capabilities with scaling over the previous decade. May we actually attain synthetic basic intelligence (AGI), that’s, programs amounting to the archetypal depictions of AI seen in Blade Runner or 2001: A Area Odyssey, just by including extra parameters to those LLMs and extra GPUs to the clusters they’re educated on?

My work at UCSD was predicated on the idea that this scaling wouldn’t result in true intelligence. And, as we’ve seen in current reporting from prime AI labs like OpenAI and luminaries like François Chollet, the way in which we’ve been approaching deep studying has hit a wall. “Now all people is trying to find the subsequent huge factor,” Sutskever aptly places it. Is it doable that, with strategies like making use of reinforcement studying to LLMs à la OpenAI’s o3, we’re ignoring the knowledge of the bitter lesson (although these strategies are undoubtedly computationally intensive)? What if we sought to know a “principle of the whole lot” for studying, after which double down on that?

We have now to deconstruct, then reconstruct, how AI fashions are educated

Reasonably than black-box approximations, at UCSD we developed breakthrough know-how that understands how neural networks really study. Deep studying fashions characteristic synthetic neurons vaguely just like ours, filtering information by way of them after which backpropagating them again as much as study options within the information (the latter step is alien to biology). It’s this characteristic studying mechanism that drives the success of AI in fields as disparate as finance and healthcare.

Supply hyperlink

DeepSeek confirmed us that scaling isn’t all it’s worthwhile to resolve AI compute

We have now to deconstruct, then reconstruct, how AI fashions are educated

Related Articles

MongoDB acquires Voyage AI to cut back hallucinations in AI functions

Flaw present in stalkerware apps, exposing hundreds of thousands of individuals. Here is find out how to discover out in case your telephone is...

Lenovo Delivers Unmatched Flexibility, Efficiency and Design with New ThinkSystem V4 Servers Powered by Intel Xeon 6 Processors

LEAVE A REPLY Cancel reply

Latest Articles

MongoDB acquires Voyage AI to cut back hallucinations in AI functions

Flaw present in stalkerware apps, exposing hundreds of thousands of individuals. Here is find out how to discover out in case your telephone is...

Lenovo Delivers Unmatched Flexibility, Efficiency and Design with New ThinkSystem V4 Servers Powered by Intel Xeon 6 Processors

Auto-Woodpecker’s anniversary! | Kaspersky official weblog

AI coding assistants are on a downward spiral