AI Markets Were Deceived To Believe In DeepSeek's Low Training Costs; They Are Actually 400 Times Higher Than The Reported Figure

Hotznplotzn@lemmy.sdf.org · 1 day ago

AI Markets Were Deceived To Believe In DeepSeek's Low Training Costs; They Are Actually 400 Times Higher Than The Reported Figure

BrikoX@lemmy.zip · edit-2 22 hours ago

It provides a bunch of claims which it fails to prove (they don’t even bother to prove them to be honest).

It’s like me saying “Based on my own analysis @[email protected] is likely a paid actor”. Without any evidence it’s meaningless claim that nobody will take seriously.

~~And it is open source by OSI definition. The only thing they don’t provide is the raw training data, which OSI definition doesn’t require to qualify.~~

Hotznplotzn@lemmy.sdf.org · 22 hours ago

The definition says it must include data information (“the complete description of all data used for training, including (if used) of unshareable data, disclosing the provenance of the data, its scope and characteristics, how the data was obtained and selected, the labeling procedures, and data processing and filtering methodologies”), as well as code and paramters. Read your link.

The guys at Hugging Face are working on a more open model based on Deepseek as they also claim it is not fully Open Source.

Thank you for stating that “@[email protected] is likely a paid actor” being baseless. It indeed is, although your hint is not too friendly.

BrikoX@lemmy.zip · edit-2 22 hours ago

You are right, it indeed doesn’t quality under OSI definition. I wasn’t aware they didn’t share the code for training the model. My bad on assuming they did, based on the public GitHub repo.

Even then, it’s still the most open commercial model out there that rivals anything US Big Tech managed to come up with using their unlimited budget. There is no diminishing that. Lack of training code only affects other companies with enough resources to build it. It’s a huge win for consumers and huge embarrassment for the US companies.

P.S. There isn’t such a thing as “not fully open source”. It either is or it’s not.