The 70B model is a distillation onto Llama 3.3: DeepSeek fine-tuned the Llama 3.3 70B base to replicate the outputs of DeepSeek-R1, keeping the Llama architecture so you get R1-style reasoning at a much lower compute cost.
So any criticism of that model's capability is really a criticism of the Llama 3.3 base, not of DeepSeek-R1 itself.