Is making user interaction between instances easier even possible in current state of fediverse and the protocol?

legolas@fedit.pl · edit-2 20 hours ago

Damn you sound like bot you know that? No typos, perfect answer. Seriously. Can you prove you’re a human? xd

legolas@fedit.pl · 20 hours ago

Well maybe. Apparntly some folks are already doing that but its not done yet. Let’s wait for the results. If everything is legit we should have not one but plenty of similar and better models in near future. If Chinese did this with 100 chips imagine what can be done with 100000 chips that nvidia can sell to a us company

legolas@fedit.pl · edit-2 20 hours ago

https://www.youtube.com/watch?v=RSr_vwZGF2k This is what I watched. I base my opinion on this. Im not saying this is true. It just sounded legit enough and I didnt have time to research more. I will gladly follow some links that lead me to content that destroys this guys arguments

legolas@fedit.pl · 20 hours ago

WTF dude. You mentioned Asia. I love Asians. Asia is vast. There are many countries, not just China bro. I think you need to do these reflections. Im talking about very specific case of Chinese Deepseek devs potentiall lying about the chips. The assumptions and generalizations you are thinking of are crazy.

legolas@fedit.pl · 21 hours ago

Did they? According to their repo its still WIP https://github.com/huggingface/open-r1

legolas@fedit.pl · edit-2 22 hours ago

well if they really are and methodology can be replicated, we are surely about to see some crazy number of deepseek comptention, cause imagine how many us companies in ai and finance sector exist out there that are in posession of even larger number of chips than chinese clamied to have trained their model on.

Although the question rises - if the methodology is so novel why would these folks make it opensource? Why would they share results of years of their work to the public losing their edge over competition? I dont understand.

Can somebody who actually knows how to read machine learning codebase tell us something about deepseek after reading their code?

legolas@fedit.pl · edit-2 22 hours ago

Its just my opnion based on few sources I saw on the web. Should I attach them as links to the comment? I guess I could. But thats extra time which Im not sure I want to spend. Imagine the discussion where both sides provide links and sources to everything they say. Would be great? I guess? But at the same time would be very diffcult on both sides and time consuming. Nobody doest that in todays internet. Nobody ever did that in causal conversations. Not just internet acutally, in both real life and internet. Providing evidence is generally for court talk.

You are right. We are all on our own in pursue of truth. And with rise of AI and fake reality things are going to be crazier and crazier each year. Pair that also with the fact that our brains have limited storage capacity for information and knowledge and it doesnt look bright for humans. I stay optimistic though despite that.

legolas@fedit.pl · 22 hours ago

Yup. Thats internet nowadays. Full of comments like this. Cant do muich about it

legolas@fedit.pl · 22 hours ago

internet

legolas@fedit.pl · edit-2 22 hours ago

So are these techiques so novel and breaktrough? Will we now have a burst of deepseek like models everywhere? Cause that’s what absolutely should happen if the whole storey is true. I would assume there are dozens or even hundreds of companies in USA that are in a posession of similar number but surely more chips that Chinese folks claimed to trained their model on, especially in finance sector and just AI reserach focused.

legolas@fedit.pl · edit-2 1 day ago

Apparently DeepSeek is lying, they were collecting thousands of NVIDIA chips against the US embargo and it’s not about the algorithm. The model’s good results come just from sheer chip volume and energy used. That’s the story I’ve heard and honeslty it sounds legit.

Not sure if this questions has been answered though: if it’s open sourced, cant we see what algorithms they used to train it? If we could then we would know the answer. I assume we cant, but if we cant, then whats so cool about it being open source on the other hand? What parts of code are valuable there besides algorithms?

legolas@fedit.pl · edit-2 1 day ago

deleted by creator

legolas@fedit.pl · 3 days ago

Is making user interaction between instances easier even possible in current state of fediverse and the protocol?