• 1 Post
  • 11 Comments
Joined 11 months ago
cake
Cake day: March 28th, 2024

help-circle





  • well if they really are and methodology can be replicated, we are surely about to see some crazy number of deepseek comptention, cause imagine how many us companies in ai and finance sector exist out there that are in posession of even larger number of chips than chinese clamied to have trained their model on.

    Although the question rises - if the methodology is so novel why would these folks make it opensource? Why would they share results of years of their work to the public losing their edge over competition? I dont understand.

    Can somebody who actually knows how to read machine learning codebase tell us something about deepseek after reading their code?


  • Its just my opnion based on few sources I saw on the web. Should I attach them as links to the comment? I guess I could. But thats extra time which Im not sure I want to spend. Imagine the discussion where both sides provide links and sources to everything they say. Would be great? I guess? But at the same time would be very diffcult on both sides and time consuming. Nobody doest that in todays internet. Nobody ever did that in causal conversations. Not just internet acutally, in both real life and internet. Providing evidence is generally for court talk.

    You are right. We are all on our own in pursue of truth. And with rise of AI and fake reality things are going to be crazier and crazier each year. Pair that also with the fact that our brains have limited storage capacity for information and knowledge and it doesnt look bright for humans. I stay optimistic though despite that.





  • Apparently DeepSeek is lying, they were collecting thousands of NVIDIA chips against the US embargo and it’s not about the algorithm. The model’s good results come just from sheer chip volume and energy used. That’s the story I’ve heard and honeslty it sounds legit.

    Not sure if this questions has been answered though: if it’s open sourced, cant we see what algorithms they used to train it? If we could then we would know the answer. I assume we cant, but if we cant, then whats so cool about it being open source on the other hand? What parts of code are valuable there besides algorithms?