I wonder what his first clue was.

  • burghler@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    47
    ·
    22 hours ago

    It’s been a few days and a simple search reveals it’s already been reproduced by many different bodies using the “vague” pdf. What’s this disservice for?

    • KingRandomGuy@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 hours ago

      TBH the paper is a bit light on the details, at least compared to the standards of top ML conferences. A lot of DeepSeek’s innovations on the engineering front aren’t super well documented (at least well enough that I could confidently reproduce them) in their papers.