I wonder what his first clue was.

  • KingRandomGuy@lemmy.world
    21 hours ago

    TBH the paper is a bit light on details, at least by the standards of top ML conferences. A lot of DeepSeek’s engineering innovations aren’t documented well enough in their papers for me to confidently reproduce them.