Overall, when tested on 40 prompts, DeepSeek was found to have a similar energy efficiency to the Meta model, but DeepSeek tended to generate much longer responses and therefore was found to use 87% more energy.

  • Onno (VK6FLAB)
    link
    fedilink
    English
    arrow-up
    12
    ·
    2 days ago

    And here I thought that the energy consumption was in the training.

    • Aatube@kbin.melroy.orgOP
      link
      fedilink
      arrow-up
      1
      ·
      1 day ago

      The issue might be that the energy it saves in training is offset by its more intensive techniques for answering questions, and by the long answers they produce.