• NocturnalMorning@lemmy.world
    arrow-up
    25
    ·
    1 month ago

    Isn’t it a well-known fact that training on the output of other AI models leads to complete collapse of the newly trained models?

          • antonim@lemmy.dbzer0.com
            arrow-up
            2
            arrow-down
            1
            ·
            edit-2
            1 month ago

            He has both

            In 1990, he entered Queen’s University in Kingston, Ontario.[49][50] Two years later, he transferred to the University of Pennsylvania, where he studied until 1995.[51] Although Musk has said that he earned his degrees in 1995, the University of Pennsylvania did not award them until 1997 – a Bachelor of Arts in physics and a Bachelor of Science in economics from the university’s Wharton School.[52][53][54][55][56]

        • Kairos@lemmy.today
          arrow-up
          1
          ·
          edit-2
          1 month ago

          What was the major? Judging by what he does it’d probably be “kinematics of sex”

    • 8uurg@lemmy.world
      arrow-up
      2
      ·
      1 month ago

      Not quite, actually. It is more so training recursively on the output without any changes, i.e., Data -> Model A -> Data (generated by Model A) -> Model B -> Data (generated by Model B) -> …, that leads to (complete) collapse. A single step like this can still worsen performance notably, though, especially when the generated data makes up the vast majority of the training data. [source]
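
To make that loop concrete, here is a rough toy sketch. It is not taken from the linked source: the "model" is just a table of token frequencies, and the token names and sizes are made up for illustration. Each generation is fitted only to samples drawn from the previous one, with no fresh data mixed in.

```python
import random
from collections import Counter

def fit(samples):
    """'Train' a toy model: the empirical frequency of each token."""
    counts = Counter(samples)
    total = len(samples)
    return {token: n / total for token, n in counts.items()}

def generate(model, n, rng):
    """Sample n tokens from the toy model."""
    tokens = list(model)
    weights = [model[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=n)

def recursive_training(real_data, generations, rng):
    """Data -> Model A -> Data (from A) -> Model B -> ... with no fresh data.

    Returns the number of distinct tokens each successive model can produce.
    """
    data = list(real_data)
    support_sizes = []
    for _ in range(generations):
        model = fit(data)
        support_sizes.append(len(model))
        # The next model only ever sees what this one generated.
        data = generate(model, len(data), rng)
    return support_sizes

rng = random.Random(0)
# Hypothetical "real" data: one common token plus a long tail of rare ones.
real_data = ["common"] * 50 + [f"rare_{i}" for i in range(50)]
sizes = recursive_training(real_data, generations=30, rng=rng)
print(sizes)  # distinct tokens per generation; this can never increase
```

In this toy setup a rare token that fails to show up in one generation's samples is gone for good, so the tail of the distribution can only erode. Real language models are far messier than a frequency table, so treat this as an intuition pump, not a simulation.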

      And if they train using little data, you won’t get anywhere near the chatbots we have now. If they fine-tune an existing model to do as they wish, it would likely have side effects, such as being more likely to introduce security bugs in generated code, generally giving incorrect answers to other common-sense questions, and so on. [source]

    • Avicenna@lemmy.world
      arrow-up
      1
      ·
      1 month ago

      From what he wrote, it sounds like it will mostly be existing data with substitutions/corrections made wherever they deem necessary. For example, when you ask about Elon, it will probably spew something along the lines of: the greatest inventor of the last century, a polymath, and a very successful Path of Exile 2 player.