• surewhynotlem@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    3
    ·
    2 days ago

    Prove it. Please, show me the full training data to guarantee you’re right.

    But also, all the kids used for “kids face data” didn’t sign up to be porn

    • jaschen@lemm.ee
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      2
      ·
      2 days ago

      I don’t need to. It’s is just the way gen AI works. It takes images of things it knows and then generates NEW content based on what it think you want with your prompts.

      If I’m looking for a infant flying an airplane, gen AI knows what a pilot looks like and what a child looks like and it creates something new.

      Also kids face data doesn’t mean they take the actual face of the actual child and paste it on a body. It might take an eyebrow and a freckle from one kidand use a hair style from another and eyes from someone else.

      Lastly, the kids parents consented when they upload images of their kids on social media.

            • jaschen@lemm.ee
              link
              fedilink
              English
              arrow-up
              2
              arrow-down
              2
              ·
              18 hours ago

              So is that the Gen AI problem or the open internets problem. It sounds like you hate the open internet and awful people who put real cp online and not Gen AI.

        • ExLisper@lemmy.curiana.net
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          5
          ·
          24 hours ago

          What AI are you talking about? Are you suggesting the commercial models from OpenAI are trained using CP? Or just that there are some models out there that were trained using CP? Because yeah, anyone can create a model at home and train it with whatever. But suggesting that OpenAI has a DB of tagged CP is a different story.

          • surewhynotlem@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            ·
            21 hours ago

            Open AI just scours the Internet. 100% chance it’s come across someone illegal and horrible. They don’t pre-approve its training data.

            • ExLisper@lemmy.curiana.net
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              3
              ·
              21 hours ago

              But you have to describe it. It doesn’t just suck in images at random. I imagine someone will remove CP when the images are reviewed. Or do you think they just download all images and add them to the training set without even looking at them?

              • surewhynotlem@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                10 hours ago

                I think that’s exactly what they do. Curation at the quantities that they’re working at would require an army.

                • ExLisper@lemmy.curiana.net
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  48 minutes ago

                  So you think to train AI you just show it random images without describing what they represent and AI just magically learns? If I then ask AI to create an image of a computer, how does it know what a computer is? Does it just learn this on it’s own from all the random images?