

Yup. I don’t think training should be considered breaking copyright. Regurgitating though should.
There are examples of use cases besides the right now obvious one of LLMs “creating” “original” content.
One that comes to my mind is indexing books. Allowing for people to search for books based on a description.
You don’t need to be a journalist to copy/paste a title verbatim