There has (quite rightly) been a lot of discussion on the internet about whether training Copilot on non-permissively licensed open source code is fair. There are similar debates to be had about training DALL-E and others on artworks.
For software, my feeling is that this is very much a grey area, so why not make it explicit?
Should we develop licences that are crystal clear around whether you are permitted to use this codebase for the purposes of training models?