Training Data Inc: The AI Companies Profiting From Content They Never Licensed
We cross-referenced publicly available training datasets against copyright registries and publisher opt-out lists. The scale of unlicensed ingestion is larger than any company has admitted.