Training Data Inc: The AI Companies Profiting From Content They Never Licensed
We cross-referenced publicly available training datasets against copyright registries, publisher opt-out lists and court filings. The scale of unlicensed ingestion is larger than any company has admitted.