if we are relatively rapidly moving toward ~all online content being AI-generated, interestingly internet content 1995-2025 is the seed for all content forever, as training on data from post-2025 will be mostly recursively training on output generated from data from 1995-2025