At Daily Nous, there’s discussion of how much philosophy sites figure in Google’s C4 data set— and so in the training set of Large Language Models. The Washington Post has a widget to search for the rank of specific domains.
This very site— this blog plus my other foofaraw— ranks 612,096th with about 38 thousand tokens.
My old blog ranks close behind at 625,716th with about 37k tokens.
Although the tool isn’t designed to give this kind of result, the two together would rank somewhere around 300,000th.