@josephcox depending on language subcorpus, the Google Books ngrams are already tainted/biased by all kinds of things.
@fotis_jannidis did some digging for the German corpus, cf. https://zenodo.org/doi/10.5281/zenodo.7715377 (presented at #DHd2023). His conclusion is that the German ngrams are corrupted at least since 2000.