#googlebot

Barry Schwartzrustybrick@c.im
2026-02-11

Google updates its Googlebot/crawlers file size limit help document again for clarification seroundtable.com/google-crawle

#google #googlebot #googleseo

text from help doc
Inautiloinautilo
2026-02-10


Google lists Googlebot file limits · Do Google’s crawling limits affect your website? ilo.im/16adna

_____

PPC Landppcland
2026-02-07

Testing tool simulates Google's 2MB HTML limit as SEO professionals assess crawling impact: Dave Smart added 2MB truncation feature to Tame the Bots fetch tool on February 6, enabling technical SEO professionals to simulate Googlebot's reduced file size limits. ppc.land/testing-tool-simulate

Barry Schwartzrustybrick@c.im
2026-02-04

Google clarifies Googlebots crawling limit of 15MB (old) but what is new is 2MB for other file types and 64MB for PDF documents seroundtable.com/googlebot-fil

#seo #googleseo #google #googlebot

Google clarifies Googlebots crawling limit of 15MB (old) but what is new is 2MB for other file types and 64MB for PDF documents
Barry Schwartzrustybrick@c.im
2026-02-03
text from the page
Barry Schwartzrustybrick@c.im
2026-01-22

A new Googlebot named "Google Messages" was added to Google's documentation seroundtable.com/new-googlebot

#google #googlebot

A new Googlebot named "Google Messages" was added to Google's documentation
2025-12-31

Vers un #web toujours plus fragile siecledigital.fr/2025/12/31/et
À eux seuls, les #bots représenteraient près de 30% du trafic web mondial, avec des pics capables de générer des volumes comparables à des attaques DDoS
#Googlebot est le #crawler dominant avec 4,5% des requêtes HTML
En 2025, le #smartphone s’impose avec environ 43% des utilisateurs mondiaux, contre 57% pour les ordinateurs. #Android domine largement le trafic mobile à l’échelle mondiale, tandis qu’#iOS conserve une position forte

Jack Yan (甄爵恩)jackyan
2025-12-10

Why would go through ?

A visit from someone using Hetzner, but their user-agent says they are Googlebot.
Barry Schwartzrustybrick@c.im
2025-11-20

Google internally proposed 6 options for (or not for) controlling how AI can use your content and blocking controls seroundtable.com/google-option via @natejhake

#google #ai #googleai #googlebot #seo

Google internally proposed 6 options for (or not for) controlling how AI can use your content and blocking controls
Barry Schwartzrustybrick@c.im
2025-11-04

New Google user agent, Google-CWS Chrome Web Store, added to the user-triggered fetchers list seroundtable.com/google-chrome

#google #googlebot #chrome

New Google user agent, Google-CWS Chrome Web Store, added to the user-triggered fetchers list
2025-10-29

So, someone in the issue made me realize that some bots impersonate the user agents of big actors, such as Googlebot. I checked my webserver logs and found a lot of them actually!

I liked the challenge, so I just wrote an article about how to do this in less than 40 SLOC 🏆
reaction.ppom.me/filters/usera

#reactionrust #bots #badbots #google #googlebot

Barry Schwartzrustybrick@c.im
2025-10-29

An undocumented Google user agent named geminiios was discovered seroundtable.com/geminiios-goo

#gemini #google #googlebot #useragent

An undocumented Google user agent named geminiioswebview geminios
Djoerd Hiemstra 🍉djoerd@idf.social
2025-10-18

@jackyan I suspect they created #GoogleOther to break the crawling / robots.txt / nettiquette rules without getting too many repurcusions on #GoogleBot.

Barry Schwartzrustybrick@c.im
2025-10-16

Google Read Aloud user agent service updates to list Google services that use it plus how AI is used and not used seroundtable.com/google-read-a

#google #googlebot

Google Read Aloud user agent service updates to list Google services that use it plus how AI is used and not used
Barry Schwartzrustybrick@c.im
2025-10-09

Google added Google NotebookLM to the list of Google crawlers, under the list of user-triggered fetchers seroundtable.com/google-notebo

#notebooklm #google #googlebot

Google added Google NotebookLM to the list of Google crawlers, under the list of user-triggered fetchers
Jack Yan (甄爵恩)jackyan
2025-09-27

Is hacking?

Reports of abusive content scans from users examining Googlebot's activities on their servers.
autorankerautoranker75
2025-09-23

Không phải nội dung nào cũng nên xuất hiện trên Google! Noindex giúp bạn kiểm soát điều đó.
Tìm hiểu chi tiết tại: autoranker.net/noindex-la-gi/

2025-09-13

@cks

Early results are not promising. I've had a handful of HEAD requests in the past day. Only 2 appear legitimate, in that they hit genuine page URLs. The others were attempts to exploit WordPress vulnerabilities.

#HTTP #httpd #GoogleBot #djbwares #WordPress

2025-09-12

@cks

It makes me think that there's one well-behaved 'bot drowned in a sea of ill-behaved ones.

I'm just instrumenting #djbwares httpd to log GET and HEAD differently. I wonder what I'll see.

#HTTP #httpd #GoogleBot

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst