WeSearch

Learning my lesson that Python virtual environments aren't always movable

·2 min read · 0 reactions · 0 comments · 5 views
#web security#crawling#browsers#blogging#user agents#Wandering Thoughts#CSpace#Chris Siebenmann#Inoreader#Vivaldi#archive.today#archive.ph#archive.is
⚡ TL;DR · AI summary

The author of the blog Wandering Thoughts explains that visitors using old browser versions or certain user agents may be blocked due to anti-crawler measures. This is in response to a surge in high-volume crawlers, often using outdated Chrome user agents, potentially for LLM training data collection. The author clarifies that legitimate services like Inoreader and archive.org are not blocked, but some archiving services and browsers like Vivaldi may trigger restrictions.

Key facts
Original article
Utoronto
Read full at Utoronto →
Opening excerpt (first ~120 words) tap to expand

You're using a suspiciously old browser You're probably reading this page because you've attempted to access some part of my blog (Wandering Thoughts) or CSpace, the wiki thing it's part of. Unfortunately you're using a browser version that my anti-crawler precautions consider suspicious, most often because it's too old (most often this applies to versions of Chrome). Unfortunately, as of early 2025 there's a plague of high volume crawlers (apparently in part to gather data for LLM training) that use a variety of old browser user agents, especially Chrome user agents. To reduce the load on Wandering Thoughts I'm experimenting with (attempting to) block all of them, and you've run into this.

Excerpt limited to ~120 words for fair-use compliance. The full article is at Utoronto.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from Utoronto