go-away — GeistHaus

powxy

Codeberg.org Runxi Yu Apr 4, 2026

Scraper-defense reverse proxy

2 inbound links object en git, non-profit, foss, oss, freesoftware, opensource, codehosting

Mataroa blog

mataroa.blog?ref=illugination.com Jan 1, 2020

Blogging platform for minimalists.

0 inbound links en blog, blogging, platform, fast, simple, minimal

Crawlers hitting Forgejo instances - global abuse trend

Codeberg.org Mar 21, 2025

Detailed discussions on excessive crawling targeting code.forgejo.org: [February 2025](https://codeberg.org/forgejo/discussions/issues/297), [April 2025](https://codeberg.org/forgejo/discussions/issues/331). Codeberg and the Forgejo infrastructure (which are entirely separate) were both ...

2 inbound links object en git, non-profit, foss, oss, freesoftware, opensource, codehosting

Golang language and apps | Jeff McNeill

Jeff McNeill Admin Apr 6, 2026

0 inbound links website en Linux, Software

Why doesn’t mataroa block AI scrapers?

Blog of Mataroa.blog Published on Jan 26, 2026

0 inbound links en

Code

Busybee Apr 8, 2026

Hopefully-useful tidbits of code and information.

0 inbound links webpage en

Dealing with Web Scrapers

Brandon Rozek Brandon Rozek Jul 2, 2025

Nowadays it seems like every tech company is eager to scrape the web. Unfortunately, it seems like the majority of traffic that comes to this small site is from scrapers. While my static website is able to handle the load, the same cannot be said about everyone. Overall, the techniques I’ve seen website owners use aim to make scraping more difficult. Though it’s a balance. The harder we make it for bots to access a website, the more we turn away regular humans as well. Here’s a short and non-exhaustive list of techniques:

0 inbound links article en blog Web Scraping, CAPTCHA, Rate Limiting, Robots.txt, Proof of Work

Preventing bot scraping on Publ and Flask

beesbuzz.biz Jul 5, 2025

This morning I was once again thinking about how to put some proper antibot behavior onto my websites, without relying on Cloudflare. There are plenty of fronting proxies like Anubis and Go Away which put a simple proof-of-work task in front of a website. This is pretty effective, but it adds more of an admin tax (and is often quite difficult to configure for servers that host multiple websites, such as mine), and sometimes the false positive rates can have some other bad effects, such as disallowing feed readers and the like.

0 inbound links website en
