WeSearch

Scraping 241 UK council planning portals – 2.6M decisions so far

·1 min read · 0 reactions · 0 comments · 1 view
#planning data#web scraping#uk councils#data aggregation#local government
⚡ TL;DR · AI summary

An individual has scraped planning decision data from 241 of the UK's 400+ council portals, amassing 2.6 million records despite technical challenges posed by outdated systems, inconsistent schemas, and anti-bot protections. The data reveals a national planning application approval rate of around 88%, with significant variation between local areas. Delays in decision-making have increased, with 36.5% of home extensions in England and Wales missing the 8-week target in 2025, up from 27.9% in 2019. The project is currently offered as a free postcode checker and paid PDF reports, with no paying customers yet.

Original article
Ycombinator
Read full at Ycombinator →
Full article excerpt tap to expand

I've been scraping 241 UK council planning portals – 2.6M decisions so farUK planning data is technically public. In practice it's locked behind 400+ different council portals, some still running bespoke ASP.NET that looks like it dates from 2004, some behind AWS WAF, all with subtly different schemas. I've spent four months scraping them. I'm now at 241 councils and 2.6 million decisions across England, Scotland and Wales.The scraping problemMost UK councils run one of a handful of portal systems, Idox being the most common. In theory this makes things easy. In practice every council has configured theirs differently, some block non-browser requests via TLS fingerprinting, some have rate limits that will get you banned inside 10 minutes, and a handful are running the aforementioned bespoke ASP.NET.I ended up writing several scrapers: a standard requests-based one, a Playwright-based one for councils that block anything that doesn't look like a real browser, and a curl_cffi one for TLS fingerprinting. Some councils I still can't get. Liverpool's portal sits behind AWS WAF with a JavaScript challenge. I have a working Playwright-based scraper that solves the challenge once and reuses cookies, but the WAF rate-limits the IP after about 10 requests and then blocks me for a day. So I have 60k Liverpool decisions from an old scrape and no easy way to add more.What I foundThe approval rate stuff is what most people come for. Nationally it's around 88%, but it varies wildly by ward within a council, not just between councils.The more interesting finding came from the time-to-decision data. Across 119 English and Welsh councils, 36.5% of home extension applications missed the statutory 8-week target in 2025, up from 27.9% in 2019. Guildford is the worst at scale: 66% of decisions over target, averaging 13.3 weeks.What it is nowA postcode checker (free) and paid PDF reports (£19/£79). Zero paying customers so far, which is fine. I've been heads down on data quality and coverage.Site is planninglens.co.uk if you want to poke around. AMA on the scraping side – that's where the interesting problems are.

This excerpt is published under fair use for community discussion. Read the full article at Ycombinator.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Email

Discussion

0 comments

More from Ycombinator