URL Unshortener


In this post, I talk about what I learned from scraping random URL shortener links.

Motivation

While organizing my old files, I came across a tiny Python project I had made that takes a shortened URL and prints the URL it redirects to. It also had an option to try a random URL. Intrigued, I fixed up the code a bit and ran it overnight so that it would continuously try random links and log the live ones for later inspection.
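
I no longer have the exact code handy, so here is a minimal sketch of the idea rather than the original project. The domain `short.example`, the six-character alphanumeric code format, and the names `random_short_url`, `unshorten`, and `scan` are all assumptions made for illustration; a real shortener may use a different alphabet or code length.

```python
import random
import string
import urllib.request

# Assumption: short codes are drawn from ASCII letters and digits.
ALPHABET = string.ascii_letters + string.digits
# Hypothetical shortener domain, used only for illustration.
BASE_URL = "https://short.example/"


def random_short_url(length=6, base=BASE_URL, rng=random):
    """Build a random short URL by appending `length` random characters to `base`."""
    code = "".join(rng.choice(ALPHABET) for _ in range(length))
    return base + code


def unshorten(url, timeout=10.0):
    """Return the final URL after following redirects, or None if the link is dead."""
    try:
        request = urllib.request.Request(
            url, headers={"User-Agent": "unshortener-sketch"}
        )
        # urlopen follows HTTP redirects by default; geturl() gives the final address.
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.geturl()
    except (OSError, ValueError):
        # HTTPError (e.g. a 404) and URLError are OSError subclasses;
        # ValueError covers malformed URLs.
        return None


def scan(attempts, log_path="live_links.log"):
    """Try `attempts` random links and append each live one to a log file."""
    with open(log_path, "a", encoding="utf-8") as log:
        for _ in range(attempts):
            short = random_short_url()
            final = unshorten(short)
            if final is not None:
                log.write(f"{short} -> {final}\n")
```

Something like `scan(1000)` would then try a thousand random codes and leave the live ones in `live_links.log`. A polite version would also sleep between requests so it does not hammer the shortener.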

General Patterns

When I went through the links the next day, the vast majority of them turned out to be 404 pages. However, there were a few common categories of links that I found interesting.

Specific Sites

The general patterns were mostly garbage. More rewarding were a handful of little sites that I found.

Scam Websites?

I found a couple of scam websites, which I won’t link directly.

Conclusions

I found a few neat websites and learned a bit through this project, but I should probably get back to organizing my files. If I work on this again in the future, I’m interested in investigating the types of links that I would get from words or combinations of words.