web analytics

Interesting numbeers

I got an email from the Wayback Machine today. They’ve partnered with Automattic (the company behind WordPress) to publish a free plugin to fight link rot – dead links. According to them, 43% of the whole www is published with WordPress.

They say they’ve catalogued a trillion (!) web pages. Apparently, Pew looked into it and found that from a group of 10-year-old websites they studied, 38% of the links were dead.

I run into this a lot. Second only to websites who don’t publish the date, so you think you’re looking at the dates for this year’s Weaselfest but it’s actually 2017. But I digress.

The way it works, you install this plugin and it catalogues all the links you’ve posted and schedules them for backup. Then if the site breaks, they seamlessly switched to the archived version. Not a bad idea, if it works.

A trillion webpages. That’s got to be incredibly expensive to store. I’ve always wondered how they’re funded, so I asked Grok.

He says they have an annual budget of $20–37 million and a significant chunk of that comes from small donors. Then there are regular grants from philanthropic foundations and money from the government for archiving. They offer services like book digitizing.

I know what you’re going to ask. I’m not sure if I’m eligible for this plugin. Some years ago, there was a frenzy of copyright trolling. It was costing little websites thousands in legal claims. I’m usually careful about copyright, but I’ve published about 6,000 images and I couldn’t be sure, so I asked Wayback to forget sweasel.com. I never checked to see if they did.

Comments


Comment from ExpressoBold Pureblood
Time: April 15, 2026, 8:31 pm

Image-hosting is worthless of they don’t maintain the original or better resolution. I hate looking at a 1920 X 1080 image rendered as 640 X 480.

There are some image-only, member-only websites for the distant past I’d like to look into.


Comment from Bob Mulroy
Time: April 15, 2026, 9:09 pm

Maybe we could all donate our unused Google and Amazon storage?


Comment from Rich Rostrom
Time: April 15, 2026, 10:47 pm

The other issue is linkrot. The target still exists, but the host site has been reorganized and the URL is different. This can be serious. For instance, for several years now, Supreme Court decisions have included links to online content – even including video. If those links rot…

Write a comment

(as if I cared)

(yeah. I'm going to write)

(oooo! you have a website?)


Beware: more than one link in a comment is apt to earn you a trip to the spam filter, where you will remain -- cold, frightened and alone -- until I remember to clean the trap. But, hey, without Akismet, we'd be up to our asses in...well, ass porn, mostly.


<< carry me back to ol' virginny