Online content cannot escape the "short-lived" outcome: the study found that 38% of websites in 2013 are currently inaccessible

2024/5/19 20:44:00 Source: IT Home Author: Qingyuan Editor in charge: Qingyuan
Thanks to the netizens of IT Home Coje_He Our clues are delivered!

IT Home On May 19, it was reported that the Internet is an unimaginable treasure house in modern life, but even if users around the world rely on the Internet every day to obtain books, pictures, news reports and other resources, these contents may eventually disappear from people's sight. The Pew Research Center published a study on the 17th local time and found that the current online content is still "fleeting".

Source Pixabay

The researchers randomly collected about 1 million web pages from the archives of Common Crawl (an Internet archive service that regularly collects snapshots of websites at different time points). From 2013 to 2023, they will take samples from the web pages collected by Common Crawl every year and check whether these samples still exist at present.

According to the data, about 38% of the web pages existing in 2013 are now unavailable, while only 8% of the web pages existing in 2023.

Of all the pages we collected from 2013 to 2023, 25% will be inaccessible by October 2023. The results of these pages that cannot be accessed include two different situations: 16% of the pages are not in the domain name; The other 9% of the pages don't even have domain names.

Specifically, among the news websites sampled, 23% of web pages contain at least one invalid link High flow news websites and low flow news websites contain basically the same proportion of bad links. Among the top 20% of news websites with website traffic, about 25% of the pages contain at least one invalid link; Among the websites in the bottom 20% of website traffic, the data is 26%.

In terms of encyclopedia websites, the researchers chose the "reference link" part of Wikipedia. Of the more than 1 million reference links in total, 11% are no longer accessible 54% of Wikipedia page reference links contain at least one invalid link.

In terms of social networking sites, researchers published nearly 5 million samples of tweets on the social media platform X (Twitter) from March 8 to April 27, 2023, and tried to observe whether these tweets were still on the site until June 15, 2023. The study found that nearly 18% of tweets were almost invisible within three months. 60% of them are because the account is set as private, frozen or cancelled Forty percent of the accounts are deleted by the account publisher, but the account itself is still active.

reference resources

Advertising statement: The external jump links (including but not limited to hyperlinks, QR codes, passwords, etc.) contained in the text are used to convey more information and save selection time. The results are for reference only. All articles of IT Home contain this statement.

Related articles

key word: internet website

Website of soft media: IT Home Spicy food - super value shopping guide, coupons Best buy rebate coupon IPhone Home Win7 Home Win10 Home Win11 Home

Software under soft media: Rubik's cube Cool desktop Win7 optimization master Win10 optimization master Soft media mobile phone APP application