Cache of Internet sites: how to find information deleted from the Web

The Internet is absolutely not permanent. Due to various circumstances (power line breaks, hoster bankruptcy, domain non-payment), any site may stop working. In the browsers of users after that, only messages about the inaccessibility of a favorite resource will be displayed. If the site changes beyond recognition, and the administration removes the page with important information, the resource will continue its work, but the end user will not be in trouble in this case.

Do not worry and curse the evil rock. Perhaps the portal is temporarily unavailable, and specialists are busy restoring its work. In addition, each user of the Global Network has a powerful tool that will allow you to get the necessary information - a cache of sites.

Google

Google is a mega-corporation whose server capacities are able to constantly scan the Internet for new pages and changes to old ones. By adding resources to their database, algorithms not only index sites, but also take pictures of them. Roughly speaking, Google backs up the Internet in case the source material becomes unavailable.

The Google Sites cache is accessible to everyone without exception. To access any indexed page, you must enter the query in the search line: [cashe: http: // full link to the page]. A copy of the page will be displayed on the screen, the following information will be displayed at the top of the screen:

  • The date of the last save, which will make it possible to judge whether the information provided could change.
  • Here is a link to the picture, which contains only text.
  • Another URL will show the complete source code that will interest webmasters.

site cache

Owners of resources on the Internet need to know that the cache of Google’s sites is a voluntary system to use. If you need to exclude any pages of your portal from the list of saved ones, you can prohibit taking pictures. To do this, add the meta tag <meta content = "noarchive"> to the page. You can also prohibit or enable caching in your office, if you have the appropriate account.

If you need to delete already saved pictures from the Google cache, you will need to send an email with a request, and then confirm your rights to the site.

Yandex

In second place in the list of companies that maintain the cache of sites is the domestic industry giant. The coverage of Yandex is much less, so it’s worth looking here mainly for pictures of large resources with high attendance.

Just enter the URL of the page you want into the search bar and press ENTER. The search results will show the site you need in the first place. Next to the link to it will be an icon in the form of a triangle. By clicking on it and choosing the menu item “Saved Copy”, you will open the last available snapshot of the page.

The wayback machine

In 1996, Brewster Cale opened a nonprofit organization, now called the Internet Archive. The company collects copies of web pages, videos, graphic images, audio recordings, software. The collected material is archived, and anyone can get free access to it.

The main goal of The Wayback Machine is to preserve the cultural values ​​created by civilization after the wide spread of the Internet, and to create the most complete electronic library of mankind. Currently, the Archive stores over 10 petabytes of data, which allows users to familiarize themselves with 85 billion web pages. This means Archive is the most comprehensive cache of sites.

cache sites on the Internet

Archive.org is the organization’s site, you can try to find a snapshot of the necessary page on it. Since not only the last copy is saved, but the bot looks at the resources periodically, you can study all the changes made on a particular page over time, even if the site no longer exists. It is advisable to use the WWW prefix in the search bar.

Dead url

google sites cache

Dead Address provides users with similar capabilities. Copy the non-working URL from the address bar and paste it into the input field on the site. The service will think for a bit and give a few results. Some of them will refer to a resource of the Google company. Another part will lead the user to the pages of the Archive. Importantly, the sites cache is sorted by date, which is very convenient.

Down or not

If you need a cache of sites on the Internet due to the inaccessibility of a particular resource, but searches do not lead to anything, it is worth checking to see if there is a problem near you. For example, an Internet service provider performs technical work or replaces outdated equipment. To check who is to blame, it makes sense to use the Down Or Not service (alive or not).

sites cache by date

Enter the address of the portal you need in the search bar and press the ENTER button. After a short analysis, the service will produce a result. The word DOWN indicates the unavailability of the resource (temporary or permanent), if the word UP appears on the screen, then everything is in order with the portal.

Down Ot Not acts as a third-party and unbiased expert to determine what exactly is the source of the problem.

Source: https://habr.com/ru/post/C16893/


All Articles