Tuesday, October 11, 2011

Manual: How to remove information from Google?

Everyone wants their information to be in Google; in short, everyone wants to be findable. But an interesting and topical question is: how can information be removed from Google's search results?

I answer that question in this guide, which is divided into topics so you can quickly jump to the one that is relevant to you:

     How do you remove information from Google?
     Which method should I use, and why?
     Requirements for removal from Google
     Removing third-party information from Google
     The Google cache
     How to remove personal information from Google?
     Google search history
     Removing other information from Google

How do you remove information from Google?

Let me start with the good news: you can control which content of your website is included in the search index.

As a website owner, you can specify in the robots.txt file or in robots meta tags which information on your website you do NOT want indexed by search engines.
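
To make the robots.txt part concrete, here is a minimal sketch in Python that checks whether a given URL is excluded for a crawler. The robots.txt content, the example.com URLs and the helper name is_blocked are assumptions made up for illustration; only the standard-library parser is real.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an example site (not taken from a real website).
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /old-page.html
"""


def is_blocked(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the robots.txt rules disallow user_agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


print(is_blocked(ROBOTS_TXT, "http://www.example.com/private/report.html"))  # True
print(is_blocked(ROBOTS_TXT, "http://www.example.com/index.html"))           # False
```

This only shows what a well-behaved crawler is asked not to fetch; it says nothing about what is already in the index.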

Sometimes information has already been indexed before you excluded it from indexing, for example through the robots.txt file. So how do you get that information out of the search engines?

Let's take a close look at the options for removing information from Google.

     Tip: For more background information, see the guide "How does a search engine work?".

Active vs. passive removal

You can simply delete a particular page from your site, but Google sometimes keeps it stored in its index and cache for months.


It is possible to keep information out of Google's index both passively, using robots.txt or robots meta tags, and actively, using Google Webmaster Tools (your own sites only) or a removal request to Google for information on other sites (both require a Google Account).

Google Webmaster Tools

Through Google Webmaster Central, Google offers a variety of tools that give webmasters extensive insight into the way Google indexes their sites. Google Webmaster Tools can only be used for your own website(s).

In Google Webmaster Tools you can easily have individual URLs (web pages, images or other files), folders and subfolders, your entire website, or the stored copy (cached version) removed from Google's search results.

Which method should I use, and why?

As mentioned, you have several options, both passive and active, for removing information from the search engines. The following chart, prepared by Danny Sullivan, gives a good overview of which method is suitable for which purpose. Yahoo's tool (Yahoo Site Explorer) is included as well.



Stop Crawling: If "Yes", the search engine stops crawling everything that is excluded. "No" means that the crawler may still read the page, but the page is NOT included in the index. That does not mean the URL cannot appear in the search results! See "Link Only" Listing for more information.

Stop Indexing: The excluded URLs are not stored in the search engine's index. Again, the URL itself may still appear in the search results.

Stop "Link Only" Listing: In this case the page is not indexed, but Google knows the URL from inbound links, so the URL still appears in the search results! See this example of Auping.nl. (Read the explanations from Google and Yahoo.)

Requirements for removing a URL

You cannot just remove anything from Google. The URL that holds the information you want removed from Google must meet one of the following requirements:

     It must return a 404 (not found) or 410 (gone) status code
     The URL must be blocked in robots.txt
     It must contain the robots meta tag with the value "noindex"
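
The three requirements above can be sketched as a small check in Python. This is only an illustration under assumptions: the function name removal_reasons, the class RobotsMetaParser and the example.com URL are made up here, and the status code, HTML and robots.txt text are passed in by hand rather than fetched from a live site. It does not reproduce how Google itself evaluates a removal request.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives of every <meta name="robots"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            for directive in content.split(","):
                if directive.strip():
                    self.directives.add(directive.strip().lower())


def removal_reasons(status_code, html, robots_txt, url, user_agent="Googlebot"):
    """Return the list of requirements (possibly empty) that the URL meets."""
    reasons = []
    if status_code in (404, 410):
        reasons.append("status %d" % status_code)
    robots = RobotFileParser()
    robots.parse(robots_txt.splitlines())
    if not robots.can_fetch(user_agent, url):
        reasons.append("blocked in robots.txt")
    meta = RobotsMetaParser()
    meta.feed(html)
    if "noindex" in meta.directives:
        reasons.append("noindex meta tag")
    return reasons


# Hypothetical inputs, for illustration only.
URL = "http://www.example.com/old.html"
PAGE = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'

print(removal_reasons(200, PAGE, "", URL))
# ['noindex meta tag']
print(removal_reasons(410, "", "User-agent: *\nDisallow: /old.html\n", URL))
# ['status 410', 'blocked in robots.txt']
```

An empty list means the URL meets none of the requirements, so a removal request for it would not qualify.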

How long before the information is removed?

If one of the above requirements is met, the URL is removed as soon as possible after the request (for example, through Google Webmaster Central).
Google states that this takes 3 to 5 days, but it usually goes faster. I have also seen cases where it happened within half a day.

How long does the URL stay removed?

From that moment, the URL stays out of the Google index for six months (180 days), unless you submit a reinclusion request to get back into the index.
After six months Google crawls the URL again and then indexes it, unless the information is still excluded from indexing by the robots.txt file or a robots meta tag.

Removing third-party information from Google

The options above apply to a website that is under your full control. It may also be, however, that you want information removed from Google that is on websites you do not control.
In that case you submit the removal request to Google mentioned above (a Google Account is required).

Having a third-party page removed from Google

You may have asked the webmaster of a website to change or delete a page, and the webmaster has neatly done so through robots.txt or a robots meta tag.
As described above, however, it can then still take weeks to months before the information is removed from the Google index.
In that case you can submit a request to remove the URL, which simply speeds up removal from the Google index. Note: this only works if the URL meets the requirements listed above!

Removing a cached version of a page from Google

Even if a URL is excluded from indexing (via robots.txt or robots meta tags), and even if the removal process has been accelerated with the tools discussed above, a cached version of the information can still exist.
As a website owner you can easily have the cached version removed with Google Webmaster Tools, or prevent it with the value "noarchive" in the robots meta tag.
On each page that you want to keep out of the Google cache, insert the following code into the HEAD section of your HTML document:

     <meta name="robots" content="noarchive">

How to remove personal information from Google?

If a page contains your personal details and the webmaster concerned will not delete them, you can ask Google to remove the personal information (via the aforementioned tool). The personal information must be one of the following:

     Your Social Security number
     Your bank account or credit card number
     An image of your signature
     Explicit content that violates Google's guidelines and includes your personal information

Google search history

Interesting in this context is a side trip to the search history that Google keeps about you. This information is not in the Google index, but you may still want to delete (or disable) your Google search history.
A more detailed explanation, with instructions for removing or disabling the Google search history, can be found in my web history guide "Google search history (and consequences for results and privacy)".

Removing other information from Google

Google offers an overview of all the types of information you can remove from the Google index, such as removing an image from Google Images or removing a blog from Google Blog Search.
 
