Best practice for removing indexed internal search pages from Google?

HrThomsen

Hi Mozzers

I know that it’s best practice to block Google from indexing internal search pages, but what’s best practice when “the damage is done”?

I have a project where a substantial part of our visitors and income lands on an internal search page, because Google has indexed them (about 3 %).

I would like to block Google from indexing the search pages via the meta noindex,follow tag because:

Google Guidelines: “Use robots.txt to prevent crawling of search results pages or other auto-generated pages that don't add much value for users coming from search engines.” http://support.google.com/webmasters/bin/answer.py?hl=en&answer=35769
Bad user experience
The search pages are (probably) stealing rankings from our real landing pages
Webmaster Notification: “Googlebot found an extremely high number of URLs on your site” with links to our internal search results

I want to use the meta tag to keep the link juice flowing. Do you recommend using the robots.txt instead? If yes, why?

Should we just go dark on the internal search pages, or how shall we proceed with blocking them?

I’m looking forward to your answer!

Edit: Google have currently indexed several million of our internal search pages.

italominano

Hello,

Sorry for the late answer, I have the same problem and I think I found the solution. For me works this:

1. Add meta tag robots No Index , Follow for the internal search pages and wait for Google remove it from the index.

Be careful if you do **BOTH (**Adding meta tag robots and Disallow in Robots.txt ) Because of this:

Please note that if you do both: block the search engines in robots.txt and via the meta tags, then the robots.txt command is the primary driver, as they may not crawl the page to see the meta tags, so the URL may still appear in the search results listed URL-only. Souce: http://tools.seobook.com/robots-txt/

I hope this information can help you.

MagicDude4Eva

I would honestly exclude all your internal search pages from the Google index via robots.txt (noindex) exclusion. This will at least re-distribute crawl-time to other areas of your site.

Just having the noindex,follow in the meta-tag (without the robots.txt exclusion) will let GoogleBot crawl the page and then eventually remove it from the index.

I would also change your search-page canoncial to the search term (i.e. /search/iphone) and then have a noindex,follow on meta-tag.

AdamThompson

It sounds like the meta noindex,follow tag is what you want.

robots.txt will block googlebot from crawling your search pages, but Google can still keep the search pages in its index.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

Best practice for removing indexed internal search pages from Google?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Google Image Search - Is there a way to influence the related icons at the top of the image search results?

Mass Removal Request from Google Index

Pages are Indexed but not Cached by Google. Why?

Dev Subdomain Pages Indexed - How to Remove

Links from non-indexed pages

Google Indexing Feedburner Links???

Google is indexing wordpress attachment pages

Best way to de-index content from Google and not Bing?

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved