How to Disallow Tag Pages With Robots.txt

monster99

Hi, I have a site I'm dealing with that has tag pages, for instance:

http://www.domain.com/news/?tag=choice

How can I exclude these tag pages (about 20+ are being crawled and indexed by the search engines) with robots.txt?

Also, they're sometimes created dynamically, so I want something that automatically excludes tag pages from being crawled and indexed.

Any suggestions?

Cheers,

Mark

monster99

Hi Nakul, it's Drupal.

Mark

NakulGoyal

What CMS is it, Mark?

monster99

Thanks. Is there a way to test it out before actually implementing it on the site?

The site is non-WordPress as well.

Cheers,

Mark

NakulGoyal

I agree. I would suggest adding noindex to the pages and letting the bots crawl them. Blocking them in robots.txt would prevent future crawling of these pages, but I am guessing you also want to remove the pages that are already indexed, and a bot that is blocked from crawling a page never sees its noindex.

Therefore, add the noindex first, wait a few days for the pages to be recrawled, and then add the disallow (although technically, if they are noindex, you don't really need the disallow).
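
Since the tag pages are created dynamically, the noindex has to be applied by rule rather than page by page. A minimal sketch of that rule in Python, purely to illustrate the logic (the site is Drupal, so the real change would live in the CMS or server config; the function name robots_header_for is made up for this example). It assumes a tag page is any URL with a "tag" query parameter, and it uses the X-Robots-Tag response header, which works like the robots meta tag:

```python
from urllib.parse import urlsplit, parse_qs


def robots_header_for(url: str) -> dict:
    """Return the extra response headers to send for a given URL.

    Assumption: a "tag page" is any URL carrying a `tag` query parameter,
    as in http://www.domain.com/news/?tag=choice.
    """
    # keep_blank_values=True so that "?tag=" with no value still counts.
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    if "tag" in query:
        # noindex keeps the page out of the index; follow lets bots
        # keep following the links on it.
        return {"X-Robots-Tag": "noindex, follow"}
    return {}


print(robots_header_for("http://www.domain.com/news/?tag=choice"))
print(robots_header_for("http://www.domain.com/news/"))
```

The first call returns the noindex header and the second returns an empty dict, so ordinary news pages stay indexable.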

DeanAndrews

Hi Mark

If you're using WordPress, I would recommend Yoast SEO to resolve the tag issue. If not, I suggest amending the robots.txt file instead.

Here is an example:

User-agent: *
# Leading wildcard so the rule also matches /news/?tag=choice
Disallow: /*?tag=
Disallow: /?subcats=
Disallow: /*?features_hash=

NOTE:

Be very careful when blocking search engines. Test and test again!
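
One way to test a rule before it goes live is to check it against real URLs offline. The sketch below is a simplified approximation of how the major search engines match Disallow patterns (prefix matching, "*" for any run of characters, a trailing "$" to anchor the end). The function name rule_matches is made up for this example, and it ignores Allow/Disallow precedence and percent-encoding, so treat it as a sanity check rather than a replacement for the search engines' own testing tools:

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path + query."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with ".*" so that
    # every "*" in the pattern means "any run of characters".
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match only anchors at the start, which gives prefix matching.
    return re.match(regex, path) is not None


# The path to test is everything after the domain name.
for pattern in ("/?tag=", "/*?tag="):
    print(pattern, rule_matches(pattern, "/news/?tag=choice"))
```

This prints False for "/?tag=" and True for "/*?tag=": without the wildcard the rule only covers tag URLs at the site root, not ones under /news/.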
