Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
URLs dropping from index (Crawled, currently not indexed)
-
I've noticed that some of our URLs have recently dropped completely out of Google's index.
When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'.
Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case.
I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people.
Here are a few examples of the URLs that have gone missing:
https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training
https://www.ihasco.co.uk/courses/detail/conflict-resolution-training
https://www.ihasco.co.uk/courses/detail/prevent-duty-training
Any help here would be massively appreciated!
-
My website is facing the same issue.
-
It seems like this issue is quite common lately. I have experienced something similar with some pages on my site InstPro.net, which are not getting indexed properly either. Any advice would be appreciated.
-
@Philljones22 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here: most of my URLs are getting de-indexed after being indexed in Search Console.
I'm experiencing the same problem. Most of my URLs are getting de-indexed after being indexed by the search console.
https://www.stardewvalleyapk.me/
https://www.stardewvalleyapk.me/stardew-valley-mod-apk/
-
I don't know why, but I have been facing the same issue for the past 3 months.
Most of my URLs are getting de-indexed after being indexed in Search Console.
-
@kingshah001 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here: most of my URLs are getting de-indexed after being indexed in Search Console.
Same issue here: most of my URLs are getting de-indexed after being indexed in Search Console.
https://inshotproapps.com
https://instoproapps.com/inshot-for-pc/
-
Thanks for sharing details.
-
Same issue here: most of my URLs are getting de-indexed after being indexed in Search Console.
Some of the URLs are below:
https://apkcroc.com/
https://apkcroc.com/vn-mod-apk/
https://apkcroc.com/kinemaster-mod-apk/
https://apkcroc.com/terragenesis-mod-apk/
https://apkcroc.com/sky-fighters-3d-mod-apk/
-
My site is also a victim of the same issue. I'm collecting bits of actionable advice and plan to post my experience on the Moz forum soon.
-
Hello,
Ever since ladykiller.nl started, I have had the same issues with Google crawling the sitemap(s) and indexing URLs. I am using Yoast as the plugin for the sitemap.
At the moment 3,620+ URLs are indexed, but my website has 10,000+ URLs :(.
Also, from time to time I get a notice in GSC that Google cannot fetch certain sitemap URLs, e.g. https://ladykiller.nl/post-sitemap.xml. The issue is mostly fixed after a week or so. Please find a screenshot here: https://prnt.sc/Pm_h2Arjxu-k
I have already asked on numerous forums for help, as I cannot find a solution to this problem, but without any good results so far.
Therefore, I am trying it here again in the hope maybe some of you guys have some better understanding of what the issue might be and how it can be fixed. All help is highly appreciated!
Thanks in advance for having a look into it :)!
Warm regards,
John
-
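For a gap like the one described above (far fewer URLs indexed than the sitemap lists), it can help to work out exactly which sitemap URLs are missing. The following is only a minimal sketch: the function names, the sample sitemap, and the set of indexed URLs are all hypothetical, and it assumes you already have a list of indexed URLs from somewhere (for example, an export from GSC) and a copy of the sitemap XML.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard XML sitemaps (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text: str) -> list[str]:
    """Return every <loc> value found in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


def missing_from_index(sitemap: list[str], indexed: set[str]) -> list[str]:
    """Return the sitemap URLs that do not appear in the indexed set."""
    return [url for url in sitemap if url not in indexed]


# Made-up sample data for illustration only.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/a/</loc></url>
  <url><loc>https://example.com/b/</loc></url>
  <url><loc>https://example.com/c/</loc></url>
</urlset>"""

# Hypothetical: URLs you know to be indexed, e.g. from an export.
INDEXED = {"https://example.com/a/"}

if __name__ == "__main__":
    urls = sitemap_urls(SAMPLE_SITEMAP)
    for url in missing_from_index(urls, INDEXED):
        print("not indexed:", url)
```

The resulting list is a starting point for spotting patterns, such as whether the missing URLs share a template or a section of the site.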
Hi there,
The third URL you referenced is actually indexed:
https://dmitrii-regexseo.tinytake.com/tt/NDY4NDY4N18xNDgzNjgzMA
As for "crawled, not indexed": in most cases it happens for one reason. Google sees your page as thin content that isn't worth indexing. This typically happens on bigger sites with a lot of similar pages. In your case, you have many courses with exactly the same structure, so if the content isn't substantially different from page to page, Google might deem it not worthy.
As for the bug you referenced: did your URLs drop out of the index at exactly the time this issue was discovered (i.e. within the last week)?
Do you have any cannibalization happening?
To me it looks like that's the case. If I run this search: "site:https://www.ihasco.co.uk/ Sexual Harassment Training course"
there are many pages that are indexed and ranking: https://dmitrii-regexseo.tinytake.com/tt/NDY4NDcwN18xNDgzNjg4Mg
So, basically, you have more authoritative pages with similar content. Therefore your course pages are being dropped as thin content.
I would recommend doing some internal linking optimization to tell Google what is actually important. Look in GSC for the internal links metrics.
Hope this helps.
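To get a rough feel for how similar two pages are (the "thin content" concern above), one option is to compare their visible text using word shingles and Jaccard similarity. This is just an illustrative sketch, not how Google evaluates content: the function names, the shingle size of 3, and the sample texts are assumptions, and you would need to extract each page's main text yourself first.

```python
import re


def shingles(text: str, k: int = 3) -> set[tuple[str, ...]]:
    """Split text into lowercase words and return the set of k-word sequences."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of the two texts' shingle sets (0.0 to 1.0)."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 0.0  # both texts too short to form any shingle
    return len(sa & sb) / len(sa | sb)


# Made-up page texts for illustration only.
PAGE_A = "This course covers sexual harassment awareness at work"
PAGE_B = "This course covers conflict resolution at work"

if __name__ == "__main__":
    print(f"similarity: {jaccard(PAGE_A, PAGE_B):.2f}")
```

Pairs that score close to 1.0 share most of their wording and are candidates for consolidating or differentiating; where to draw the line is a judgment call.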