Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Image Indexing Issue by Google
-
Hello All,My URL is: www.thesalebox.comI have Submitted my image Sitemap in google webmaster tool on 10th Oct 2013,Still google could not indexing any of my web images,Please refer my sitemap - www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xmland my webmaster status and image indexing status are below,
Can you please help me, why my images are not indexing in google yet? is there any issue? please give me suggestions?Thanks!
-
Hi there, I'm just checking in to see what the current status of this issue is. Please let us know, thanks!
Christy
-
Hi there, you've received a lot of thoughtful responses. Did any of them answer your question? Please let us know, thanks!
Christy
-
Hi Sorina,
Yes, That i can do, i will and let you update, whether it's work or not
Thanks for your suggestions
-
As I said, you can add reference to your sitemaps in the robots.txt file:
At the end of the file http://www.thesalebox.com/robots.txt add the following lines:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml -
Hi, I have seen a situation before where GWT says that no images are indexed but they have indexed them. I don't know why.
Checking Google directly, by searching site:thesalebox.com and then clicking the Image tab shows that Google do have images indexed on your site, maybe not all, but there are some so maybe more are being indexed:
Peter
-
Hi Peter,
Thanks for your valuable suggestions,
But i would like to index image with sub domain path,
I have already verified this domain into Google Webmaster Tool and check Robotos.txt to block, but all things working proper,
Now can you please assist me still images are not indexing and How much time google will taken in first time.
Thanks,
-
Hi Sorina,
Thanks for the focus on google webmaster policy about image indexing with sub domain.
=> I have already verified my Sub domain http://pics.thesalebox.com in to Google Webmaster Tool.
=> Also, I have already added sitemap in to this account.
Please check following links for more informations,
http://pics.thesalebox.com/ShopByDepartment.xml
http://pics.thesalebox.com/SportingGoods.xml=> I have also verified current robots.txt to block this path, but there is no problem.
http://pics.thesalebox.com/robots.txt
Is there other way still i missing to work on it. please suggest me.
Thanks,
-
Here is a quote from Google's Webmasters Help:
In some cases, the image URL may not be on the same domain as your main site. This is fine, as long as both domains are verified in Webmaster Tools. If, for example, you use a content delivery network (CDN) to host your images, make sure that the hosting site is verified in Webmaster Tools OR that you submit your Sitemap using robots.txt. In addition, make sure that your robots.txt file doesn’t disallow the crawling of any content you want indexed.
Source: https://support.google.com/webmasters/answer/178636
According to the above, now that you have also verified the subdomain where you are hosting your images you should be fine.
You don't have to submit the sitemap to the GWT account of the subdomain where you host your images, but you may add reference to your sitemaps in the robots.txt located in the root folder of your website, by adding something like this to the robots.txt file:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml -
Hi Will2112,
Thanks for focus on robots.txt, I have double check that all things that block by robots or not, but it's seems look perfect,
is there another suggestions?
Thanks!
-
Hi Sorina,
Thanks for your reply,
Yes, I have submitted http://pics.thesalebox.com into google WMT and verified and submitted same sitemap.
Now can you please look in to more in this issue??
Thanks!
-
Yes, if your images are on a CDN server you must add to GWT that subdomain too in order to be able to see if the images are indexed by Google or not.
-
If my images are hosted on a CDN server, would I need to add that subdomain to Webmaster Tools as well?
I have a site with lots of images and I can confirm that image indexing takes much longer than the regular webpages to be indexed. I see that your robots.txt has a lot of Disallows on it. Is it possible that you are blocking indexing of those images from the robots.txt?
-
Hi,
I noticed your images are all hosted on a subdomain, http://pics.thesalebox.com. Did you added this subdomain to Google Webmaster Tools?
-
Hi, from experience it can take Google quite a time to index images on a site and if this is the first time you have submitted a sitemap that is probably going to be a factor as well.
Just one thing though with the images on your site. The ecommerce CMS system you are using is not helping interest by search engines in the images because the images don't have a descriptive title. This is one I found on the home page: http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg - the image is named: 100706555.jpg which although you have used alt tags on your images the non-descriptive image name doesn't help. Neither does the depth of your URLs - the image is located 10 folders down.
I hope that helps,
Peter
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URLs dropping from index (Crawled, currently not indexed)
I've noticed that some of our URLs have recently dropped completely out of Google's index. When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'. Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case. I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people. Here are a few examples of the URLs that have gone missing: https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training https://www.ihasco.co.uk/courses/detail/conflict-resolution-training https://www.ihasco.co.uk/courses/detail/prevent-duty-training Any help here would be massively appreciated!
Technical SEO | | iHasco0 -
Is there a way to get a list of all pages of your website that are indexed in Google?
I am trying to put together a comprehensive list of all pages that are indexed in Google and have differing opinions on how to do this.
Technical SEO | | SpodekandCo0 -
Google tries to index non existing language URLs. Why?
Hi, I am working for a SAAS client. He uses two different language versions by using two different subdomains.
Technical SEO | | TheHecksler
de.domain.com/company for german and en.domain.com for english. Many thousands URLs has been indexed correctly. But Google Search Console tries to index URLs which were never existing before and are still not existing. de.domain.com**/en/company
en.domain.com/de/**company ... and an thousand more using the /en/ or /de/ in between. We never use this variant and calling these URLs will throw up a 404 Page correctly (but with wrong respond code - we`re fixing that 😉 ). But Google tries to index these kind of URLs again and again. And, I couldnt find any source of these URLs. No Website is using this as an out going link, etc.
We do see in our logfiles, that a Screaming Frog Installation and moz.com w opensiteexplorer were trying to access this earlier. My Question: How does Google comes up with that? From where did they get these URLs, that (to our knowledge) never existed? Any ideas? Thanks 🙂0 -
Vanity URLs are being indexed in Google
We are currently using vanity URLs to track offline marketing, the vanity URL is structured as www.clientdomain.com/publication, this URL then is 302 redirected to the actual URL on the website not a custom landing page. The resulting redirected URL looks like: www.clientdomain.com/xyzpage?utm_source=print&utm_medium=print&utm_campaign=printcampaign. We have started to notice that some of the vanity URLs are being indexed in Google search. To prevent this from happening should we be using a 301 redirect instead of a 302 and will the Google index ignore the utm parameters in the URL that is being 301 redirect to? If not, any suggestions on how to handle? Thanks,
Technical SEO | | seogirl221 -
Google will index us, but Bing won't. Why?
Bing is crawling our site, but not indexing it, and we cannot figure out why -- plus it's being indexed fine in Google. Any ideas on what the issue with Bing might be? Here's are some details to let you know what we've already checked/established: We have 4 301’s and the rest of our site checks out We’ve already established our Robots is ok, and that we are fixing our site map/it's in fine shape We do not see anything blocking bingbot access to the site There is no varnish or any load balancers, so nothing on that end that would be blocking the access We also don't see any rules in the apache or the .htaccess config that would be blocking the access
Technical SEO | | Alex_RevelInteractive1 -
Fake Links indexing in google
Hello everyone, I have an interesting situation occurring here, and hoping maybe someone here has seen something of this nature or be able to offer some sort of advice. So, we recently installed a wordpress to a subdomain for our business and have been blogging through it. We added the google webmaster tools meta tag and I've noticed an increase in 404 links. I brought this up to or server admin, and he verified that there were a lot of ip's pinging our server looking for these links that don't exist. We've combed through our server files and nothing seems to be compromised. Today, we noticed that when you do site:ourdomain.com into google the subdomain with wordpress shows hundreds of these fake links, that when you visit them, return a 404 page. Just curious if anyone has seen anything like this, what it may be, how we can stop it, could it negatively impact us in anyway? Should we even worry about it? Here's the link to the google results. https://www.google.com/search?q=site%3Amshowells.com&oq=site%3A&aqs=chrome.0.69i59j69i57j69i58.1905j0j1&sourceid=chrome&es_sm=91&ie=UTF-8 (odd links show up on pages 2-3+)
Technical SEO | | mshowells0 -
Can you noindex a page, but still index an image on that page?
If a blog is centered around visual images, and we have specific pages with high quality content that we plan to index and drive our traffic, but we have many pages with our images...what is the best way to go about getting these images indexed? We want to noindex all the pages with just images because they are thin content... Can you noindex,follow a page, but still index the images on that page? Please explain how to go about this concept.....
Technical SEO | | WebServiceConsulting.com0 -
How to Remove /feed URLs from Google's Index
Hey everyone, I have an issue with RSS /feed URLs being indexed by Google for some of our Wordpress sites. Have a look at this Google query, and click to show omitted search results. You'll see we have 500+ /feed URLs indexed by Google, for our many category pages/etc. Here is one of the example URLs: http://www.howdesign.com/design-creativity/fonts-typography/letterforms/attachment/gilhelveticatrade/feed/. Based on this content/code of the XML page, it looks like Wordpress is generating these: <generator>http://wordpress.org/?v=3.5.2</generator> Any idea how to get them out of Google's index without 301 redirecting them? We need the Wordpress-generated RSS feeds to work for various uses. My first two thoughts are trying to work with our Development team to see if we can get a "noindex" meta robots tag on the pages, by they are dynamically-generated pages...so I'm not sure if that will be possible. Or, perhaps we can add a "feed" paramater to GWT "URL Parameters" section...but I don't want to limit Google from crawling these again...I figure I need Google to crawl them and see some code that says to get the pages out of their index...and THEN not crawl the pages anymore. I don't think the "Remove URL" feature in GWT will work, since that tool only removes URLs from the search results, not the actual Google index. FWIW, this site is using the Yoast plugin. We set every page type to "noindex" except for the homepage, Posts, Pages and Categories. We have other sites on Yoast that do not have any /feed URLs indexed by Google at all. Side note, the /robots.txt file was previously blocking crawling of the /feed URLs on this site, which is why you'll see that note in the Google SERPs when you click on the query link given in the first paragraph.
Technical SEO | | M_D_Golden_Peak0