Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Does google scrape links from PDF files? do these links pass link juice?
-
Title is pretty much the whole question.
-
I made a test and it seems that yes, the links from pdf count for ranking.
The test is on my Romanian blog http://seogan.ro/link-building-pdf-urile-o-sursa-de-linkuri-test
You can find an English translation here: http://www.seogan.com/pdf-link-building
Hope it helps.
-
Yes it does according to Google tech spec http://code.google.com/apis/searchappliance/documentation/50/admin_crawl/Introduction.html
which specifically states if follows html links in pdf 'It follows HTML links in PDF files, Word documents, and Shockwave documents'. Google's own api docs carry more weight than a comment in a forum_._ If they are licencing this out as an application it would suggest that the same technology is available in the main engine as does Dunamis's comment about a listing in a pdf document being found in search results.
You can test for youself by publishing a pdf with a link to a info page that does not show up in any other links. Include the pdf in your sitemap but not the test page and check if it shows in googles index site:yoursite.com the next time it crawls.
This also gives some insight in an interview with Matt Cutts - http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Eric Enge: What about PDF files?
Matt Cutts: We absolutely do process PDF files. I am not going to talk about whether links in PDF files pass PageRank. But, a good way to think about PDFs is that they are kind of like Flash in that they aren't a file format that's inherent and native to the web, but they can be very useful. In the same way that we try to find useful content within a Flash file, we try to find the useful content within a PDF file. At the same time, users don't always like being sent to a PDF. If you can make your content in a Web-Native format, such as pure HTML, that's often a little more useful to users than just a pure PDF file.
-
Google definitely does index the contents of pdf files. I found this out the hard way as I had a real estate pdf on my site that I wanted to have listed in the index, but I didn't know that the contents would be crawled. The pdf contained some listings that I was not legally allowed to advertise on my site. (It was legal for me to give someone a report with the listings in it though).
When another realtor was searching for their own listing, my pdf came up. I got in trouble. I'm ok now though.
-
Have a look at this article http://searchenginewatch.com/article/2067225/Google-Does-PDF-Other-Changes it explains some of the doc library search for pdf files and Google's statement here http://googleblog.blogspot.com/2008/10/picture-of-thousand-words.html.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Value of Links? What is each link worth?
Morning Everyone, I just had this thought and wondered what everyone's opinions were in terms of link value in monetary terms. We'll assume for the purposes of this that the links come from contextually relevant sites and that the sites in question have got the Moz DA from being high quality and have a good quality incoming link profile. Its a bit of a theoretical question, but i guess imagine if the only way you could get links was to pay for them, what would they be worth to you. This is link value for SEO purposes, they will have in addition value from traffic from good sites, that no doubt varies wildly depending on topic. I assume everyone also agrees on: The first link from a domain is the most valuable High DA sites are worth more than low ones. So could anyone who has an opinion on the link value suggest a monetary value for links. Its really just using a monetary amount to see how best to target my time. Here is my example of what might be expected, but I am hoping people with more knowledge will perhaps correct it. DA Rating First Link 2nd-5th Link 5th-10th Link 10Plus Links 5 $5 $2 $1 $0 15 $7 $3 $2 $1 25 $25 $10 $5 $2 35 $45 $20 $7 $3 45 $65 $30 $11 $4 55 $95 $45 $19 $5 65 $200 $100 $45 $6 75 $350 $120 $65 $9 85 $700 $240 $95 $15 95 $1100 $450 $200 $30
Link Building | | wellandpower1 -
If I disavow bad links on "disavow link webmaster" will they still show up on my moz reports?
We recently found out we have a lot of bad links linking back to our website from spam sites, I disavowed them through the google disavow link webmaster. On my moz report it still shows the links, is that normal?
Link Building | | Ryan.Cruz0 -
Does the ratio of external nofollow links to external "do follow" links matter in terms of SERPs ranking?
My site has an external link nofollow:dofollow ratio of approximately 1:1 That is, there are about as many nofollow external links as "do follow" external links. I have an impression that the ratio of no-follow to "do follow" links is a factor in the way that our website shows up in SERPs. I have the impression from reading a variety of sources, and from looking at Seomoz, that calculate "trust" factors as if they mattered (in SERPs), that seem to value a relatively low nofollow:dofollow ratio. Am I correct about that? Thanks,
Link Building | | tcolling
Tim PS - I don't know whether or not this matters, but our website is at: www.trustworthycare.com - Tim0 -
Does Link Juice Flow From Banner Ads.
Hi Guys I have been checking out a couple of my competitors and noticed that in site explorer there were sites that were passing what I would consider strong amounts of link juice for their banner adverts. I checked the anchor text which was always (Img ALT) Also the links are follow and placed on the home page too. So I wondered how google view this Im sure adverts were supposed to be no follow and if so then do you think there is a benefit from me taking one of these banner adverts to increase the level of link juice passing to my site.
Link Building | | RankStealer0 -
Text Link vs image link?
Which passes most link juice a text link or an image with the correct 'alt' attribute? Do the pass the same amount or is one more valuable than the other?
Link Building | | SamCUK0 -
Value of Link Redirect from Google News
Some sites are linking to us with the URL from Google News instead of using a direct link. For example: "http://news.google.com/news/url?sa=t&fd=R&usg=ghjgjhggjhgkjghjg&url=http://www.oursite.com/awesome-news-010123/" Does this pass the same value as a normal link? I asked the publisher to replace the link with a normal link but they pushed back saying that the news URL is better.
Link Building | | ProjectLabs0 -
Image only badges giving link juice!?!?
I see some chatter about badges, but no clear definition. A collegue of mine instituted badges on clients website, but these badges are only an image w/ hyperlink; no textual content. He is confident that this has worked successfully before as a link building strategy, which blew my mind. I thought we needed some text, and obviously optimizes anchor text for biggest benefit. Are these simple badges helpful, or do we need some html in there!? He also routes them through bit.ly to track impressions and clicks.. does that have any effect as well? Thanks!
Link Building | | SwissNinja0 -
Etsy.com --Getting link juice through other pages on search results?
My sister has a store page on etsy.com where she sells home made crafts. And I want to help her rank higher on google with some of the other etsy stores. So i started to look at the other etsy store pages that are ranking well on google and found that they have a page authority between 48 to 52. So i looked at the backlinks of the ones ranking well on google with high page authority and found that many of their best links came from the internal search results page on etsy.com, and some only had one link from just an arbitrary etsy.com search page. I'm thinking this is because another product being listed on the seach page has a high page authority which then passes some of its link juice onto every other product on the page. But what is interesting is products are always being sold or getting added so even though you are on a search results page that happens to benifit from the link juice of another product the next time the page gets crawled you will be on a different search page. So i am thinking in order to maintain high page authority to you just have to have a lot of products listed so that there is a greater likely hood that you will find yourself on the same search page as another high authority page. I have not been doing SEO very long so i would love to hear what others think. I really have no idea, am i on the right track with this? (edited post) Thanks
Link Building | | doug5650