Does Google Crawl Google Drive? And Should an SEO Use It?

Google crawls Google Drive for SEO

As an SEO who manages teams of link builders, social media optimizers, bookmarkers and article writers, this question has become paramount to the way I manage my SEO reports. My short answer is yes: Google can spy on the docs you store in Google Drive, and therefore you should not use this service for managing your SEO reports, especially when you are managing large websites where millions of dollars are at stake. That being said, do not use Google Drive (formerly Docs) or even Gmail for SEO reports, especially when it comes to linkwheels and link networks.

Let’s take a lesson from other cloud-based services, like MegaUpload getting shut down, or what’s happening right now with RapidShare. TorrentFreak leaked this RapidShare internal manifesto that outlines how they crawl your cloud-based files and can seize the content and terminate your account on the suspicion that it violates a registered copyright. Bullshit laws not only allow Google to spy on your content but actually force them to, and here’s why:

“Services should terminate account holders or subscribers not merely upon proof that they are infringers but when sufficient copyright holders have called their conduct into question. In such cases, services deserve an explanation from the users as to why the suspicions are unfounded.”

Now thanks to laws like the Digital Millennium Copyright Act (DMCA) and the completely unconstitutional Cyber Intelligence Sharing and Protection Act (CISPA), cloud-based services like Google Drive will have no choice but to spy on your files! Even though Google supported Net Neutrality back in 2006 when they were still cool, they have now given their FULL support for CISPA along with these 800+ other major companies. Of course, the attempts to pass CISPA have failed so far but they will just change a few words, give it a new name and keep trying until it finally passes.

Here’s a sentence I snagged from Google’s copyright and protection policies:

We provide information to help copyright holders manage their intellectual property online. If you think somebody is violating your copyrights and want to notify us, you can find information about submitting notices and Google’s policy about responding to notices in our Help Center.

Paranoid yet? Well, your paranoia is not unfounded, because in Google’s new all-encompassing privacy policy, which applies to ALL Google services, they straight up tell you that they can and will use any and all information you provide.

Information we get from your use of our services. We may collect information about the services that you use and how you use them, like when you visit a website that uses our advertising services or you view and interact with our ads and content.

Read carefully, especially the part where it says “interact with our ads and content”; that means the SERPs too. Obviously, I have no proof that Google is actively using the information in your Docs, aka Google Drive, but this proves that they can if they want to. You must understand that the Google ranking algorithms are the most valued asset Google has, and they will do anything to protect and enhance these algorithms, even if that means scanning your Drive.

Google Spies on Your Gmail Too

Most people have already realized that Google crawls Gmail as well. Just send someone an email about a mortgage and you will start seeing mortgage-related advertisements in your Gmail. They use keyword-scanning bots to scan all the content of your incoming and outgoing messages, and when the bots find something that is related to an ad, they display those types of ads in your inbox. Furthermore, Google even crawls your email attachments. I’ve tested this on several occasions: I do not mention anything about the targeted advertisement keyword in the message itself, yet when I attach a keyword-rich document to the email, I still get related ads. This is why I also strongly advise against using Gmail to send SEO reports, especially if you are going to send a list of websites that are part of a linkwheel or network.

The Future of Liberty in a Modern World

lib·er·ty /ˈlibərtē/
Noun:

The state of being free within society from oppressive restrictions imposed by authority on one’s way of life.
An instance of this; a right or privilege, esp. a statutory one.

Synonyms:
freedom, license, independence

_________________________________________________

Most of you already know what liberty is but just for clarity of understanding my point, I’ll say that the golden rule of liberty is that it protects rights that all human beings should be born with, but the catch is that these rights of liberty should never obstruct the rights of others. What does this mean?

If you don’t like gay marriage then don’t marry a gay person, if you don’t like abortion then don’t get an abortion, if you don’t like drugs then don’t use them, if you don’t like a certain type of sex then don’t have it, if you don’t like porn then don’t watch it and if you don’t like cigarettes then don’t smoke them. But if you don’t like your rights being taken away, then don’t vote to take away other people’s rights!

This code of liberty should be universal, and every human should learn these manners in modern-day society if we are all to get along, as Rodney put it. As the global population grows, the world gets smaller and resources will become more scarce. This means some of our freedoms may become hindrances to others. Unless you think the world is going to end this December 21st, 2012, you might as well start thinking about how our liberties will be affected in the future, which, in the scheme of things, could be anywhere from tens of thousands of years to 6 billion more years (enough time for dinosaurs or humans to evolve and become extinct 60 more times). I’m the most anti-regulation, anti-government kinda guy you’ll ever meet; I even got an Anarchy tattoo on my left ankle when I was 13 just to let people know I thought the government was unfair. But if we are to survive and thrive in this world over the upcoming millennia, we must take extra precautions not to break this golden rule of Liberty.

How Does This Apply in the Modern World?

If you live in a modern-day, developing country that is overpopulated, like India, China, the Philippines or Indonesia, and you decide to have ten children, then you will surely contribute to the suffering of others. In this situation, the right to have more children than you can support must be limited or it could lead to massive genocide. I have lived in India and the Philippines since 2006 and have visited China on several occasions. They are reaching maximum capacity and cannot sustain their rate of growth. Can you think of any other freedoms that we might have to lose as our world becomes smaller? Also feel free to list some freedoms that we should never sacrifice no matter what.

How to Use the Pagination Tags rel="next" and rel="prev"

How to use Pagination Tags

rel="next" and rel="prev" are link attributes that Google has just started supporting to fix the dreaded pagination duplicate content problem.

Here’s How to Use rel="next" and rel="prev"

The URLs below will be our example of how to implement this long-awaited Google markup for paginated content. Note that these are <link> elements placed in the <head>, not meta tags.

http://www.yoursite.com/content-part1.html

http://www.yoursite.com/content-part2.html

http://www.yoursite.com/content-part3.html

http://www.yoursite.com/content-part4.html

Step 1. On the first page of your pagination, which in this case is http://www.yoursite.com/content-part1.html, put the following markup in your <head>:

<link rel="next" href="http://www.yoursite.com/content-part2.html">

Obviously, you don’t need to use rel="prev" on the first page :-)

Step 2. On the next two pages in our example, you will reference both the previous page and the next page in the sequence by including both links in your markup, as I have done below for content-part2.html:

<link rel="prev" href="http://www.yoursite.com/content-part1.html">
<link rel="next" href="http://www.yoursite.com/content-part3.html">

Step 3. Then on the last pagination page, which is http://www.yoursite.com/content-part4.html, you just use the previous link (pointing to content-part3.html), as there is no next :-)

That’s it! No further adjustments needed, just give it a few weeks for the duplicate titles to disappear in Google Webmaster Tools.
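If your pages come out of a template, the three steps above can be generated rather than typed by hand. Here is a minimal Python sketch of that idea. The function name `pagination_links` and the `PAGES` list are my own placeholders for illustration, not part of any CMS or Google tool, and it assumes you already have the full, ordered list of URLs in the series.

```python
from html import escape


def pagination_links(urls, index):
    """Return the <link> tags for the page at position `index` (0-based) in `urls`.

    The first page gets only rel="next", the last page only rel="prev",
    and every page in between gets both, prev first.
    """
    if not 0 <= index < len(urls):
        raise IndexError("index is outside the paginated series")

    tags = []
    if index > 0:
        # Every page except the first points back to its predecessor.
        prev_url = escape(urls[index - 1], quote=True)
        tags.append(f'<link rel="prev" href="{prev_url}">')
    if index < len(urls) - 1:
        # Every page except the last points forward to its successor.
        next_url = escape(urls[index + 1], quote=True)
        tags.append(f'<link rel="next" href="{next_url}">')
    return tags


# Hypothetical series matching the four example URLs used in this post.
PAGES = [f"http://www.yoursite.com/content-part{n}.html" for n in range(1, 5)]

if __name__ == "__main__":
    for i, url in enumerate(PAGES):
        print(url)
        for tag in pagination_links(PAGES, i):
            print("   ", tag)
```

Whatever the function returns for a page goes into that page's <head>. The URLs are passed through `html.escape` so that a query string containing `&` still produces valid markup.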

The Old School Solution

I initially wrote this post on August 8th, 2011 about pagination pages causing duplicate content because, before the new pagination tag that Google announced on August 15th of 2011, there was just too much misinformation out there on this topic. (Spooky, were they reading my blog???) Previously, not even Google gave straight answers on how to deal with it. Just look at the way a Google employee handled the question here on their webmaster support forum about duplicate pagination content. Don’t you just love how Google always dances around the answers? Sometimes they make you read between the lines and sometimes they leave you completely clueless. Moving on, pagination pages create duplicate content and give you a bad mark in Google Webmaster Tools for duplicate titles. Just 5 days after I posted this article, Google added this pagination solution to the WMT help section. That being said, if you are interested in how I used to solve the problem, keep reading below.

Prevent Pagination Junk

Part of the reason it is so difficult to answer the question is that not all sites are the same: in some cases people might actually want to index their pagination pages, and in other cases they just want them followed.

The objective is to guide the bots to the content that you want them to see and keep them away from redundant and duplicate content that is inevitably produced by many CMSs.

That being said, I suggest a couple of measures that in combination will work for at least 90% of all pagination duplicate content problems:

1. Use the <META NAME="ROBOTS" CONTENT="NOINDEX, FOLLOW"> tag in the <head> of all your pagination pages except for the first page, as you will want to get that one indexed as the category page. This allows for indexing through the pagination without actually indexing the pagination page itself.

2. Whether or not you want the pagination pages indexed, I still suggest you use a script that rewrites your pagination titles to legitimize them. Even if Google is not indexing them, it is still crawling them, following them and using the data in its algorithms. Using a technique that I have seen favored by Google’s video search results on YouTube, simply rewrite your titles like this:

Domainname.com | Category Name | Page 2 of 49
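As a rough illustration of the two measures above, here is a small Python sketch. The helper names `robots_meta` and `paginated_title` are made up for this example, and it assumes your CMS can tell the template the current page number and the total page count. Treat it as a starting point, not a drop-in script.

```python
def robots_meta(page_number):
    """Return the robots meta tag for a pagination page, or None for page 1.

    Page 1 is left indexable so it can rank as the category page;
    every later page is marked noindex, follow.
    """
    if page_number < 1:
        raise ValueError("page_number starts at 1")
    if page_number == 1:
        return None
    return '<meta name="robots" content="noindex, follow">'


def paginated_title(domain, category, page_number, total_pages):
    """Build a unique title in the 'Domain | Category | Page X of Y' pattern."""
    if not 1 <= page_number <= total_pages:
        raise ValueError("page_number must be between 1 and total_pages")
    return f"{domain} | {category} | Page {page_number} of {total_pages}"


if __name__ == "__main__":
    print(robots_meta(2))
    print(paginated_title("Domainname.com", "Category Name", 2, 49))
```

Because the page number is part of every title, no two pagination pages end up with the same title, which is the whole point of the rewrite.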

Again, that was the old way of doing things prior to August 15th, 2011, and now you can prevent duplicate content caused by pagination pages with the very simple pagination tags, rel="next" and rel="prev".