
Google to downgrade pirate sites in search results

Updated on August 11, 2012

Search Engine as Social Arbiter

Everyone with a browser uses a search engine. It's your TV Guide and Yellow Pages and Rand McNally Road Atlas in a convenient digital form. Try to spend one 24-hour period without searching for something online, with only your bookmarks and your friends to rely on. It can't be done.

Google, financially

A publicly held company, Google is traded on the Nasdaq under the symbol GOOG. Anyone can buy it, but only in increments of $642, the price of one share: individual shares are relatively expensive. A high share price is not indicative of company value, but it does present a daunting entry point for individual investors.

On paper the company is worth over 200 billion dollars. They don't have a storefront, like Apple, or a product line, like Microsoft. There's no inventory: were they to file for bankruptcy tomorrow, they would begin to sell off massive numbers of computers and related technology used to manage their search engine service. They also own real estate dedicated to office buildings and data centers throughout the world. Their intellectual property has a very limited shelf life and would bring little value were it offered for sale.

Google, competitively

Is Google the best search engine? Yahoo and Microsoft don't think so, but those companies cannot compete with the number of search results returned by Google systems. No one outside of these companies knows how search results are computed, so it is impossible to compare one algorithm against another. It will always be this way. Google's competitive advantage comes from its secrecy. Their servers aren't faster or cooler or Made in America. If we knew how they ranked sites and returned search results, we'd have the knowledge to evaluate and circumvent the algorithms. All we can do now is judge the results, like eating a hot dog.

The secrecy is ironic because Google is a champion of open source software, outside of their search engine. The company sponsors programming competitions, tutorials, training, and academic programs designed to nurture the next generation of coders. Throughout these programs runs the common thread of transparency: if you code something, everyone should be able to access it. Perversely, don't bother to ask Google for their algorithms: they'll tell you to pick better keywords and to write better content.

The company employs people to hint at the algorithms and mete out helpful(?) bits of information when revisions take place. Matt Cutts is widely cited as a 'source' for understanding how Google searches and sorts and ranks. Don't mistake his generosity for altruism: he draws a paycheck from the company that he partially reveals. I immediately like the guy because he and I share a very similar academic pedigree. We've probably attended some of the same conferences and perhaps sat in the same breakout sessions.

Google, socially

No search engine can avoid becoming a social arbiter. Search engine users implicitly trust the results they are given. Search for "Toms shoes" and you get the top 10 results as decided by Google algorithms. It's probably good enough, but you'll never know if result #11 would have met your needs more effectively. Google will never know, either. They can measure what result you select and how long you might tarry on some sites, but they can't measure your satisfaction.

Recently Google admitted to adjusting their sorting algorithm to penalize web pages deemed 'pirate sites.' The implication is that these sites are offering downloads that violate copyright laws in one or more countries.

Actually, the implication is that Google computers have decided that these sites are such offenders.

Keep in mind that humans are rarely involved in this process. Far too many web sites exist to be evaluated by eyeballs. This is not the exercise of a focus group. When a site is tagged as a pirate site, it's almost certainly because a computer algorithm added up some numbers and came up with a number in a range. The number, the range, and the algorithm are all under the control of Google.

You get to vote

Google hints that their pirate-flagging algorithms will consider the volume of "valid copyright removal notices" (their words) reported against individual sites. We cannot say whether Google plans to interface with government agencies to validate complaints. Certainly they plan to use their own copyright violation reporting system, but that has obvious limitations.
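To make the idea concrete, here is a minimal sketch of how a notice count could push a site down the rankings. Nothing here comes from Google: the threshold, the penalty cap, the scores, and every name in the code (Site, adjusted_score, rank) are hypothetical, invented purely for illustration. The real number, range, and algorithm remain secret.

```python
from dataclasses import dataclass

# Hypothetical values: Google has not published any of these.
DEMOTION_THRESHOLD = 100   # notice count at which demotion starts (assumed)
MAX_PENALTY = 0.5          # largest fraction of the score removed (assumed)


@dataclass
class Site:
    domain: str
    base_score: float      # relevance score before any penalty (assumed)
    valid_notices: int     # valid copyright removal notices received


def adjusted_score(site: Site) -> float:
    """Reduce a site's score once its notice count reaches the threshold."""
    if site.valid_notices < DEMOTION_THRESHOLD:
        return site.base_score
    # The penalty grows with notice volume and is capped at MAX_PENALTY.
    excess = site.valid_notices - DEMOTION_THRESHOLD
    penalty = min(MAX_PENALTY, excess / 1000)
    return site.base_score * (1 - penalty)


def rank(sites: list[Site]) -> list[str]:
    """Return domains ordered from highest to lowest adjusted score."""
    ordered = sorted(sites, key=adjusted_score, reverse=True)
    return [s.domain for s in ordered]


if __name__ == "__main__":
    sites = [
        Site("clone.example", base_score=0.9, valid_notices=600),
        Site("original.example", base_score=0.8, valid_notices=0),
    ]
    print(rank(sites))
```

In this toy version the clone starts with the better score but drops below the original once its notices are counted. Notice what the sketch cannot do: it has no way to tell whether a notice is truly valid, which is exactly the limitation of relying on complaint volume.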

Worst case, Google will evaluate third-party sites based on complaints received through Google. As a frequent user of Google's reporting system, I can confirm that it does work, except when it doesn't. Google holds no sway over sites that don't publish Google advertising (AdSense). A site in Nigeria proudly violating copyrights probably won't care whether Google indexes it or not. I recently published a page that was soon cloned on a European server: for several keywords, that pirated page ranks higher than my original content. I have reported it to Google, but the clone persists.

What if you want to search for a pirate site?

Google has decided that you probably won't have a pirate site in your search results. Keep in mind that the company is under fire on several fronts for, allegedly, facilitating copyright violations. They are being sued for copying and publishing millions of books that were supposedly out of print and not subject to copyright regulations. They are constantly harangued by movie, music, and book publishers who are hysterical over loss of revenue due to pirate sites. Adding new filtering logic to their search engine may appease some lawyers and judges.

Some folks would say that suing Google is akin to suing General Motors for making the car used to haul pirated DVDs. Those folks don't make their living producing digital media. Whether right or wrong, Google is permanently inserted into the debate.


    • nicomp really (author)

      6 years ago from Ohio, USA

      Voted up by me. I liked it.

    • Patty Kenyon

      6 years ago from Ledyard, Connecticut

      Voted Up, Interesting, and Useful!! This was very interesting and Well Done!! I do use Google however I feel like it takes forever if I am looking for something very specific. Actually I find specific searches using Yahoo, or Alta Vista (which ironically is owned by Yahoo.) I am not a fan of Bing either.

      In regards to the pirated material, although they may say it has been down graded, however, those sites are so easy to find within a few pages of the searches...and NO, I do not use them but often times led to them through back links of something I am looking for.

      Wow, I cannot believe that there is a clone on a European server of something you published...errr, I can completely understand your frustration!!!

      Awesome Job!!!!

    • RedElf

      6 years ago from Canada

      Wow - good read! Google used to be the champion of transparency and free internet for all - way back in the 80s, before they got really big. Also until Panda I, the first search results were NOT paid advertisements.

      Oh, well. Don't get me wrong, I still use Google, but I don't love them like I used to (not that they care a bunch, I'm sure). There is a saying about power and corruption... :)

