ArtsAutosBooksBusinessEducationEntertainmentFamilyFashionFoodGamesGenderHealthHolidaysHomeHubPagesPersonal FinancePetsPoliticsReligionSportsTechnologyTravel

Quality Articles Fall Victim to Google Panda and Anti-Duplication Hysteria

Updated on February 1, 2022
LHwritings profile image

Lyndon Henry is a writer, journalist, editor, and writing-editing consultant.

Have you been a Panda victim lately?

Writers, and the public in general, increasingly have been encountering censorship, but who would have expected Google — and its Web-searching tools — to become a major driving force? However, it appears that Google — and in particular its Web content "filtering" process tagged as "Panda" — has emerged as a serious impediment to productive writing.

Panda — no, not the cute, lovable Chinese bear, but basically, Google's name for a kind of content-screening algorithm — was instituted by Google in about 2011) to try to address a real problem: rampant, deliberate, reckless, and unbridled duplication of content by a number of content providers (in quest of spreading a wide net to be found in Google searches and thus to catch Webpage ad revenue). Panda reduces the number of SERPs — Search Engine Results found in Google search queries. In response, so as not to be screened out in Web searches, various major online publishers and content providers have adopted similar "house rules" — in-house screening procedures to try to conform to this stringent new paradigm by imposing severe strictures against any duplicative content and insisting on entirely pristine, virgin content.

Lots of people, including many writers, have been applauding Google's move because it supposedly thwarts plagiarism. This kind of content piracy (stealing someone else's written work and portraying it as your own) has certainly been a problem ... but Google's robotic crusade against virtually any duplicative content is also creating an unforeseen, and perhaps ultimately more harmful, problem

Penalized for Adapting Your Own Work

What's the problem? Traditionally, many, possibly most, writers have reprocessed their own content, updating it, expanding it, tweaking it, inserting it into new articles. If you've expressed a concept well, why should you have to junk that wording entirely and somehow find entirely new and different words to try to express the same thing? Yet the emerging new paradigm seems to admonish any such re-use of past formulations and phrases, and to convey a mandate that all content emerge ever new from the writer's brain.

Many writers create new material (such as articles on important topics) by adapting and expanding their own older material, perhaps from relatively small sections of other much longer articles. Often this means that some of the new material will be duplicated (key phrases, especially those describing very specific objects, procedures, or concepts) — even though the overall wording may be changed substantially, with totally new case examples or other material, and new content added

However, more and more writers are finding that, because of the emerging Panda paradigm, their new articles may be rejected, or suspended, by the online publisher for a "duplicated content" violation. "Substantial similarity to another work" can mean not just duplicating the exact wording elsewhere on the Web, but may include "close paraphrasing, among other forms of misappropriation or copying of content...." In other words, you can't just change the wording of previous material, you can't even paraphrase it — which seems to mean you can't really use it again at all.

"Cure" Worse Than the "Disease"?

Unfortunately, the ultimate result of the Panda "remedy" may be worse than the disease if it stifles even useful and essential duplicative content. Possibly, Panda (and online publishing sites that seek to comply with this paradigm) may be relying on far too mechanistic and draconian an algorithm to assess duplication

What if you write a short article, with the intent to develop that later into a more extensive article on the same theme or topic? Watch out — the very words and phrases you use in your initial short article might trigger a "duplicated content" violation when you try to re-use some of the same expressions and phrases later.

So writers may find that plagiarism of their work has diminished (this is yet to be demonstrated). However, the price may be to frustrate or encumber their efforts to produce more new work in the future. It's hard to say exactly what the best solution to the problem of content piracy might be ... but when you leave it to the robots, you may get something far different from what you bargained for.

Originally published 2011/12/06

This content reflects the personal opinions of the author. It is accurate and true to the best of the author’s knowledge and should not be substituted for impartial fact or advice in legal, political, or personal matters.

© 2011 Lyndon Henry

working

This website uses cookies

As a user in the EEA, your approval is needed on a few things. To provide a better website experience, hubpages.com uses cookies (and other similar technologies) and may collect, process, and share personal data. Please choose which areas of our service you consent to our doing so.

For more information on managing or withdrawing consents and how we handle data, visit our Privacy Policy at: https://corp.maven.io/privacy-policy

Show Details
Necessary
HubPages Device IDThis is used to identify particular browsers or devices when the access the service, and is used for security reasons.
LoginThis is necessary to sign in to the HubPages Service.
Google RecaptchaThis is used to prevent bots and spam. (Privacy Policy)
AkismetThis is used to detect comment spam. (Privacy Policy)
HubPages Google AnalyticsThis is used to provide data on traffic to our website, all personally identifyable data is anonymized. (Privacy Policy)
HubPages Traffic PixelThis is used to collect data on traffic to articles and other pages on our site. Unless you are signed in to a HubPages account, all personally identifiable information is anonymized.
Amazon Web ServicesThis is a cloud services platform that we used to host our service. (Privacy Policy)
CloudflareThis is a cloud CDN service that we use to efficiently deliver files required for our service to operate such as javascript, cascading style sheets, images, and videos. (Privacy Policy)
Google Hosted LibrariesJavascript software libraries such as jQuery are loaded at endpoints on the googleapis.com or gstatic.com domains, for performance and efficiency reasons. (Privacy Policy)
Features
Google Custom SearchThis is feature allows you to search the site. (Privacy Policy)
Google MapsSome articles have Google Maps embedded in them. (Privacy Policy)
Google ChartsThis is used to display charts and graphs on articles and the author center. (Privacy Policy)
Google AdSense Host APIThis service allows you to sign up for or associate a Google AdSense account with HubPages, so that you can earn money from ads on your articles. No data is shared unless you engage with this feature. (Privacy Policy)
Google YouTubeSome articles have YouTube videos embedded in them. (Privacy Policy)
VimeoSome articles have Vimeo videos embedded in them. (Privacy Policy)
PaypalThis is used for a registered author who enrolls in the HubPages Earnings program and requests to be paid via PayPal. No data is shared with Paypal unless you engage with this feature. (Privacy Policy)
Facebook LoginYou can use this to streamline signing up for, or signing in to your Hubpages account. No data is shared with Facebook unless you engage with this feature. (Privacy Policy)
MavenThis supports the Maven widget and search functionality. (Privacy Policy)
Marketing
Google AdSenseThis is an ad network. (Privacy Policy)
Google DoubleClickGoogle provides ad serving technology and runs an ad network. (Privacy Policy)
Index ExchangeThis is an ad network. (Privacy Policy)
SovrnThis is an ad network. (Privacy Policy)
Facebook AdsThis is an ad network. (Privacy Policy)
Amazon Unified Ad MarketplaceThis is an ad network. (Privacy Policy)
AppNexusThis is an ad network. (Privacy Policy)
OpenxThis is an ad network. (Privacy Policy)
Rubicon ProjectThis is an ad network. (Privacy Policy)
TripleLiftThis is an ad network. (Privacy Policy)
Say MediaWe partner with Say Media to deliver ad campaigns on our sites. (Privacy Policy)
Remarketing PixelsWe may use remarketing pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to advertise the HubPages Service to people that have visited our sites.
Conversion Tracking PixelsWe may use conversion tracking pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to identify when an advertisement has successfully resulted in the desired action, such as signing up for the HubPages Service or publishing an article on the HubPages Service.
Statistics
Author Google AnalyticsThis is used to provide traffic data and reports to the authors of articles on the HubPages Service. (Privacy Policy)
ComscoreComScore is a media measurement and analytics company providing marketing data and analytics to enterprises, media and advertising agencies, and publishers. Non-consent will result in ComScore only processing obfuscated personal data. (Privacy Policy)
Amazon Tracking PixelSome articles display amazon products as part of the Amazon Affiliate program, this pixel provides traffic statistics for those products (Privacy Policy)
ClickscoThis is a data management platform studying reader behavior (Privacy Policy)