What does this mean in search results and what can I do about it?

Jump to Last Post 1-4 of 4 discussions (15 posts)

59
EricDockettposted 12 years ago
When searching for unscrupulously borrowed copies of my work sometimes I come across results that don't seem to really be there. Even though I search for a sentence in quotes, and the SERPs shows that quote in my results, when I click on the link either the text isn't really there, or I get a 404.

I assume this means the infringing content has been taken down, but G-Bot hasn't been back to the page in a while. I see this especially with that huge scraper site that stole, and is still stealing, HP Hubs.

So, I'm mostly concerned about these 404s from a "moving my content from one subdomain to another" point of view. I'm going through the process of actually removing copied content from google results, but I don't know how the 404s and false positives will impact any article I move. I don't want to trip any HP filters,and more importantly I don't want my article to appear as the second duplicate to the search engines.

How can I get rid of these 404s? I have a feeling if I submit a takedown with WMT they'll just tell me they can't find the content.
reply report
1. 93
  Judi Beeposted 12 years agoin reply to this
  You will get 404 error messages for the URLs from the scraper's original sites (the ones closed down by his host). He has now, of course, found a nice new host, so the sites are back up (not sure about the indexing of them though).
  
  The moving content issue is the one that concerns me - although Digby Adams suggested that I could do enough of a rewrite to get around the duplication issue - I suspect that's what I might have to do.
  reply report
  59
  EricDockettposted 12 years agoin reply to this
  Right, but my concern is how Google interprets content that seems to a duplicate according to their cache, but in reality has been taken down and is no longer present on the web. Will they devalue our content because they think it is a copy of something that is already out there?
  reply report
  0
  Ethan Greenposted 12 years agoin reply to this
  Google usually tries to work out which is the original and then doesn't index the latter copies, or if it does, doesn't give it the same credit as the original content.
  reply report
  59
  EricDockettposted 12 years agoin reply to this
  But if I move my Hub from one subdomain to another the copy will be the older version. Even though it's not really there, it is there in the cache.
  reply report
2. 43
  Writer Foxposted 12 years agoin reply to this
  To remove the cache of your Hubs (or the plagiarist's articles) on Google:
  
  http://support.google.com/bin/static.py … page=ts.cs
  
  (click inside the circle which says 'Web Search', then scroll down the page and click inside the circle which says 'A piece of content I am concerned about has already been removed by the webmaster but still appears among the search results.' Then, click on the link 'this tool.' You'll then be asked to sign in and then to enter the page URL and, if the page is still live, a word from the outdated cache page that you want to remove.
  
  To remove the cache on Bing/Yahoo:
  http://www.bing.com/webmaster/help/bing … l-cb6c294d
  
  It can take 24 – 48 hours for the removal process.
  reply report
  59
  EricDockettposted 12 years agoin reply to this
  Thanks Writer Fox. That's exactly what I'm looking for.
  reply report
61
chef-de-jourposted 12 years ago
Google doesn't give it the same credit? I'm not quite sure what you mean by this. If content is copied (stolen) and posted how does the crawler distinguish between original and duplicate? All content is indexed but what criteria are used to differentiate?
reply report
1. 59
  EricDockettposted 12 years agoin reply to this
  Chef: If your Hub is stolen it will generally rank well above the copy, as long as your account is healthy. The usual kind of site that takes our stuff can't compete with HP in terms of PageRank and authority. Plus, Google knows which content was there first, or at least they should, but they sometimes screw up.
  
  However, if you decide to move your Hub from one account to another it becomes de-indexed from the search results, and now the stolen version is the only one left on the web. So, when you re-post your Hub again, even though it's yours by copyright, it looks to search engines like you are posting duplicate content. It would probably be flagged by HP and/or banished into the depth of the SERPs.
  
  Search engines don't immediately know when a content thief changes a page or takes it down. It takes hours, days or even weeks (in this case apparently months) for the crawlers to come back and see the changes. So until G-Bot figures it out that leaves copies of your stolen work in the cache, and Google thinks they still exist even though if someone tries to go to them they'll get a 404.
  
  So that's the meat of my question: If Googlebot hasn't come back and recognized that stolen content is no longer on a site, will they penalize my Hub if I delete from one account and repost it on another?
  reply report
2. 0
  Ethan Greenposted 12 years agoin reply to this
  the date it was published
  reply report
61
chef-de-jourposted 12 years ago
That's a neat reply. I've recently experienced a first dose of plagiarism and it's not very pleasant! As a complete novice and a naive onliner I ask the basic question: Why can't a design guru/expert come up with a foolproof copyright button/icon/system whereby the original author's article is 'certified' original with date, time etc etc. Any subsequent copies will be stopped/automatically suspended/deleted?

Would this be possible? For all the prevention and care we take as authors it seems the bottom line still is...any article can stolen at will, by whoever, whenever, wherever.
reply report
1. 0
  Ethan Greenposted 12 years agoin reply to this
  I don't think it would be possible, though am not expert. What would be possible though is for Hubpages to put in a similar function as a popular wordpress plugin is does - an automatic backlink to the original article for content scrapers. It doesn't stop people copying the content, but in some circumstances does credit you with creating the article with a backlink bonus.
  reply report
  59
  EricDockettposted 12 years agoin reply to this
  The only problem I'd have with that is I don't want backlinks from some crummy scraper site pointed at my subdomain.
  reply report
2. 59
  EricDockettposted 12 years agoin reply to this
  Search engines aren't in the business of copyright enforcement, except to the extent that they must comply with the laws when a complaint is made. Remember also that there are still legitimate syndicated articles out there, particularly when it comes to news, so search engines don't always want or need to penalize a site with copied content. They just decide which is original and most relevant to the search query to the best of their ability, and push the rest down in the SERPs . . . at least that's what they're supposed to do!
  
  Enforcing your own copyright is part of being an online content creator, or any kind of creative person. Unfortunately, it goes with the gig. Any article can indeed be stolen. Don't let it get you down.
  reply report
60
mamachelliebooksposted 12 years ago
I am new but I think the search results help with the type of ads in regards to what you are writing about. Hope this helps. MamaChellie.
reply report

This website uses cookies

As a user in the EEA, your approval is needed on a few things. To provide a better website experience, hubpages.com uses cookies (and other similar technologies) and may collect, process, and share personal data. Please choose which areas of our service you consent to our doing so.

Necessary

Features

Marketing

Statistics

Approve All & Submit
Approve Checked Only

For more information on managing or withdrawing consents and how we handle data, visit our Privacy Policy at: https://corp.maven.io/privacy-policy

Show Details

Necessary
HubPages Device ID	This is used to identify particular browsers or devices when the access the service, and is used for security reasons.
Login	This is necessary to sign in to the HubPages Service.
Google Recaptcha	This is used to prevent bots and spam. (Privacy Policy)
Akismet	This is used to detect comment spam. (Privacy Policy)
HubPages Google Analytics	This is used to provide data on traffic to our website, all personally identifyable data is anonymized. (Privacy Policy)
HubPages Traffic Pixel	This is used to collect data on traffic to articles and other pages on our site. Unless you are signed in to a HubPages account, all personally identifiable information is anonymized.
Amazon Web Services	This is a cloud services platform that we used to host our service. (Privacy Policy)
Cloudflare	This is a cloud CDN service that we use to efficiently deliver files required for our service to operate such as javascript, cascading style sheets, images, and videos. (Privacy Policy)
Google Hosted Libraries	Javascript software libraries such as jQuery are loaded at endpoints on the googleapis.com or gstatic.com domains, for performance and efficiency reasons. (Privacy Policy)

Features
Google Custom Search	This is feature allows you to search the site. (Privacy Policy)
Google Maps	Some articles have Google Maps embedded in them. (Privacy Policy)
Google Charts	This is used to display charts and graphs on articles and the author center. (Privacy Policy)
Google AdSense Host API	This service allows you to sign up for or associate a Google AdSense account with HubPages, so that you can earn money from ads on your articles. No data is shared unless you engage with this feature. (Privacy Policy)
Google YouTube	Some articles have YouTube videos embedded in them. (Privacy Policy)
Vimeo	Some articles have Vimeo videos embedded in them. (Privacy Policy)
Paypal	This is used for a registered author who enrolls in the HubPages Earnings program and requests to be paid via PayPal. No data is shared with Paypal unless you engage with this feature. (Privacy Policy)
Facebook Login	You can use this to streamline signing up for, or signing in to your Hubpages account. No data is shared with Facebook unless you engage with this feature. (Privacy Policy)
Maven	This supports the Maven widget and search functionality. (Privacy Policy)

Marketing
Google AdSense	This is an ad network. (Privacy Policy)
Google DoubleClick	Google provides ad serving technology and runs an ad network. (Privacy Policy)
Index Exchange	This is an ad network. (Privacy Policy)
Sovrn	This is an ad network. (Privacy Policy)
Facebook Ads	This is an ad network. (Privacy Policy)
Amazon Unified Ad Marketplace	This is an ad network. (Privacy Policy)
AppNexus	This is an ad network. (Privacy Policy)
Openx	This is an ad network. (Privacy Policy)
Rubicon Project	This is an ad network. (Privacy Policy)
TripleLift	This is an ad network. (Privacy Policy)
Say Media	We partner with Say Media to deliver ad campaigns on our sites. (Privacy Policy)
Remarketing Pixels	We may use remarketing pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to advertise the HubPages Service to people that have visited our sites.
Conversion Tracking Pixels	We may use conversion tracking pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to identify when an advertisement has successfully resulted in the desired action, such as signing up for the HubPages Service or publishing an article on the HubPages Service.

Statistics
Author Google Analytics	This is used to provide traffic data and reports to the authors of articles on the HubPages Service. (Privacy Policy)
Comscore	ComScore is a media measurement and analytics company providing marketing data and analytics to enterprises, media and advertising agencies, and publishers. Non-consent will result in ComScore only processing obfuscated personal data. (Privacy Policy)
Amazon Tracking Pixel	Some articles display amazon products as part of the Amazon Affiliate program, this pixel provides traffic statistics for those products (Privacy Policy)
Clicksco	This is a data management platform studying reader behavior (Privacy Policy)

What does this mean in search results and what can I do about it?

Arts and Design

Autos

Books, Literature, and Writing

Business and Employment

Education and Science

Entertainment and Media

Family and Parenting

Fashion and Beauty

Food and Cooking

Games, Toys, and Hobbies

Gender and Relationships

Health

Holidays and Celebrations

Home and Garden

HubPages Tutorials and Community

Personal Finance

Pets and Animals

Politics and Social Issues

Religion and Philosophy

Sports and Recreation

Technology

Travel and Places

About Us

This website uses cookies

What does this mean in search results and what can I do about it?

Related Discussions

This website uses cookies