I've had 38 of my hubs scraped by the same outfit operating over different sites. I know that there are various threads now, but here's a list of the site names. They are scraping topic categories, as far as I can see. The sites are made up entirely of hubs, so please check for your content.
FinanceForPeoples
Familywithparenting
FoodforCooking
EverythingAtHomes
GenderRelationships
Games-Toys-Hobbies
BusinessEmployment
EntertainmentwithMedia - a couple of mine here
EducationforScience - lots of my history hubs here!
CelebrationforHoliday - Christmas, Halloween and Easter
Pets-and-Animals (can't actually find this site)
artswithdesign
religion-philosophy
travelingandplaces - a lot of my hubs here too
I searched for Food for Cooking and came up with some info about this guy. This site was created on 3-04-13. He's already had two domain name changes in less than a year for this site. He has also had 2 IP address changes in a year.
It looks like he changes names when he runs into problems like this.
I'm new to HubPages, so I doubt any of my hubs are stolen, 'cause I've hardly got anything... But a solution that I've got is: what if HubPages were to disable text selection? That way no one would be able to select and copy your text... This can be done using simple CSS embedded in the HTML source code of HubPages...
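For anyone wondering how far that would get us, here is a rough sketch of how an automated scraper might read a page. It is only an illustration: the sample page, the `.hub` class name and the `extract_text` helper are all made up, and nobody here knows what software this outfit actually runs. The point it tries to show is that a CSS rule such as `user-select: none` is an instruction to the browser, so a program that reads the raw HTML still sees the text.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes from a page, skipping <style> and <script> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("style", "script"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("style", "script") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside style/script blocks.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Hypothetical helper: return the readable text of an HTML document."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Made-up page that disables text selection in the browser via CSS.
SAMPLE_PAGE = """<html><head><style>.hub { user-select: none; }</style></head>
<body><div class="hub"><p>My hub text that took days to research.</p></div></body></html>"""

print(extract_text(SAMPLE_PAGE))
```

In this sketch the CSS rule is simply skipped along with the rest of the stylesheet, which suggests that disabling selection would mostly inconvenience human readers.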
I agree - something should be done on a site-wide basis. I also think HP is within its rights to file a B to B complaint with Google. This lowers Google's marketability (since they are ranked by advertisers according to the value and integrity of their search results), and it affects the entire site here.
We cannot afford to deal with this every few weeks - which is what we are doing now. It is far more efficient to have a large-scale way to handle it than to have hundreds of individual writers here filing thousands of individual complaints, over and over. This is not to say that we should stop filing DMCAs, it just means we need a broader attack & defend strategy.
I'd better go sit in a Zen Garden with some herbal tea - I think the issues in Korea are affecting my choice of phrases here!
I went through all of them and found 17 of my hubs!! GRRR!! I put my hub name in search and they just popped up under the categories! I c/p'ed each page with all my hubs into a Word Doc for easy access for filing a DMCA or several... I haven't gotten that far.
Considering the size of the sites, they're probably a completely automated "auto-blog" that's set it and forget it. He enters in a few source websites / RSS feeds and then moves on to another project while the software continuously scrapes and publishes.
All it would end up doing is damaging the user experience for legitimate users.
Holy Crap! I am doing my DMCAs and each time I go back there are more scraped. He must be on the move right now. Originally, I c/p'ed my hub name search onto a Word Doc for easy access to the SOB's sites with our hubs. I am glad I did because it makes it easier to open tabs and stay on the same page. Although, however he has it set up, the page jumps around and it takes patience to get the links to open. I have one that I can't get opened. Good Luck everyone!
Just caught Business Employment ripping off my content. What is the point of them doing this? I'm just glad I am not the only one. I'm not even sure what a DMCA is but I am willing to file it if it helps not index the stolen content.
I am fairly new to Hubpages but this pisses me off! I know I spend days and days researching and writing my hubs and to have them just stolen....anyway I have an idea, although I am not at all techy. Maybe someone could write a hub on this or start a petition and we could get as many hubbers as possible to sign it and send it to google, hubstaff, or the owners of the rip off sites? Maybe if we tackle this thing as a community? Just a thought.
I found a few of my hubs on their travel site after you posted a link the other day. I just checked one of the other sites in your list, and found 10 more of my hubs there. I had a feeling that a couple of them had been stolen, as they had been getting good Google traffic until recently, but I hadn't turned up anything in my searches. Off to file some more DMCA notices and look for more copied hubs on the other sites... What a pain!!!!!
No worries Kathy - glad it helped you find your hubs. I'm experiencing exactly the same situation as you - hubs that were getting traffic are now sliding. No wonder - I've just filed another batch of DMCAs which brings me up to 48 today and I've a feeling there may be more.
Yes - 48! Nearly half of all my hubs. I've searched most of his sites and found them by copying a sentence from the hub into the search bar. It's odd that he missed some - he has most of my history hubs, for example, but a few are not there. In other niches he has all of them though. Oh, and I found one from my other subdomain too.
Ack! I just found 5 more of mine on one of his sites. I've been able to find most of them by searching for my name and HubPages name. I'm finding the same thing with my niches- he's stolen most of my food / cooking hubs but seems to have missed a few. (Or maybe I just haven't found them yet).
Wow - even all the "join hubpages" links are copied, making it blatantly obvious that the material was taken. Seems to me when large portions of hubpages have been copied there ought to be something admin can do. The website I looked at didn't have any material that WASN'T copied from hubpages.
For copied hubs, I feel that the best thing to do is file a DMCA complaint. Doesn't that mean that if they are making money with Google AdSense, they can lose their accounts because they were duplicating content? Seems only fair.
I just checked, none of mine are copied. Otherwise I would file a complaint. I emailed a few hubbers when I found that their work has been stolen, I hope they file complaints. Let's hope these sites get shut down and the thieves lose their adsense accounts!!
I didn't have time to give the most likely sites for my stuff more than a quick glance, but it appears that I dodged the bullet this time. I know how much it sucks though, cuz a couple of weeks ago I had half a dozen Hubs get sniped by some wanna-be who built up an entire "music news" blog using stolen material. Release the hounds!!
FatFreddysCat, I typed your name in the site Entertainmentwithmedia after I typed my name and found that ten of my Hubs appeared. It looks like ten of your Hubs were copied by this site as well. It looks like the fun begins again.
I found one website using my online name stating, "@copyrights 2011-2012 All Rights Reserved by Thelma Alberts After Massage Care..." My name was added to the title of the article. What shall I do about it?
Namedrop yourself every chance you get, place links to your own related hubs all throughout ("As I mentioned in this hub..." or "I mentioned in this hub..."), and that'll be definitive proof that the stolen content is, indeed, stolen.
I just filed a bunch of removals with Google but it's going to take forever to get everything. This dirtball is stealing links and all, so if you're using textual links to interlink your Hubs you now have a bunch of garbage links pointed back at your subdomain. Nice.
I've recently decided that I curse too much, and I've been trying to dial it back, but this guy is making it tough today.
As a programmer, what is being asked is virtually impossible. The three main languages used to design websites and optimize content on websites, like HP, include solutions for scraping. Scraping is nearly identical to what Goo, Bing, Yahoo, Facebook Open Graph, etc do to display a rich snippet. Apart from a few minor differences, there is really no way to stop scraping.
However, it is not entirely impossible. The well known Panda and Penguin solutions were designed and implemented to stop content farmers who temporarily -or long-term- scrape and display portions of pages on multiple sites, eating up page positions in SERP and black-hat links, usually with ads attached to them. But humans are smarter than algorithms.
The solution which exists, but is not widely accepted/recommended because it would literally drop thousands upon thousands of pages from the system (since the traditional meta-tag system has been deprecated), is PDF, which blocks "reading" of the content itself. It also blocks several other enhancements.
If this fellow is a pro scraper, he will undoubtedly switch domains every 60-90 days to avoid the spider crawl and is possibly using a Twitter like API to pull content.
In the interim, it is best to authenticate content in-page, and report black-hat and scraping domains to the search engines.
Great...it's a hassle to search for your own stolen content and then have to file the complaints. I do not understand why Google's search engine can't detect these and disable the one that was posted - obviously second? I'm sure there's a reason but I sure can't figure it out.
I know how you feel, I just spent the last few hours looking for articles, and I didn't get any of my work done, oops! BTW Thanks for bringing this to everyone's attention. If I hadn't opened this forum thread today I would have never known about this at all!
Yeats & Keating! All the domains are registered to one person, in China. Most of the hosting looks like Korean servers. And they are mostly WordPress builds. If you want, report his/her ca-pub number to Google, it will definitely stop them, since all these sites are pointing to that AdSense Account. If you want the number, I can show you how to view it.
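As a rough sketch of how the number can be pulled out once you have a page's source (View Source in the browser, then copy and paste): AdSense publisher IDs normally show up in the ad code as `ca-pub-` followed by a run of digits. The sample source, the all-zero ID and the `find_publisher_ids` helper below are made up for illustration and are not taken from any of these sites.

```python
import re

# Assumption: publisher IDs appear in the page source as "ca-pub-" plus digits.
CA_PUB_PATTERN = re.compile(r"ca-pub-\d+")


def find_publisher_ids(page_source):
    """Return the distinct ca-pub IDs in page_source, in order of first appearance."""
    seen = []
    for match in CA_PUB_PATTERN.findall(page_source):
        if match not in seen:
            seen.append(match)
    return seen


# Made-up page source with a placeholder ID, standing in for a real "View Source".
SAMPLE_SOURCE = """
<script>google_ad_client = "ca-pub-0000000000000000";</script>
<ins class="adsbygoogle" data-ad-client="ca-pub-0000000000000000"></ins>
"""

print(find_publisher_ids(SAMPLE_SOURCE))
```

If the same ID turns up on several of the listed sites, that would support the idea that they all feed one AdSense account, which is worth mentioning in a report.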
4.10 Edit: You can also report the PAGES and DOMAIN (both) to Goo, using this link
4.48 Edit: Writers must realize HP is not truly responsible for the protection of content, generally speaking. When sites get "too big" worms (spammers, scrapers, etc) pop up all over, like dandelions on a lawn, because they can go undetected by the bigger fish. Generally speaking, they are right. 90% of top tier sites do not have a spam watchdog team, hence rely on the search engine to do the "dirty" work. And search engines monitor nearly 250 million domains @ 10-1 million pages per domain. Quite a conundrum. To that end, explains why that winery created Camus Conundrum (delicious by the way). lol.
Hasn't "Whois" been a problem in the past? I am not tech savvy but if he is doing this over and over again, I would think his IP Address would be red flagged. I found more on healthforpeoples.com/. It appears to be the same dates and plain style as the others.
Judi, please keep us posted as to any effect your work is having. Looks like lots of folks, including me, will be filing; if G isn't going to do anything we need to find a different avenue. Adsense, maybe. Something...
For sure they will. But they have probably had a lot of $s for the three months the sites have been active.
I am wondering why I haven't found the copied articles earlier than this. JudiBee and I found them independently of each other but roughly at the same time. I regularly check random hubs and these were mostly copied on 25th January, so I should have found them before now. Is it something to do with indexing or something?
This surprises me as well. I check for copied hubs every few months and didn't find the copied work on my own. Only once I heard about the site and searched for my username did I find out. If it hadn't been for this forum, I wouldn't have known about it at all!
The way I look for copied content is I take an excerpt of text, copy it, and then put it in quotation marks in the Google search box to see what pops up. If more than your hub article link shows up, you have a problem. This usually works for me.... It gets tough to do that with every article when you have more than a few. Even harder if you have to keep track of 100+ articles!
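If you have a lot of hubs, the quoted-excerpt search can be semi-automated. Below is a minimal sketch that turns a sentence into an exact-phrase Google search link you can open in a tab. The `exact_phrase_search_url` helper is made up for this post, and excluding `hubpages.com` with the `-site:` operator is my own assumption about what is useful; drop it if you also want to see your original in the results.

```python
from urllib.parse import quote_plus


def exact_phrase_search_url(excerpt, exclude_site=None):
    """Build a Google search URL for an exact-phrase match of excerpt.

    exclude_site is optional; when given, results from that domain are
    filtered out with the -site: search operator.
    """
    # Collapse line breaks and repeated spaces left over from copy/paste.
    phrase = " ".join(excerpt.split())
    query = f'"{phrase}"'
    if exclude_site:
        query += f" -site:{exclude_site}"
    return "https://www.google.com/search?q=" + quote_plus(query)


# Example with a made-up sentence from a hub.
print(exact_phrase_search_url("a sentence copied from my hub", "hubpages.com"))
```

Picking a sentence from the middle of the hub tends to work better than the title, since several people here report that titles were altered on the copies.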
Yep, that's what we do... that's why I'm flummoxed as to why I only found them now.
Interestingly, not one of my hubs published in 2013 has been snaffled. I have noticed that he is still 'republishing' as of today's date on a couple of his sites. Conclusion: he is targeting specific sub-domains on different days?
I also checked for lenses but nothing there. I think he's done Squidoo already.
The hubs that this person stole from me are all over the place: some are fairly recent and some are older. I think you are right about targeting specific sub-domains on different days. It looks like a lot of stuff from each website was taken on the same day...
We can certainly speed up the process by reporting them directly to Adsense. I did just that right after I filed my DMCA complaints with Google, letting them know I had just made the report in case they needed to cross-reference. You'd think that they would go hand in hand but I think since Google is so huge it's best to just do both, in case they don't happen to check!
Thanks, Barbara. This site looks professional. They have stolen 4 of my hubs, changed some words in the title, and changed some of the words in the text. They removed my copyright mark that I put at the top and bottom of each hub. It seems they are not English speaking because of the way they replace the words. They have taken more "pains" to disguise, rather than simply c/p'ing.
I've filed a complaint with Docstoc before - from memory, there's a "copyright" issues link at the bottom of their page - I filled in their form and they removed my hub. One of their contributors was copying a lot of hubs at that time. It's not so much the site as some dodgy contributors (I think).
I've just found two more hubs, so my total is up to 50 on this account.
I see from his home/gardening (everythingathomes .com) site that there are 1,450 + pages of hubs on the site. More hubs are starting to appear on the sites with today's date. Please keep looking (searching your name helps, but it hasn't caught all of mine - I had to search sentences from the hub) and filing DMCAs.
They like you. They really really like you! Sorry you have to go through all this, but I am so glad that you found these sites. What makes it so frustrating is that they have been around since January.
Thanks Judi, I found a whole bunch more this evening. This is another advantage of using your own photos and attributing them with your name. I was able to search for my name and came up with so many articles. For sure, these sites will be taken down once Google takes a look. Looks like all of my finance hubs were stolen and the ones about The Sims. I am sure there will be more, but it is tiring to look for them.
All three of my travel hubs were stolen. I have to work all day so I won't have time to file DMCAs until the weekend at the earliest. I venture to guess pretty much everyone on hubpages has had hubs stolen because these sites are nothing but stolen hubs. It bothers me that huge amounts of hubpages.com has been copied elsewhere but admin says they can't do anything in these situations. I know they can't file DMCAs but surely they can help bring the theft to google's attention?
I have at least 7 on that one, to go along with the 12 I found on the Entertainment and Media site. The one thing that concerns me a bit is that one of the hubs that was scraped I no longer have published here. I deleted it and ended up selling it on Constant Content. Before I listed it on Constant Content I Copyscaped it to make sure that it wasn't showing up in Google, and there were no results. That being said, if my copied hubs are not showing up in search results, maybe that is at least a good thing.
Thanks for this thread Judith, though it's so disheartening. So sorry we're all being ripped off.
I'll try to find some time to check on my hubs now that I've learned at least 4 have been copied, and will report the crimes in time, but I sense this one-by-one reporting isn't going to stop this full-on stealing.
I just found a bunch of my hub articles copied - my most popular one as well. No wonder my traffic and earnings have dropped!! This is BS. Guess I'll be spending the weekend filing these DMCA complaints and setting up Google Alerts. Fun times. Losing faith in HP...
Well, after having at least one hub copied on Squidoo, reporting it and having NO response from them, I am not hugely confident in them, either. Google is taking it down, but the 'lens' is still there. And the person has an account here (which I feel I documented quite well), but HP did not feel there were enough facts to remove the person.
This is so frustrating!!! I don't understand why HubPages won't jump in and help. This thing is huge and we could really use their support!!! It seems they are purposefully staying out of all threads on this topic.
Yep, it would be a great help if and when HubPages jumps in and helps us. I think I have to stop writing new hubs until something good happens here. I don't see why I should waste my time and effort creating new hubs just to have them stolen by those thieves. I'll file DMCAs, read and comment here, but no more hubs will be published online. Sorry to say this, but my ulcer is coming back due to this copyright infringement.
I just filed DMCAs on a bunch of hubs - this is pretty much ALL I have been able to do for about three months, aside from my real life and knocking out a few hubs to keep current here.
Judi - thanks for posting the list of site names - you've done us all a tremendous favor.
FYI - at least one site name isn't popping up (it's like the search engine stalls or something). I am hopeful maybe that's a sign Google is working on it.
Does anyone know if we should be looking at Bing, Yahoo, etc.? I did not know life on a user-generated content site would require more hours than I have to give of tracking down stolen content. I appreciate that I am learning as part of the process. But this is tiring.
I just spent some time reporting 9 of my hubs stolen. It's very odd, because my hubs, especially the ones they took, are all research type hubs. They're not for the faint of heart and you really got to love the topic to sit through and read them.
After some consideration, it appears they took the hubs that have multiple links to other hubs. For example, they took an in-depth look comparing violence in the movies The Bridge and Grizzly Man. They then took the short film reviews for each movie that I did separately. They did the same for an overview of Russian cinema, then took the film review for Mother.
The most interesting part is that though they are taking my hubs that are extremely specific, they haven't really touched the ones that get the most hits.
When I used to review ad copy for Bing, I used to see spam sites like this all the time. Very lazily done. Little consideration about content, if any at all. Hopefully Google will be more effective at removing/banning sites like these than their counterpart.
On the plus side (and it's a very big plus me thinks) - I've done a google search for the titles of each of my stolen hubs - mine appears on the first page - the 'stolen' content doesn't - so it's not having any impact on my traffic as it's not visible....
Judi Bee - a HUGE thank you for compiling these sites. You made it a lot easier to search, find the links and file a DMCA complaint. Now that I've spent the last 4 and a half hours doing this, I'm still not sure if I found all my hubs. :'(
What makes this even more frustrating is I just checked out the site financeforpeoples, and a slew of new stolen hubs posted today have just popped up. One of them is even about the 30 day 30 hub challenge!
The articles are so weird...I just looked at financingforpeoples and it didn't even make any sense to me. This is obviously an offshore hack whose English leaves a lot to be desired. It's too bad Google doesn't offer more ways for authors to protect their work as a DMCA only works if the hacker is in the US.
When I first went searching I saw that my hubs were copied in full but when I looked at them just now it appears that only parts of my hubs are showing up. However, any parts of my hubs are still mine so I have filed a complaint with Adsense to go along with the DMCA complaints I filed yesterday.
I know what you mean about the articles being weird, like everything is copied including polls and quizzes with all the questions and results. It's such a hack job and it's obvious it didn't take the thief a long time to copy each article.
I didn't find any of mine yet but I am sure I found one that was written by another writer on HubPages because I remember reading it; the writer was teaches12345. The hub they took is "The Gift of a Child's Drawing" and it's on educationforscience.com.