ArtsAutosBooksBusinessEducationEntertainmentFamilyFashionFoodGamesGenderHealthHolidaysHomeHubPagesPersonal FinancePetsPoliticsReligionSportsTechnologyTravel

Data-Mining: Social Media, User Privacy and Big Brother

Updated on January 3, 2013
There's a lot of data on the Internet. So, who's making money off of it and what does it mean for you?
There's a lot of data on the Internet. So, who's making money off of it and what does it mean for you? | Source


It’s 11:45 a.m. and you’re hungry, so you grab your laptop and head over to the local Perkins restaurant. As you wait to be seated, you notice a sign on the wall that says “’Like’ Perkins on Facebook to have a chance to win a cruise to the Caribbean!” The idea of a relaxing vacation excites you, so you log onto Facebook and “like” the Perkins Restaurant and Bakery page, even before taking your first sip of freshly-poured coffee. On the right-hand side of the page, you come across a few friends who have also “liked” the Perkins page. You decide to go on a virtual adventure to discover what else your friends are fans of. Before you know it, your food has arrived, and you’ve “liked” 20 additional Facebook pages.

Whether you know it or not, those seemingly unimportant “likes” are extremely valuable, maybe not to you, but, to the businesses that want to sell you things, they’re priceless. Well, almost priceless. These businesses will purchase your personal information from “data-mining” companies. The more information they know about you, the easier it is for them to decide what to sell you, or what not to sell you.

A data-mining company is “any organization that gathers personal information from the Internet with the intent to sell it for a profit.” For example, Google offers a data-mining service known as “DoubleClick.” When you visit a website, such as, data-tracking services install a code into your cookies. That code basically records everything you do on a website. If you were to click on the “Health” hyperlink on the TIME website, the data-tracker would record that and subsequently have something to sell to their clients: the information that you are interested in "health" topics. Why would a business want to buy that information? Because then it could tailor its ads specifically to your interests (assuming, of course, that you didn’t accidentally click the link).

Mark Zuckerberg - Facebook Creator
Mark Zuckerberg - Facebook Creator | Source


According to Facebook, there are more than 500 million active users, 50 percent of which log on in any given day. Facebook was designed for people to share information, regardless of how pointless the information may be. It’s no stretch of the imagination, then, that Facebook is an excellent place to mine for data. Facebook’s privacy policy states that they don’t sell any of your personal information to third parties. They don’t have to. Facebook users “like” and “share” things with their friends all the time, which happens to be a fairly good indicator of one's interests.

According to Joel Stein’s article entitled “Your Data, Yourself,” 23.1 percent of all online ads not on search engines, video or e-mail run on Facebook (Stein, 2011). That is an incredibly large number of ads. Not only do advertisers have the ability to specify a demographic of people that they’d like to target their ads toward, but Facebook has also developed a system that shows a user products their friends have “liked” or purchased.

Zuckerberg explains this concept more thoroughly:

"So this isn't an ad that's going to go to a lot of people. Basically it- when you put that information in your profile that you bought a scarf and that you like that scarf, that’s something that your friends might find interesting, right? So what we’d do is we might show that information to your friends a little bit more proactively as an ad.”

This method of marketing doesn’t seem to be invasive. After all, the user is the one sharing the information and their friend is the one “endorsing” it. However, Facebook didn’t always use such an innocent marketing scheme. During an interview with 60 Minutes in January of 2008, Zuckerberg admitted to using a program known as “Beacon.” The program tracked what a Facebook user was buying on other websites and then notified the user’s friends via Facebook. One unfortunate Facebook user bought a wedding ring online for his girlfriend, but the surprise was, uh, offset when Facebook informed all of his friends of the purchase (Stahl, 2008). Once users figured out what Beacon was doing, there was an immediate resistance to the software. Beacon was shutdown in September of 2009 (Facebook Beacon, 2011).


The Government: Big Brother is Watching You

While fiction books like George Orwell’s 1984 seem to be too farfetched for any basis in reality, the Internet has begun to challenge that notion. In fact, if you think private businesses are the only ones interested in your personal information, you are wildly mistaken. Unlike those private businesses, however, the government uses the information in a more deterministic way: it looks for trends. The more information it can obtain, the more accurately it can identify patterns.

According to a 2004 Data Mining report from the United States General Accounting Office, the United States’ top six reasons for data-mining are to (1) improve service and performance; (2) detect fraud, waste and abuse; (3) analyze scientific and research information; (4) manage human resources; (5) detect criminal activities and patterns; and (6) analyze intelligence and detect terrorist activities (Nelligan, 2004).

The Department of Defense, however, is primarily interested in using this large amount of data to detect terrorist activities, most notably since 9/11. The program is certainly for a worthy cause, but some people believe it goes too far. Mark Clayton of Christian Science Monitor explains:

"The system - parts of which are operational, parts of which are still under development - is already credited with helping to foil some plots. It is the federal government's latest attempt to use broad data-collection and powerful analysis in the fight against terrorism. But, by delving deeply into the digital minutiae of American life, the program is also raising concerns that the government is intruding too deeply into citizens' privacy (Clayton, 2006)."

Dr. Kielman, another member of the Christian Science Monitor, had this to say:

“Consider Starlight, which along with other “visualization” software tools can give human analysts a graphical view of data. Viewing data in this way could reveal patterns not obvious in text or number form. Understanding the relationships among people, organizations, places, and things – using social-behavior analysis and other techniques – is essential to going beyond mere data mining to comprehensive “knowledge discovery in databases” (Hampton, 2006).”


Privacy is a difficult concept to define, especially in the online world. Most of the information that data-mining companies are after is the information that you’re sharing with the world anyway. That’s why people create online personas: they want people to know who they are. Is it that big of a surprise, then, that companies are interested in knowing what you like? If anything, they’re doing you a favor by providing advertisements of thing you may actually be interested in.

If you are concerned about people judging your online profile, then change your privacy settings to reflect your desired visibility. If you are concerned about computers collecting your personal information, then you’re either paranoid or you simply don’t understand that these computers aren’t out to judge you, they simply want to suggest a product to you.

According to Facebook, users spend more than 700 billion minutes per month on the website. That is a lot of server requests for a website that has never crashed, and that kind of service needs to be paid for. In order to finance a website of that magnitude without charging its users, Facebook sells ad space. They don’t sell your personal information, as many people have claimed. In fact, if your privacy settings are set to ‘private,’ third parties can only access your basic information, which won't do them much good.

The Internet is information; the data-mining trend will persist – and grow.


Clayton, M. (2006, February 9). Us plans massive data sweep. The Christian Science Monitor, Retrieved from

Facebook Beacon. (2011, March 8). In Wikipedia, The Free Encyclopedia. Retrieved 03:05, April 15, 2011, from

Hampton, M. (2006, February 06). Advise: now everyone can be a terrorist, or a crime victim. Retrieved from

Nelligan, J. (2004, May). Gao data mining report. Retrieved from

Stahl, L. (2008, January 13). The face behind facebook. Retrieved from

Stein, J. (2011, March 21). Your data, yourself. TIME, 40-46.


    0 of 8192 characters used
    Post Comment

    No comments yet.


    This website uses cookies

    As a user in the EEA, your approval is needed on a few things. To provide a better website experience, uses cookies (and other similar technologies) and may collect, process, and share personal data. Please choose which areas of our service you consent to our doing so.

    For more information on managing or withdrawing consents and how we handle data, visit our Privacy Policy at:

    Show Details
    HubPages Device IDThis is used to identify particular browsers or devices when the access the service, and is used for security reasons.
    LoginThis is necessary to sign in to the HubPages Service.
    Google RecaptchaThis is used to prevent bots and spam. (Privacy Policy)
    AkismetThis is used to detect comment spam. (Privacy Policy)
    HubPages Google AnalyticsThis is used to provide data on traffic to our website, all personally identifyable data is anonymized. (Privacy Policy)
    HubPages Traffic PixelThis is used to collect data on traffic to articles and other pages on our site. Unless you are signed in to a HubPages account, all personally identifiable information is anonymized.
    Amazon Web ServicesThis is a cloud services platform that we used to host our service. (Privacy Policy)
    CloudflareThis is a cloud CDN service that we use to efficiently deliver files required for our service to operate such as javascript, cascading style sheets, images, and videos. (Privacy Policy)
    Google Hosted LibrariesJavascript software libraries such as jQuery are loaded at endpoints on the or domains, for performance and efficiency reasons. (Privacy Policy)
    Google Custom SearchThis is feature allows you to search the site. (Privacy Policy)
    Google MapsSome articles have Google Maps embedded in them. (Privacy Policy)
    Google ChartsThis is used to display charts and graphs on articles and the author center. (Privacy Policy)
    Google AdSense Host APIThis service allows you to sign up for or associate a Google AdSense account with HubPages, so that you can earn money from ads on your articles. No data is shared unless you engage with this feature. (Privacy Policy)
    Google YouTubeSome articles have YouTube videos embedded in them. (Privacy Policy)
    VimeoSome articles have Vimeo videos embedded in them. (Privacy Policy)
    PaypalThis is used for a registered author who enrolls in the HubPages Earnings program and requests to be paid via PayPal. No data is shared with Paypal unless you engage with this feature. (Privacy Policy)
    Facebook LoginYou can use this to streamline signing up for, or signing in to your Hubpages account. No data is shared with Facebook unless you engage with this feature. (Privacy Policy)
    MavenThis supports the Maven widget and search functionality. (Privacy Policy)
    Google AdSenseThis is an ad network. (Privacy Policy)
    Google DoubleClickGoogle provides ad serving technology and runs an ad network. (Privacy Policy)
    Index ExchangeThis is an ad network. (Privacy Policy)
    SovrnThis is an ad network. (Privacy Policy)
    Facebook AdsThis is an ad network. (Privacy Policy)
    Amazon Unified Ad MarketplaceThis is an ad network. (Privacy Policy)
    AppNexusThis is an ad network. (Privacy Policy)
    OpenxThis is an ad network. (Privacy Policy)
    Rubicon ProjectThis is an ad network. (Privacy Policy)
    TripleLiftThis is an ad network. (Privacy Policy)
    Say MediaWe partner with Say Media to deliver ad campaigns on our sites. (Privacy Policy)
    Remarketing PixelsWe may use remarketing pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to advertise the HubPages Service to people that have visited our sites.
    Conversion Tracking PixelsWe may use conversion tracking pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to identify when an advertisement has successfully resulted in the desired action, such as signing up for the HubPages Service or publishing an article on the HubPages Service.
    Author Google AnalyticsThis is used to provide traffic data and reports to the authors of articles on the HubPages Service. (Privacy Policy)
    ComscoreComScore is a media measurement and analytics company providing marketing data and analytics to enterprises, media and advertising agencies, and publishers. Non-consent will result in ComScore only processing obfuscated personal data. (Privacy Policy)
    Amazon Tracking PixelSome articles display amazon products as part of the Amazon Affiliate program, this pixel provides traffic statistics for those products (Privacy Policy)
    ClickscoThis is a data management platform studying reader behavior (Privacy Policy)