• About BuildZoom
  • BuildZoom Data
  • Find a Contractor
  • Permit Map

BuildZoom News & Analysis

Articles about the housing market in the USA and Homeowner Guides

  • Home
  • Building News
  • Analysis
  • Economic Indicators
  • Rankings
  • Homeowner Guides
  • Other Resources

Defining Big Data – How Data has Begun to Breed Like Rabbits

April 10, 2012 by Jiyan 1 Comment

This is Part 1 of a 5-part series devoted to exploring the concept of Big Data to determine what makes it different from other hyped data “revolutions”  of the past. 

About three months ago I posted a simple question to my Facebook wall, asking whether my next computer should be an Apple or a PC.

Over the next five hours I received over thirty opinions from students, senior executives, computer programmers, analysts and a random assortment of other friends and family.  The feedback helped me make an important consumer decision.

From the perspective of your typical Facebook user, I had posted a question and received feedback from my network – a seemingly innocuous act.

Let’s consider what happened from a technological perspective:

My first action (posing the initial question) would have inserted a row in a table (let’s call it StatusUpdate), which would have contained attributes including a post date and post content.  This initial action would triggered several subsequent processes (I’ve taken some creative liberties here):

  1. A task would have run to make certain inferences based on the content.  These inferences would update a table where Facebook compiles an index of my interests (ImplicitUserInterests).
  2. Facebook would use these inferences to determine which contacts (UserContacts) within my network should receive the status update.
  3. These contacts would have their walls (WallContent) updated with my status update.
  4. As each contact responded, an additional table indexing their response would be updated (StatusUpdateResponse).
  5. The content of the response would allow Facebook to make inferences about their interests (ImplicitUserInterests).
  6. Their response would be shared on their wall (WallContent).
  7. Facebook would have been able to make an inference about the relationship between the respondent and myself (UserRelationship).
  8. A bunch of other stuff that I can’t even imagine would have also happened.

We can quickly see how one simple act sets a sequence of events into motion, culminating in the creation of a large set of data.

Now consider the following:

  • There are over 500 million users on Facebook.
  • Over 700 billion minutes are spent on Facebook each month.
  • There are a host of other sites creating an amount of data that is in the same order of magnitude as Facebook.

What does it all add up to?

Here’s a hint: It’s not small data.

“Welcome to the Age of Big Data,” invites Steve Lohr in his recent NY Times piece on Big Data.

While acknowledging that Big Data has the scent of a “meme and a marketing term,” Lohr points out that according to IDC, a technology research firm, data is growing at a rate of 50 percent per year.

Regardless of whether the actual you believe in the veracity of IDC’s estimated rate-of-increase,  it’s difficult to dispel the notion that every year, there seems to be significantly more information at your fingertips than the previous year.

It’s this broad acknowledgment that has allowed the concept of “”Big Data” to catch fire.

Like most memes, Big Data comes with a slight feeling of déjà vu – a faint voice in the background tells us that we have seen this before.   Those of us who have been working in information technology for the past decade (or longer) recall things like the “Invisible Web” and the “Deep Web.”

Yet there seems to be something categorically different about this latest information “revolution,” that makes us stand up and pay attention.

Over the next several weeks, we’ll be endeavoring to deconstruct what is different about “Big Data” in the hopes that understanding these differences will help us better understand what Big Data actually is and whether we are talking about more hype or perhaps something real.

Filed Under: Opinion

About Jiyan

I'm one of the BuildZoom founders. On the blog, I'm primarily interested in writing about emerging trends and technologies in the construction industry; and online marketplaces. I did my studies at Georgetown University and the London School of Economics.

If you'd like to connect, my Twitter handle is @jiyannwei or you can e-mail me at [email protected].

Trackbacks

  1. Big Data & Social Media: Do Two Memes Make a Right? | BuildZoom Blog says:
    April 13, 2012 at 7:16 pm

    […] In Part 1 of our series exploring the concept of “Big Data,” we explored how a seemingly innocuous act (posting to your Facebook wall) can have significant ramifications.  The significance of this causal relationship is two-fold: First, it shows how an online action can trigger a multitude of  data-producing results as well as induce subsequent actions from interconnected users, which in turn produce more data-producing results.   Second, it show (on a micro level) the broader impact of social media when it comes to data production. […]

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Subscribe to our mailing list

* indicates required

Unlock the power of BuildZoom Data.

Data on every licensed contractor in the US and 350M+ building permits.

Apply for Data Access >

About BuildZoom:

BuildZoom has analyzed nearly 6 million contractors and over 350 million building permits to help you find the perfect general contractor for any type of remodeling project. Hire a great contractor through our bidding system.

About BuildZoom

  • About BuildZoom
  • BuildZoom Data
  • Privacy Policy
  • Terms of Service
  • Careers
  • Contact
  • Find a Contractor

  • San Francisco Contractors
  • New York Contractors
  • Los Angeles Contractors
  • Chicago Contractors
  • Boston Contractors
  • Seattle Contractors
  • All Locations
  • BuildZoom

    © 2023. All Rights Reserved.
    301 Howard St.
    San Francisco, CA 94105

    Copyright © 2026 · Magazine Pro Theme on Genesis Framework · WordPress · Log in

    Let Us Know Your Contact Details and We'll Add You to Our Subscription
    Please enable JavaScript in your browser to complete this form.
    Name *
    Loading