{"id":410,"date":"2012-04-10T19:48:36","date_gmt":"2012-04-10T19:48:36","guid":{"rendered":"https:\/\/www.buildzoom.com\/blog\/?p=410"},"modified":"2016-08-22T23:06:31","modified_gmt":"2016-08-22T23:06:31","slug":"defining-big-data-part-1-how-data-has-begun-to-breed-like-rabbits","status":"publish","type":"post","link":"https:\/\/www.buildzoom.com\/blog\/defining-big-data-part-1-how-data-has-begun-to-breed-like-rabbits","title":{"rendered":"Defining Big Data &#8211; How Data has Begun to Breed Like Rabbits"},"content":{"rendered":"<p><em>This is Part 1 of a 5-part series devoted to exploring the concept of Big Data to determine what makes it different from other <\/em><em>hyped data &#8220;revolutions&#8221; \u00a0of the past.\u00a0<\/em><\/p>\n<p>About three months ago I posted a simple question to my Facebook wall, asking whether my next computer should be an Apple or a PC.<\/p>\n<p>Over the next five hours I received over thirty opinions from students, senior executives, computer programmers, analysts and a random assortment of other friends and family. \u00a0The feedback helped me make an important consumer decision.<\/p>\n<p>From the perspective of your typical Facebook user, I had posted a question and received feedback from my network &#8211; a seemingly innocuous act.<\/p>\n<p>Let&#8217;s consider what happened from a technological perspective:<\/p>\n<p>My first action (posing the initial question) would have inserted a row in a table (let&#8217;s call it StatusUpdate), which would have contained attributes including a post date and post content. \u00a0This initial action would triggered several subsequent processes (I&#8217;ve taken some creative liberties here):<\/p>\n<ol>\n<li>A task would have run to make certain inferences based on the content. \u00a0These inferences would update a table where Facebook compiles an index of my interests (ImplicitUserInterests).<\/li>\n<li>Facebook would use these inferences to determine which contacts (UserContacts) within my network should receive the status update.<\/li>\n<li>These contacts would have their walls (WallContent) updated with my status update.<\/li>\n<li>As each contact responded, an additional table indexing their response would be updated (StatusUpdateResponse).<\/li>\n<li>The content of the response would allow Facebook to make inferences about their interests (ImplicitUserInterests).<\/li>\n<li>Their response would be shared on their wall (WallContent).<\/li>\n<li>Facebook would have been able to make an inference about the relationship between the respondent and myself (UserRelationship).<\/li>\n<li>A bunch of other stuff that I can&#8217;t even imagine would have also happened.<\/li>\n<\/ol>\n<p>We can quickly see how one simple act sets a sequence of events into motion, culminating in the creation of a large set of data.<\/p>\n<p>Now consider the following:<\/p>\n<ul>\n<li>There are over 500 million users on Facebook.<\/li>\n<li>Over 700 billion minutes are spent on Facebook each month.<\/li>\n<li>There are a host of other sites creating an amount of data that is in the same order of magnitude as Facebook.<\/li>\n<\/ul>\n<p>What does it all add up to?<\/p>\n<div>\n<p>Here&#8217;s a hint: It&#8217;s not small data.<\/p>\n<p>&#8220;Welcome to the Age of Big Data,&#8221; invites Steve Lohr in his recent <a href=\"http:\/\/www.nytimes.com\/2012\/02\/12\/sunday-review\/big-datas-impact-in-the-world.html\">NY Times piece on Big Data<\/a>.<\/p>\n<p>While acknowledging that Big Data has the scent of a &#8220;meme and a marketing term,&#8221; Lohr points out that according to IDC, a technology research firm, data is growing at a rate of 50 percent per year.<\/p>\n<p>Regardless of whether the actual you believe in the veracity of IDC\u2019s estimated rate-of-increase, \u00a0it&#8217;s difficult to dispel the notion that every year, there seems to be significantly more information at your fingertips than the previous year.<\/p>\n<p>It&#8217;s this broad acknowledgment that has allowed the concept of &#8220;&#8221;Big Data&#8221; to catch fire.<\/p>\n<p>Like most memes, Big Data comes with a slight feeling of d\u00e9j\u00e0 vu &#8211; a faint voice in the background tells us that we have seen this before. \u00a0\u00a0Those of us who have been working in information technology for the past decade (or longer) recall things like the &#8220;Invisible Web&#8221; and the &#8220;Deep Web.&#8221;<\/p>\n<p>Yet there seems to be something categorically different about this latest information &#8220;revolution,&#8221; that makes us stand up and pay attention.<\/p>\n<p>Over the next several weeks, we&#8217;ll be endeavoring to deconstruct what is different about &#8220;Big Data&#8221; in the hopes that understanding these differences will help us better understand what Big Data actually is and whether\u00a0we are talking about more hype or perhaps something real.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>This is Part 1 of a 5-part series devoted to exploring the concept of Big Data to determine what makes it different from other hyped data &#8220;revolutions&#8221; \u00a0of the past.\u00a0 About three months ago I posted a simple question to my Facebook wall, asking whether my next computer should be an Apple or a PC. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":431,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[42],"tags":[],"class_list":{"0":"post-410","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-gn-opinion","8":"entry"},"_links":{"self":[{"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/posts\/410","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/comments?post=410"}],"version-history":[{"count":0,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/posts\/410\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/media\/431"}],"wp:attachment":[{"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/media?parent=410"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/categories?post=410"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.buildzoom.com\/blog\/wp-json\/wp\/v2\/tags?post=410"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}