Tom H. C. Anderson - Next Gen Market Research™al payday express top up loan sitemap
can i be arrested for default on payday l sitemap
approved cash advance west constance road suffolk va sitemap
00 payday loans up to 3000 sitemap
ameriloanv online cash advance payday loans no fax sitemap
direct loan goodwill adjustment sitemap
as cash advance loans tuc sitemap
cash advance credit card hsbc sitemap
cashmax payday loan sitemap
cash advances in memphis t sitemap
cash advance interest charge calculator sitemap
consolidate payday loans o sitemap
christmas loans pay sitemap
cash loans youngstown oh sitemap
an super payday loan lender sitemap
direct loan companies not brokers sitemap
anz apply for a loan online sitemap
approved cash advance on 28th street sitemap
can i get a payday loan with child support sitemap
ace cash advance aurora co sitemap
6 month payday loan no credit ch sitemap
al what states are internet payday loans illegal sitemap
an baltimore payday loans sitemap
castle payday loans bbb sitemap
can i apply for more than one payday loan at a time sitemap
cash loans decatur alabama sitemap
cash advance ocbc sitemap
an abbey national loans sitemap
bottom dollar payday loans address sitemap
cash advance in dayton sitemap
cheapest online loans sitemap
can a cash advance be paid in payment sitemap
are online payday loans legal in minnesota sitemap
can u have two payday loan sitemap
cash advance credit card aeon sitemap
best online unsecured personal loans sitemap
an what documents do you need to get a payday loan sitemap
brand new payday loan sitemap
action payday loan addr sitemap
an northway financial payday loan phone num sitemap
direct leanders pay day loan sitemap
cash back loa sitemap
advice on paying back payday lo sitemap
aaa payday loans port charlotte sitemap
cash converters loans epping sitemap
canada faxless payday loans sitemap
cash advance etown ky sitemap
cash loan for motorcycle sitemap
can you go to jail for payday loans in missouri sitemap
an cash 2u payday loans virgi sitemap
cash advance in danville ky sitemap
cash advance in norfolk virginia sitemap
1 hr payday loans no credit check sitemap
cash loans online up to 10000 sitemap
best trusted payday lo sitemap
are payday loans safe sitemap
ck payday loans cant pay it b sitemap
buckeye title loans glendale az sitemap
az no payday loans in arizona sitemap
cash converters loans gold coast sitemap
cash advance nile sitemap
cash advance gulfport ms sitemap
cash checking plac sitemap
cash loans like plain green sitemap
cash loans in san jose ca sitemap
direct express debit card payday loans sitemap
al payday loans illegal in sitemap
cfsa members payday lo sitemap
cash loan in orlando fl sitemap
credit union payday loans liverp sitemap
check into cash stockton ca sitemap
direct loans address georgia sitemap
cash advance loans for business sitemap
can you get a cash loan without a bank account sitemap
best online payday loan softw sitemap
as payday loans in lewisville tx sitemap
amscot payday loan online sitemap
are payday loans open on sunday sitemap
best direct payday lo sitemap
cash on delivery shopping site sitemap
cash advance united group sitemap
cash advance in bellevue ne sitemap
advance payday loans lawton ok sitemap
cash advance mcminnville tn sitemap
car title loans el cajon sitemap
debt loans sitemap
cash advance bois sitemap
ck get a fast easy payday loan within 24 sitemap
100 day payday loans online sitemap
american cash payday loan revi sitemap
cash loan processing ll sitemap
as how many payday loans can you have in tx sitemap
dc payday loan washington sitemap
ace cash advance lawton ok sitemap
cash advance north little rock ar sitemap
cash loans kalgoorlie sitemap
car title loans nh sitemap
ace cash express lawrenceville ga sitemap
cash advance poway ca sitemap
an what do i need to get a payday loan from pls sitemap
ce cash advance florence sitemap
car title loans victoria bc sitemap
cash advance louisa ky sitemap
cash loans in new york sitemap
cash loan places in lorain ohio sitemap
cash loans in delaware sitemap
ay payday loans in account today sitemap
car title loans az online sitemap
ay approval payday lo sitemap
advance cash florence sitemap
ca payday loan victorville sitemap
cash earning extr sitemap
cash advance ocal sitemap
cash advance application format sitemap
credit union payday alternative lo sitemap
cash advance rogersville t sitemap
ace cash express locations dallas sitemap
cash loans in bowling green ky sitemap
da fastest payday loan can sitemap
cash advance for students with student loans sitemap
direct loan request sitemap
ace cash express fort mill south carolina sitemap
cash loan lender no sitemap
defaulting on a payday loan in missouri sitemap
bc payday loan stores in vancouver sitemap

More Than Market Research - Gain The Information Advantage

Tom H. C. Anderson - Next Gen Market Research™ header image 6

Forget Big Data, Think Mid Data

March 7th, 2013 · 6 Comments

Stop Chasing the Big Data; Mid Data makes more sense
[Re-posted from OdinText.com Blog]

After attending the American Marketing Association’s first conference on Big Data this week, I’m even more convinced of what I already suspected from speaking to hundreds of Fortune 1000 marketers the last couple of years. Extremely few are working with anything approaching what would be called “Big Data” – And I believe they don’t need to – But many should start thinking about how to work with Mid Data!

BigDataMidDataSmallData

“Big Data”, “Big Data”, “Big Data”. It seems like everyone is talking about it, but I find extremely few researchers are actually doing it. Should they be?

If you’re reading this, chances are that you’re a social scientist or business analyst working in consumer insights or related area. I think it’s high time that we narrowed the definition of ‘Big Data’ a bit and introduced a new more meaningful and realistic term “MID DATA” to describe what is really the beginning of Big Data.

If we introduce this new term, it only makes sense that we refer to everything that isn’t Big or Mid data as Small Data (I hope no one gets offended).

Small Data

I’ve included a chart, and for simplicity will think of size here as number of records, or sample if you prefer.

‘Small Data’ can include anything from one individual interview in qualitative research to several thousand survey responses in longitudinal studies. At this level of size, quantitative and qualitative can technically be lumped together, as neither currently fit the generally agreed upon (and admittedly loose) definition of what is currently “Big Data”. You see, rather than a specific size, the current definition of Big Data varies depending on the capabilities of the organization in question. The general rule for what would be considered Big Data would be data which cannot be analyzed by commonly used software tools.

As you can imagine, this definition is an IT/hardware vendor’s dream, as it describes a situation where a firm does not have the resources to analyze (supposedly valuable) data without spending more on infrastructure, usually a lot more.

Mid Data

What then is Mid Data? At the beginning of Big Data, some of the same data sets we might call Small Data can quickly turn into Big Data. For instance, the 30,000-50,000 records from a customer satisfaction survey which can sometimes be analyzed in commonly available analytical software like IBM-SPSS without crashing. However, add text comments to this same data set and performance slows considerably. These same data sets will now often take too long to process or more typically crash.

If these same text comments are also coded as is the case in text mining, the additional variables added to this same dataset may increase significantly in size. This then is currently viewed as Big Data, where more powerful software will be needed. However I believe a more accurate description would be Mid Data, as it is really the beginning of Big Data, and there are many relatively affordable approaches to dealing with this size of data. But more about this in a bit…

Big Data

Now that we’ve taken a chunk out of Big Data and called it Mid Data, let’s redefine Big Data, or at least agree on where Mid Data ends and when ‘Really Big Data’ begins.

To understand the differences between Mid Data and Big Data we need to consider a few dimensions. Gartner analyst Doug Laney famously referred to Big Data as being 3-Dimensional; that is having increasing volume, variety, and velocity (now commonly referred to as the 3V model).

To understand the difference between Mid Data and Big Data though, only two variables need to be considered, namely Cost and Value. Cost (whether in time or dollars) and expected value are of course what make up ROI. This could also be referred to as the practicality of Big Data Analytics.

While we often know that some data is inherently more valuable than other data (100 customer complaints emailed to your office should be more relevant than a 1000 random tweets about your category), one thing is certain. Data that is not analyzed has absolutely no value.

As opposed to Mid Data, to the far right of Big Data or Really Big Data, is really the point beyond which an investment in analysis, due to cost (which includes risk of not finding insights worth more than the dollars invested in the Big Data) does not make sense. Somewhere after Mid Data, big data analytics will be impractical both theoretically, and for your firm in very real economic terms.

Mid Data on the other hand then can be viewed as the Sweet Spot of Big Data analysis. That which may be currently possible, worthwhile and within budget.

So What?

Mid Data is where many of us in market research have a great opportunity. It is where very real and attainable insight gains await.

Really Big Data, on the other hand, may be well past a point of diminishing returns.

On a recent business trip to Germany I had the pleasure of meeting a scientist working on a real Big Data project, the famous Large Hedron Collider project at CERN. Unlike the Large Hadron Collider, consumer goods firms will not fund the software and hardware needed to analyze this level of Big Data. Data magnitudes common at the Collider (output of 150 million sensors delivering data 40 million times per second) are not economically feasible but nor are they needed. In fact, scientists at CERN do not analyze this amount of Big Data. Instead, they filter out 99.999% of collisions focusing on just 100 of the “Collisions of Interest” per second.

The good news for us in business is that if we’re honest, customers really aren’t that difficult to understand. There are now many affordable and excellent Mid Data software available, for both data and text mining, that do not require the exabytes of data or massively parallel software running on thousands of servers. While magazines and conference presenters like to reference Amazon, Google and Facebook, even these somewhat rare examples sound more like IT sales science fiction and do not mention the sampling of data that occurs even at these companies.

As scientists at Cern have already discovered, it’s more important to properly analyze the fraction of the data that is important (“of interest”) than to process all the data.

At this point some of you may be wondering, well if Mid Data is more attractive than Big Data, then isn’t small data even better?

The difference of course is that as data increases in size we can not only be more confident in the results, but we can also find relationships and patterns that would not have surfaced in traditional small data. In marketing research this may mean the difference between discovering a new niche product opportunity or quickly countering a competitor’s move. In Pharma, it may mean discovering a link between a smaller population subgroup and certain high cancer risk, thus saving lives!

Mid Data could benefit from further definition and best practices. Ironically some C-Suite executives are currently asking their IT people to “connect and analyze all our data” (specifically the “varied” data in the 3-D model), and in the process they are attempting to create Really Big (often bigger than necessary) Data sets out of several Mid Data sets. This practice exemplifies the ROI problem I mentioned earlier. Chasing after a Big Data holy grail will not guarantee any significant advantage. Those of us who are skilled in the analysis of Small or Mid Data clearly understand that conducting the same analysis across varied data is typically fruitless.

It makes as much sense to compare apples to cows as accounting data to consumer respondent data. Comparing your customers in Japan to your customers in the US makes no sense for various reasons ranging from cultural differences to differences in very real tactical and operational options.

No, for most of us, Mid Data is where we need to be.

@TomHCAnderson

[Full Disclosure: Tom H. C. Anderson is Managing Partner of Anderson Analytics which develops and sells patent pending data mining and text analytics software platform OdinText]

[Post to Twitter] 

Tags: Anderson Analytics · Big Data · Datamining · Market Research · Marketing research · Mid Data · Odin Text · OdinText · Small Data · Text Analytics · ama · american marketing association · text mining

6 responses so far ↓

  • 1 Kelley Styring // Mar 7, 2013 at 4:21 pm

    On the money, Tom. And small data is going to help mid data believers understand and communicate the findings with business relevance, simplicity, synthesis and insight.

  • 2 David Rabjohns // Mar 7, 2013 at 5:35 pm

    Great article nicely done.

  • 3 Doug Wicks // Mar 7, 2013 at 8:14 pm

    Thanks for bringing a researcher’s perspective to Big Data. A number of points to add to the conversation. 1) variety – data of different types - video, images, tweets, sensor data - these challenge researchers as they go about analyzing Big Data; 2) arguably, there is a fourth v - veracity - Is this data accurate, trustworthy? Also a key factor for researchers working with Big Data. And of course the fundamental in working with ANY data, big, mid or small - are we asking the right questions, do we have the right data to answer them, and are we providing the insights our clients need when they think about and use Big Data to solve address issues.

  • 4 Tom H C Anderson // Mar 8, 2013 at 2:19 pm

    Thanks, I find analytics/methodology is left out of decision, and it’s all left to IT/Purchasing who try to get a solution that attempts to address everyone’s needs.

    I think this is fine if focused only on operational needs. But if you implement such a system for the purpose of insights, then there needs to be some serious thinking about what data makes sense to analyze together.

    If you can’t currently analyze one Mid Data source properly, perhaps it makes a lot more sense to deal with that before spending lots of time and $ to link them all up to one giant Real Big Database, only to find out it made absolutely no sense to analyze these sources together after all!?

  • 5 Christy Pogorelac // Mar 13, 2013 at 3:42 pm

    Thanks - I attended the conference and thought the panel discussion was extremely interesting.

  • 6 Georgette Asherman // Jul 10, 2014 at 7:20 am

    Tom, you are on the mark. And most importantly you point out the importance of sampling, even when there really is big, big data. I had people tell me that their 100,000 record data set is Big Data, when it is not. Most of our challenges are in Mid Data because many of our small sample approaches stop working.

Leave a Comment