Big Data Vs. Dark Data
Looking at what the buzzword terms really describe for financial data management
In a story last week, I had looked at the use of the term "dark data" and what it means. That is a newer buzzword than one that has been thrown around much more over the past several years, "big data."
Often when "big data" has been used in financial services operations discussions, particularly in data management operations discussions, it really pertains to what is being done with data, such as how firms mine larger amounts of data or how they organize it to get the most insight and value out of it.
So this raises the question of whether "big data" is really even an appropriate term. Increased activity concerning know-your-customer (KYC) data might lead one to think that type of data is rising to the level of "big data," but if compared to something like phone call records, even from just one major cell service provider, KYC data could look small in volume by comparison.
Is there a threshold that KYC data or financial services industry data as a whole needs to cross to truly be "big" and not just "medium," perhaps? Customer details, if cross-referenced with extensive transaction data, could be that "big" in this industry.
Failing that, firms may be better off not getting caught up in "big data" hysteria. It's more effective to work to know exactly what data you do have, how frequently you are getting it and what you are trying to achieve with it—namely, what insights you are trying to generate from it.
In effect, dark data and big data are really about the same thing—data management. Dark data describes knowing where all the meaningful data is and how to aggregate it, while big data infers understanding the complete picture of what data is coming in, especially if the volume of that data is getting larger.
To join a discussion on how "big data" should be defined, visit Inside Reference Data's LinkedIn discussion group.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: https://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Data Management
Editor’s Picks: Our best from 2025
Anthony Malakian picks out 10 stories from the past 12 months that set the stage for the new year.
Market data costs defy cyclicality
Trading firms continue to grapple with escalating market data costs. Can innovative solutions and strategic approaches bring relief?
LSEG partners with Citi, DTCC goes on-chain, AI on the brain, and more
The Waters Cooler: Trading Technologies buys OpenGamma, CT Plan updates, and the beginning of benchmarking in this week’s news roundup.
AI & data enablement: A looming reality or pipe dream?
Waters Wrap: The promise of AI and agents is massive, and real-world success stories are trickling out. But Anthony notes that firms still need to be hyper-focused on getting the data foundation correct before adding layers.
Data managers worry lack of funding, staffing will hinder AI ambitions
Nearly two-thirds of respondents to WatersTechnology’s data benchmark survey rated the pressure they’re receiving from senior executives and the board as very high. But is the money flowing for talent and data management?
Data standardization is the ‘trust accelerator’ for broader AI adoption
In this guest column, data product managers at Fitch Solutions explain AI’s impact on credit and investment risk management.
As AI pressures mount, banks split on how to handle staffing
Benchmarking: Over the next 12 months, almost a third of G-Sib respondents said they plan to decrease headcount in their data function.
Everyone wants to tokenize the assets. What about the data?
The IMD Wrap: With exchanges moving market data on-chain, Wei-Shen believes there’s a need to standardize licensing agreements.