Death of the Data Warehouse

Two panelists at the Buy-Side Technology North American Summit talk about their firms' use of big data lakes in place of data warehouses.

big-data-lake
Panelists at the Buy-Side Technology North American Summit 2015 discuss the future of data governance processes.

Hunger for data isn't going to slow down anytime soon. But as the quantity of data used by firms climbs, so too do the issues surrounding it.

A good data governance strategy isn't exactly a sexy topic, but it's a necessary one to tackle with the increasing demand for data.

Scott Burleigh, executive director for JPMorgan Asset Management, said about a year and a half ago his firm made heavy investments into technology around data governance. Burleigh, who spoke on a panel at this year's Buy-Side Technology North American Summit, said the firm found there were multiple copies of data and places where the same data was processed over and over again.

"What evolved over time was that we didn't have a single version of the truth," Burleigh said. "You had different answers for the same instrument. Different rights and returns for the same security. You had weighted average credit ratings that were different between reports. Multiple answers for the same question."

Trip to the Lake

A consolidated area to store the data was the answer, but not via a warehouse. Instead, the firm chose to build a big data lake.

Rashmi Gupta, a data manager at MetLife and fellow panelist, said her firm has taken the exact same approach. Instead of having a traditional centralized warehouse, everything is put into a big data lake, which serves as a data acquisition layer.

A semantics layer ─ a data translation layer that sits on top of the data acquisition layer ─ maps to the enterprise data model. Gupta said big data lakes are one of the biggest trends she sees in the industry now.

"So you have one set of information, one single version of truth, but you don't have all the cost associated and the work and labor involved in creating one single warehouse," Gupta said.

It takes very little time to build up big data lakes, according to Gupta, and they have great scalability. If there is a new application a firm wants to use, all it has to do is put it in the lake and build a translation layer on top of it.

Gupta said there are some issues around data integrity, which makes the translation layer such a critical part of the entire operation.

"It boils down to, very simply put, the whole data warehouse is now being replaced by a high-technology data service layer," Burleigh said.

Tapping at the Source

Burleigh used solvency-related data as an example of how it works. With the data lake, a logical data model brings in data from multiple sources. The data is delivered through a search layer, meaning the user can ask for the type of data or data elements without specifying the source.

"You just talk to the service layer, tell it what data elements you want and it knows where they are," Burleigh said. "It serves it up to you as though it was one source."

JPMorgan has taken it a step further, according to Burleigh, by governing data at the source before it enters the data lake. By doing so, Burleigh said the firm doesn't have to worry about altering the data once it's in the data lake.

"We're identifying the source for the data that goes into the lake and we make changes, or the governance says we need to make changes to the data element," Burleigh said. "We make it at the source and it gets reflected in the data lake."

The Bottom Line

  • As firms look to consolidate their data, big data lakes have become popular amongst some firms.
  • Big data lakes are an efficient, cost-effective and scalable way to manage large amounts of data thanks to the layers that can be built on top of them.
  • Governance functions can also be added to the source of the data, allowing data to be altered or changed before entering the big data lake.

Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.

To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe

You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.

A tech revolution in an old-school industry: FX

FX is in a state of transition, as asset managers and financial firms explore modernizing their operating processes. But manual processes persist. MillTechFX’s Eric Huttman makes the case for doubling down on new technology and embracing automation to increase operational efficiency in FX.

Most read articles loading...

You need to sign in to use this feature. If you don’t have a WatersTechnology account, please register for a trial.

Sign in
You are currently on corporate access.

To use this feature you will need an individual account. If you have one already please sign in.

Sign in.

Alternatively you can request an individual account here