Death of the data warehouse
Two panellists at this year's Buy-Side Technology North American Summit talk about their firms' use of big data lakes in place of data warehouses
Hunger for data isn't going to slow down any time soon. But as the quantity of data used by firms climbs, so too do the issues surrounding it.
A good data governance strategy isn't exactly a sexy topic, but it's a necessary one to tackle with the increasing demand for data.
Scott Burleigh, executive director for JP Morgan Asset Management, said about a year and a half ago his firm made heavy investments into technology around data governance. Burleigh, who spoke on a panel at this year's Buy-Side Technology North American Summit on October 7, said the firm found there were multiple copies of data and places where the same data was processed over and over again.
"What evolved over time was that we didn't have a single version of the truth," Burleigh said. "You had different answers for the same instrument. Different rights and returns for the same security. You had weighted average credit ratings that were different between reports. Multiple answers for the same question."
Trip to the lake
A consolidated area to store the data was the answer, but not via a warehouse. Instead, the firm chose to build a big data lake.
Rashmi Gupta, a data manager at MetLife and fellow panelist, said her firm has taken the exact same approach. Instead of having a traditional centralised warehouse, everything is put into a big data lake, which serves as a data acquisition layer.
A semantics layer – a data translation layer that sits on top of the data acquisition layer – maps to the enterprise data model. Gupta said big data lakes are one of the biggest trends she sees in the industry now.
"So you have one set of information, one single version of truth, but you don't have all the cost associated and the work and labour involved in creating one single warehouse," Gupta said.
It takes very little time to build up big data lakes, according to Gupta, and they have great scalability. If there is a new application a firm wants to use, all it has to do is put it in the lake and build a translation layer on top of it.
Gupta said there are some issues around data integrity, which makes the translation layer such a critical part of the entire operation.
"It boils down to, very simply put, the whole data warehouse is now being replaced by a high-technology data service layer," Burleigh said.
Tapping at the source
Burleigh used solvency-related data as an example of how it works. With the data lake, a logical data model brings in data from multiple sources. The data is delivered through a search layer, meaning the user can ask for the type of data or data elements without specifying the source.
"You just talk to the service layer, tell it what data elements you want and it knows where they are," Burleigh said. "It serves it up to you as though it was one source."
JP Morgan has taken it a step further, according to Burleigh, by governing data at the source before it enters the data lake. By doing so, Burleigh said the firm doesn't have to worry about altering the data once it's in the data lake.
"We're identifying the source for the data that goes into the lake and we make changes, or the governance says we need to make changes to the data element," Burleigh said. "We make it at the source and it gets reflected in the data lake."
This article was originally published on sister website WatersTechnology.com.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@risk.net or view our subscription options here: http://subscriptions.risk.net/subscribe
You are currently unable to print this content. Please contact info@risk.net to find out more.
You are currently unable to copy this content. Please contact info@risk.net to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@risk.net
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@risk.net
More on Technology
Dismantling the zeal and the hype: the real GenAI use cases in risk management
Chartis explores the advantages and drawbacks of GenAI applications in risk management – firmly within the well-established and continuously evolving AI landscape
Chartis RiskTech100® 2024
The latest iteration of the Chartis RiskTech100®, a comprehensive independent study of the world’s major players in risk and compliance technology, is acknowledged as the go-to for clear, accurate analysis of the risk technology marketplace. With its…
T+1: complacency before the storm?
This paper, created by WatersTechnology in association with Gresham Technologies, outlines what the move to T+1 (next-day settlement) of broker/dealer-executed trades in the US and Canadian markets means for buy-side and sell-side firms
Empowering risk management with AI
This webinar explores how artificial intelligence (AI) can strip out the overheads and effort of rapidly modelling, monitoring and mitigating risk
Core-Payments for business leaders: why real-time access to payment data is key to long‑term business success
Business leaders require easy access to timely, reliable and complete information across post-trade processes. Aside from the usual requirements of senior managers to optimise for risk, revenues and costs, they increasingly need to demonstrate to their…
Risk applications and the cloud: driving better value and performance from key risk management architecture
Today's financial services organisations are increasingly looking to move their financial risk management applications to the cloud. But, according to a recent survey by Risk.net and SS&C Algorithmics, many risk professionals believe there is room for…
Machine learning models: the validation challenge
Machine learning models are seeing increasing demand across the capital markets spectrum. But how can firms improve their chances of gaining internal and regulatory approval for these type of models?