CTO Blog

Opinions expressed on this blog reflect the writer’s views and not the position of the Capgemini Group

TechnoVision 2015 - Data Apart Together

Category : Technology

Thriving On Data #4 - Data Apart Together

What if there was not one single source of truth in corporate data after all? Many enterprises are now adapting to a 'federated' business reality in which many different sources of data exist, along with even more views on how to use that data. Dealing with this situation, and ultimately thriving on it, requires smart use of the next generation of Master Data Management and Business Process Management tools, combined with all the goodness that comes from the Big Data technologies wave. Yet there is much more to it: it is a matter of corporate governance, in which collaboration is critical to bringing the right data together. Thus, data that is apart together becomes a part of a powerful digital platform that lets the enterprise go in any current or future direction, enabled by data.

As Capgemini’s research with MIT Sloan, the Digital Advantage, found, companies that become digital outperform their peers. Most tellingly, however, it was the companies that took the conservative route to digitization that followed the most managed path towards becoming a digital enterprise. The challenge for any organization looking to become digital is how to leverage all of its data and enable the business to combine that data. The 'fashionista' approach is to look opportunistically towards technology silos for point solutions; the 'conservative' approach is to look towards governance and a consistent way for every part of the business to combine information according to its individual needs.

Well, how about leveraging some good old conservatism while becoming more agile and innovative at the same time?

This view on information is what underpins the Business Data Lake, which Capgemini co-innovated with Pivotal. Data Apart Together is not simply about how you combine data; it's about how you enable different business units to combine data in different ways. This is where governance, in particular Master Data Management (MDM) and Reference Data Management (RDM), delivers huge benefits to businesses. The role of governance here is not to constrain the business by imposing a single view, but to concentrate on how the business can collaborate around information. This view on governance is essential when thinking about how business users actually leverage information.

For many years the approach has been to focus on the data schema and try to create a single consistent view for all parts of a company. The problem is that this doesn’t reflect how people actually use information in their jobs. People look to create personal views that reflect the individual challenges that they and their teams face. Thus the marketing lead for an airline puts customers at the center of their view, while the maintenance department puts aircraft at the center. Bringing disparate data sources together for the business is therefore about enabling people to create the right insight for their own problems or, to put it another way, it’s about insight at the point of action.

Governance therefore needs to focus not so much on schemas, or even data quality, but on how data sets can be combined, and therefore on the identifiers that can be used to link those data sets consistently. Data quality becomes a side effect of governance rather than its goal. This approach to governance is essential when looking at Big Data solutions. It is ridiculous to think that you can create a single schema that includes all of the internal and external data a company uses. Information from Facebook and other social media feeds is ever changing, information available from open government sources is continually being added to, and unstructured information sets such as email and documents defy any sort of traditional approach.
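To make the idea of governing identifiers rather than schemas concrete, here is a minimal sketch in Python. It is not how the Business Data Lake is implemented; the source names, record fields, master identifiers and the `link` helper are all hypothetical, chosen only to illustrate a cross-reference that links data sets without forcing them into one schema.

```python
from collections import defaultdict

# Hypothetical cross-reference: (source system, local id) -> shared master id.
# This mapping is the only thing that is centrally governed in this sketch.
XREF = {
    ("crm", "C-17"): "CUST-001",
    ("loyalty", "L-9042"): "CUST-001",
    ("crm", "C-18"): "CUST-002",
}

# Each source keeps its own shape; no common schema is imposed.
crm_records = [
    {"id": "C-17", "name": "A. Jansen"},
    {"id": "C-18", "name": "B. Smith"},
]
loyalty_records = [
    {"member": "L-9042", "tier": "gold"},
    {"member": "L-7777", "tier": "silver"},  # no cross-reference exists yet
]


def link(sources, xref):
    """Group records from several sources under their shared master id.

    `sources` is a list of (source name, key field, records) tuples.
    Records with no cross-reference entry are returned separately, which
    shows where more governance effort is needed.
    """
    linked = defaultdict(dict)
    unlinked = []
    for source, key_field, records in sources:
        for record in records:
            master_id = xref.get((source, record[key_field]))
            if master_id is None:
                unlinked.append((source, record))
            else:
                linked[master_id][source] = record
    return dict(linked), unlinked


linked, unlinked = link(
    [("crm", "id", crm_records), ("loyalty", "member", loyalty_records)],
    XREF,
)
```

Here `linked` holds each customer's records side by side in their original shapes, so a business unit can build its own view on top, while `unlinked` lists the records that cannot yet be tied to anything. In this sketch, data quality gaps surface as a by-product of linking rather than as an up-front goal.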

In the Business Data Lake we have therefore concentrated on governance from a business perspective, not from a technical IT schema perspective. This approach focuses on enabling collaboration and allowing the business to combine the various data sets within the lake to create its own local views, and from there to see where more governance and data quality are required, rather than creating a central plan that turns out to be wrong. This focus on identification and cross-referencing means that both transactional systems and post-transactional analytics can leverage the full range of information in an organization, in a managed way that aligns with the business model and value. It thus delivers on the promise of digitization, and delivers its benefits earlier than purely technology-centric approaches.

'Data Apart Together' is a key trend for businesses, and a key one for IT to recognize as a sign of how the market has changed. It's about creating the platform that helps the business bring fragmented data together for its local purposes, not about IT trying to impose a single view on information that constrains the agility of the business.

Your expert: Steve Jones  

Part of Capgemini's TechnoVision 2015 update series. See the overview here.

About the author

Ron Tolido
