
I recently wrote a blog, “Interweaving Design Thinking and Data Science to Unleash Economic Value of Data”, that discussed how interweaving Design Thinking and Data Science makes our analytic efforts more effective. Our approach was validated by a recent McKinsey article titled “Fusing data and design to supercharge innovation”, which stated:

“While many organizations are investing in data and design capabilities, only those that tightly weave these disciplines together will unlock their full benefits.”

I even developed some Data Science playing cards that one could use to help guide this Design Thinking-Data Science interweaving process (see Figure 1).

Figure 1: The Design Thinking-Data Science Winning Hand

And while I wholeheartedly believe that Design Thinking and Data Science are two synergistic disciplines for accelerating the creation of more effective, more predictive, more relevant analytic models, they really only address the first part of the Data Science development process – analytic model development.

The purpose of this blog is to discuss the critically important role of DevOps in driving the second part of the Data Science development process – analytic model operationalization and monetization.  

Let’s jump into it…

Data Science and Analytic Model Development

I discussed in the blog “Why Is Data Science Different than Software Development?” how the data science and software development approaches are fundamentally different because Software Development Defines Criteria for Success; Data Science Discovers It.

As Data Science moves into the mainstream of more organizations, Product Development needs to understand that the process for developing analytic models is different from the process for developing software. While they share many of the same foundational capabilities (e.g., strong team alignment, clearly defined roles, mastering version control, a regular communications rhythm), the data science development process has some unique requirements, such as:

·     The Presence of Data to Work With

·     A Collaborative Hypothesis Development Process

·     Data Exploration and Discovery Curiosity

·     Mastering the Art of Failure

·     Understanding When “Good Enough” is “Good Enough”

·     Embracing a Continuous Learning/Retuning Process

The Data Science analytic model development journey is fraught with unknowns. The “known unknowns” and “unknown unknowns” only surface as the data science team moves along that journey. As in the original movie “Jason and the Argonauts”, a good data science team must be prepared for whatever evil monsters appear and adjust accordingly (see Figure 2).

Figure 2: Why Data Science is Different Than Software Development

But the operationalization (and ultimately the monetization) of the analytic models requires close collaboration between the Data Science and Software Development / DevOps teams.

Data Science and Analytic Model Operationalization

The operationalization of the analytic models requires the integration of the Data Science and Software Development processes, and this integration occurs around the organization’s packaged and re-usable analytic modules (see Figure 3).

Figure 3: Integrating Analytics Development and Software Development

In the blog “Driving #AI Revolution with Pre-built Analytic Modules”, I asked, “What is the Intelligence Revolution equivalent to the ¼” bolt?” One of the key aspects of the Industrial Revolution was the creation of standardized parts – like the ¼” bolt – that could be used to assemble solutions rather than hand-craft them. So, what is the ¼” bolt equivalent for the AI Revolution? I think the answer is packaged, reusable and extensible Analytic Modules (see Figure 4)!

Figure 4: Packaged Analytic Modules

Analytic Modules form the basis for accelerating time-to-value and de-risking analytics projects because they enable the sharing, re-use and refinement of the organization’s key analytic capabilities. They also address one of the greatest destroyers of the economic value of data and analytics – orphaned analytics.
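To make the idea concrete, here is a minimal sketch of what a packaged, re-usable Analytic Module might look like in Python. This is my own illustration, not an implementation from the blog: the class name, fields and the scikit-learn estimator are all assumptions.

```python
# A minimal sketch of a packaged, re-usable Analytic Module.
# The class name, fields and interface are hypothetical illustrations.
from dataclasses import dataclass, field
from typing import Any, Dict

from sklearn.base import BaseEstimator
from sklearn.linear_model import LogisticRegression


@dataclass
class AnalyticModule:
    """Wraps a model with the metadata needed to share, re-use and refine it."""
    name: str
    version: str
    estimator: BaseEstimator
    metadata: Dict[str, Any] = field(default_factory=dict)

    def fit(self, X, y):
        self.estimator.fit(X, y)
        return self

    def predict(self, X):
        return self.estimator.predict(X)


# Package a (hypothetical) churn model once, then re-use it across use cases.
churn_module = AnalyticModule(
    name="customer_churn_propensity",
    version="1.2.0",
    estimator=LogisticRegression(max_iter=1000),
    metadata={"owner": "data-science-team", "domain": "retention"},
)
```

The point of the wrapper is that the module – not the one-off notebook – becomes the unit of sharing, versioning and refinement.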

Challenge of Orphaned Analytics

In our University of San Francisco research paper, “Applying Economic Concepts to Determine the Financial (Economic) Value of Your Data”, several companies complained about the curse of “orphaned analytics”: one-off analytics developed to address a specific business need but never “operationalized” for re-use across the organization.

Unfortunately, many organizations lack an overarching model to ensure that the resulting analytics and associated organizational intellectual capital can be captured and re-used across multiple use cases. Without this overarching analytics framework, organizations end up playing a game of analytics “whack-a-mole”, where the analytics team focuses its precious resources on immediate (urgent) problems, short-changing the larger, more strategic (important) analytic opportunities[1].

If you can’t package, share, re-use and refine your analytic models, then your organization misses the chance to exploit the almost famous “Schmarzo Economic Digital Asset Valuation Theorem” – the ability to leverage data and analytics to simultaneously drive down marginal costs while accelerating the economic value creation of digital assets, as explained in Figure 5.

Figure 5: Schmarzo Economic Digital Asset Valuation Theorem

See the blog “Why Tomorrow’s Leaders MUST Embrace the Economics of Digital Transformation” for more details on the three economic effects that result from the sharing, re-use and refinement of the organization’s data and analytics digital assets.

Integrating Data Science and DevOps to Avoid Orphaned Analytics

As noted above, the operationalization of the analytic models requires the integration of the Data Science and software development (DevOps) processes. DevOps is a set of software development practices that combines software development (Dev) and information technology operations (Ops) to shorten the systems development life cycle while delivering features, fixes, and updates frequently and in close alignment with business objectives[2] (see Figure 6).

Figure 6: Operationalizing Analytic Modules

The first half of Figure 6 covers the non-linear data science (analytic model development) process, where the data science team tries different combinations of data, analytic algorithms, data enrichment and feature engineering techniques to create analytic models that are “good enough” given the required analytic accuracy and goodness-of-fit metrics. See the blog “Interweaving Design Thinking and Data Science to Unleash Economic Value of Data” for more details on how Design Thinking supports the Data Science development process.
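As a hedged sketch of that “good enough” search – the candidate models, metric and threshold below are all assumptions for illustration, not prescriptions from the blog – the loop stops as soon as a model clears the required goodness-of-fit bar:

```python
# A minimal sketch of the non-linear "good enough" model search.
# The candidates, metric and threshold are hypothetical.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

GOOD_ENOUGH_AUC = 0.85  # assumed business-driven accuracy requirement

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "random_forest": RandomForestClassifier(n_estimators=200, random_state=42),
}

for name, model in candidates.items():
    auc = cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()
    print(f"{name}: mean cross-validated AUC = {auc:.3f}")
    if auc >= GOOD_ENOUGH_AUC:
        print(f"{name} is good enough – stop searching and operationalize.")
        break
```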

However, the second half of Figure 6 is focused on the more traditional software development (DevOps) cycle, a linear process that supports the scaling, operationalization and, ultimately, monetization of the analytics. The analytic modules must be treated and managed as intellectual property (IP) or software assets, complete with version control, check-in/check-out, and regression testing of analytic module modifications. Organizations must develop the ability to track and maintain model lineage and metadata in order to answer questions such as “What data was used to train this model?” and “Which libraries were used in producing these scores?”
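Here is a minimal sketch of how such lineage metadata might be captured, assuming the training data lives in a file; the function name, fields and file layout are hypothetical, not a reference implementation:

```python
# A minimal sketch of recording model lineage so the team can later answer
# "What data was used to train this model?" and "Which libraries were used?"
# The function name, fields and file layout are hypothetical.
import hashlib
import json
import platform
from datetime import datetime, timezone

import sklearn


def record_lineage(model_name, version, training_file, path="lineage.json"):
    # Fingerprint the training data so it can be identified later.
    with open(training_file, "rb") as f:
        data_hash = hashlib.sha256(f.read()).hexdigest()
    lineage = {
        "model": model_name,
        "version": version,
        "trained_at": datetime.now(timezone.utc).isoformat(),
        "training_data_sha256": data_hash,    # what data trained this model?
        "python": platform.python_version(),  # which runtime produced the scores?
        "scikit_learn": sklearn.__version__,  # which libraries were used?
    }
    with open(path, "w") as f:
        json.dump(lineage, f, indent=2)
    return lineage
```

Writing the lineage record at training time, alongside the model artifact itself, is what makes the “what trained this?” question answerable months later.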

It is at this point that Effect #3 of the “Economic Digital Asset Valuation Theorem” – Economic Value of Digital Assets Accelerates – kicks in (see Figure 7).

Figure 7: Effect #3: Economic Value of Digital Assets Accelerates

Figure 7 highlights how the cumulative economic value of the analytic assets accelerates through the continuous refinement of those assets. The economic value of analytic models accelerates because the refinement (via analytic module continuous learning and improvement) of one analytic module lifts the value of all associated use cases. See the blog “Economic Value of Learning and Why Google Open Sourced TensorFlow” to learn how leading digital companies like Google are exploiting the economics of digital assets to accelerate the creation of value.
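As a toy, back-of-the-envelope illustration of that lift – every dollar figure and the improvement percentage below are hypothetical – one refinement to a shared module raises the value of all three use cases built on it:

```python
# Toy illustration: one shared-module refinement lifts every dependent use case.
# All numbers are hypothetical.
use_case_values = [100_000, 250_000, 175_000]  # assumed annual value per use case
lift_from_refinement = 0.05  # assumed 5% improvement in the shared module

baseline = sum(use_case_values)
after_refinement = sum(v * (1 + lift_from_refinement) for v in use_case_values)

print(f"Baseline value across use cases:    ${baseline:,.0f}")
print(f"After one shared-module refinement: ${after_refinement:,.0f}")
# One refinement effort, three use cases lifted: the economics compound as
# more use cases are built on the same shared module.
```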

I will be exploring the Data Science-DevOps relationship in more detail in future blogs, as well as exploring the “Digital Asset Value Chain” from DataOps to Data Science to DevOps. Watch this space for more details!

[1] See the blog “How to Avoid Orphaned Analytics” for more details on the challenge of orphaned analytics.

[2] Source: Wikipedia, “DevOps”: https://en.wikipedia.org/wiki/DevOps

Bill Schmarzo

CTO, IoT and Analytics at Hitachi Vantara (aka “Dean of Big Data”)

Bill Schmarzo is the author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”. He has written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow, where he teaches the “Big Data MBA” course. Bill also recently completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as the #4 Big Data influencer worldwide.

Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata.

Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications.

Bill holds a Master of Business Administration from the University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.
