Does Proprietary Data Hinder Research?

Does Proprietary Data Hinder Research?

A widely-discussed article at Newsweek about the ‘data problem in medicine’ sheds light on the fact that doctors don’t have access to data about the very medicines the prescribe. In fact, of all clinical trials, as many as half are never published, leaving doctors in the dark and patients at risk.

In the perhaps most extreme example, an antiarrhythmic drug called lorcainide was tested in the 1980s, and 9 people of the lorcanide group died vs. just one of the placebo group. This study could have prevented thousands of deaths during the decade, but was, for some reason, never published until the researchers apologized for the fact in 1993.

This is, in part, a problem of the community: journals are more likely to publish positive results as they can, well, sell more copies this way. On the other hand, manufacture of pharmaceuticals is a multi-billion dollar industry so money tends to slip into the equation when it shouldn’t.

For there is a real commercial boon in not sharing all trial data: on the surface, pharmaceutical companies can suffer greatly from the competitors if the published studies reveal the way the drugs work. On another level, there’s, also the incentive to push a drug that has swallowed a lot of money for fear of not making up for the investment. This, of course, extends beyond the pharmaceutical industry.

Proprietary data in economics

An intriguing article at Quartz acknowledges that more and more studies are based on proprietary data. In fact, from the studies published in the American Economic Review, a very prestigious economic journal, the number that use proprietary (either Government or private) data has risen from 8% in 2006 to 46% in 2014. That is, researchers have asked either the government or private companies for data, and more and more virtually unreplicable studies are being published.

Here’s the rub. Companies like Amazon, Facebook and Google, which have hoarded petabytes upon petabytes of data, have a) no real incentive to do anything noncommercial with the data; b) a negative incentive to share the data; c) but even if they do, they are likely to share it with those who would paint a rosy picture about them. A recent study on proprietary data states just that: “To obtain those data, academic economists have to develop a reputation to treat their sources nicely.” And treat them nicely they will, because such data sets are too unique and (one might imagine) too interesting to pass by.


Proprietary data, then, in some fields can be seen to be plainly bad, like in the cases of drug trials. In other fields, the effect cannot be measured, but there’s a very real danger that we’ll have a publication bias towards praising the company that released the data set to the researcher. It’s inevitably a trade-off, but for now it’s the only way scientists can access that data.

By Lauris Veips

Meta Data

Data-Driven PPC and The Benefits Of Drilling Down On The Data

Drilling Down On Big Data Running a pay per click campaign for your business, which isn’t driven by detailed metrics, offers no more than the hit-and-hope approach which a billboard in the 80’s may have ...
Mark Barrenechea

Introducing the Information Advantage

Technology. Information. Disruption. The world is moving faster than ever before at unprecedented scale. Businesses today are operating in the next industrial revolution, and the rules have changed. This is Industry 4.0. It is imposing ...
Move bot migration

MoveBot – New Data Transfer Platform

Data Transfer Platform Branded post by Movebot As cloud computing and storage continue to provide enhanced ROI to organizations, businesses are storing their data on the cloud– instead of on-premise servers. Storage migration is an ...
Patrick Joggerst

Payments Companies Will Always See ROI on Embedded Real Time Communications

ROI on Embedded Real Time Communications Without secure, real time communications applications, the financial services industry could literally come to a standstill. While transactions are driven by data, the human voice and human messaging continues ...
Signal Messenger: How to Successfully Resist Wiretapping Attempts

Signal Messenger: How to Successfully Resist Wiretapping Attempts

Successfully Resist Wiretapping Attempts Against the backdrop of events in the US, the popularity of the Signal secure messenger has grown sharply - from 6,000 to 26,000 downloads per day. This software uses strong cryptography ...

New Security Regulation – Cybersecurity Maturity Model Certification (CMMC)

Cybersecurity Maturity Model Certification Changes are on the horizon for the Department of Defense (DoD) and its contractors. Late last year, the DoD announced the Cybersecurity Maturity Model Certification (CMMC), which officially released in January. The ...