Does Proprietary Data Hinder Research?

A widely discussed Newsweek article about the ‘data problem in medicine’ highlights the fact that doctors don’t have access to data about the very medicines they prescribe. In fact, as many as half of all clinical trials are never published, leaving doctors in the dark and patients at risk.

In perhaps the most extreme example, an antiarrhythmic drug called lorcainide was tested in the 1980s, and nine people in the lorcainide group died versus just one in the placebo group. This study could have prevented thousands of deaths during the decade, but it was, for some reason, not published until 1993, when the researchers apologized for the delay.

This is, in part, a problem of the community: journals are more likely to publish positive results because, well, they can sell more copies that way. On the other hand, pharmaceutical manufacturing is a multi-billion-dollar industry, so money tends to slip into the equation when it shouldn’t.

For there is a real commercial boon in not sharing all trial data: on the surface, pharmaceutical companies can lose ground to competitors if published studies reveal how their drugs work. On another level, there’s also the incentive to push a drug that has swallowed a lot of money, for fear of not recouping the investment. This, of course, extends beyond the pharmaceutical industry.

Proprietary data in economics

An intriguing article at Quartz notes that more and more studies are based on proprietary data. Of the studies published in the American Economic Review, a very prestigious economics journal, the share that use proprietary (either government or private) data has risen from 8% in 2006 to 46% in 2014. That is, researchers have had to ask either the government or private companies for data, and more and more virtually unreplicable studies are being published.

Here’s the rub. Companies like Amazon, Facebook and Google, which have hoarded petabytes upon petabytes of data, have a) no real incentive to do anything noncommercial with the data and b) a negative incentive to share it; and c) even if they do share it, they are likely to share it with those who would paint a rosy picture of them. A recent study on proprietary data states just that: “To obtain those data, academic economists have to develop a reputation to treat their sources nicely.” And treat them nicely they will, because such data sets are too unique and (one might imagine) too interesting to pass up.

Conclusions

Proprietary data, then, can be seen as plainly bad in some fields, as in the case of drug trials. In other fields, the effect cannot be measured, but there’s a very real danger of a publication bias towards praising the company that released the data set to the researcher. It’s inevitably a trade-off, but for now it’s the only way scientists can access that data.

By Lauris Veips
