Big Data And Global Trend Monitoring: The Language Gap

Big Data and Global Trend Monitoring: The Language Gap

Data traces left by humans both on the internet and elsewhere allow marketers to find trending products; organizations can detect potential health hazards and epidemies; researchers can gain valuable insights on the way ideas spread; and so on. However, all of this depends on that they have access to a more or less complete dataset for the places or topics they are interested in.

Problems with acquiring fuller datasets

For one industry ‘trend’ is the operative word. Fashion, of course. Giants like Editd and WGSN are scaling the heights in fashion data analytics, empowering retailers to better control their prices (a practice that may very well translate to ‘less discounts’) and control their inventory. On top of having billions of data points on inventory, its past performance and prices from countless web stores, their Social Monitor feature delivers real-time updates aggregated from more than 800,000 fashion experts and influencers.

The inventory data points give real power to fashion experts across the globe. As for the Social Monitor, though, one might ask: how many languages does it speak? That is, it might be prudent to guess that their social trend insights are bound steadfastly to the English-tweeting world. If this doesn’t alarm you (it probably shouldn’t), implications of a graver nature following:

In an intriguing article on Foreign Policy, Kalev Leetaru states how HealthMap, the global, news-powered powered disease mapping project, actually missed the Ebola outbreak by a day after the World Health Organization was notified. The problem? “Ils ne parlent pas le français.” They [Big Data] don’t speak French. In many countries one or two local newspapers are published on the internet in English, and only minuscule parts of the rest are translated. Most never are.

HealthMap 

This is a serious hindrance for global Big Data initiatives of any kind. You can yield useful translations with Google Translate, but you can also get rubbish. A workaround could perhaps be found (at least in the Ebola outbreak case) in developing computer-readable standards for publishing important news concerning health hazards; but who’s to say that anyone would be willing to comply? Another workaround lies in crowdsourced translation, like in the Duolingo app where collective understanding provides accurate translations.

A simple solution doesn’t exist, but, as more and more nations join global data initiatives and develop their own, the language gap will be closer to being bridged. Some might object that the real beauty lies in that it never will.

By Lauris Veips

Episode 4: The Power of Regulatory Compliant Cloud: A European Case Study

An interview with Johan Christenson, CEO of CityNetwork With the world focusing on the big ...

Episode 6: Cloud Migration: Why It’s More Important Than Ever

The Importance of Cloud Migration Moving fully to the cloud is still a concern for ...

Episode 2: Coronavirus Phishing Emails and Work-from-Home Meetings

Coronavirus Phishing Emails What to watch out for as scammers exploit pandemic panic, and tips ...
Johan

Why the digital infrastructure is a matter of national interest!

Digital Infrastructure National Interest When the Internet was born, it promised a form of democracy and guarantee that everybody could be part and setup their ...
Marty

Digital Transformation: Adapting Your Business Online

The Age of Digital Transformation There is little doubt that the transition to cloud computing is driving an insatiable demand for digital transformation. Countless organizations ...
Steve Prentice

Episode 1: Why Small and Medium Sized Businesses Need an MSP

Small and Medium Sized Businesses Need an MSP Small and medium-sized businesses don’t enjoy the benefits of a large IT department. What should they consider ...
Data Issues DBMS

Advantages of Database Management Systems (DBMS)

What Is A Database Management System (DBMS)? (Updated: 04,22,2020) A Database Management System, or DBMS, allows its users to create, read, delete and update data within ...
Cloud Image Migration

The Best Web Migration Should Be Invisible to Your Customers

How you approach a migration of your assets to the AWS Cloud is important to getting it right When the British-bank TSB decided to migrate ...
Kokumai

History, Current Status and Future Scenarios of Expanded Password System

Future Scenarios of Expanded Password System Passwords are so hard to manage that some people are urging the removal of passwords from digital identity altogether. What ...
Steve Prentice

Episode 4: The Power of Regulatory Compliant Cloud: A European Case Study

An interview with Johan Christenson, CEO of CityNetwork With the world focusing on the big three hyperscalers, there is still room – and much necessity ...
Yuri Sagalov

IT Culture Clash Where Employees Use Multiple Devices To Collaborate

Employees use multiple devices to collaborate It used to be that company IT decision makers could simply dictate the software that business units would use ...
Anita Raj

Will there be a normal to go back to after COVID-19?

The COVID-19 Aftermath Until November last year, not one of us would have expected life to take such a dramatic turn in as short as ...
Ajay

Explainable Intelligence Part 3 – The Strategy for XAI

The Strategy for XAI It is not enough to say that something is true just because 'I know it’s true!' – we have to have ...