NoSQL databases, MapReduce & Hadoop

Big Data – Productivity, Innovation And Competitiveness

NoSQL databases, MapReduce & Hadoop

Big data refers to datasets that are so large, diverse, and fast-changing which need advanced and unique storage, management, analysis, and visualization technologies.  According to McKinsey, Big Data is “the next frontier for innovation, competition and productivity”.  The right use of Big Data can increase productivity, innovation, and competitiveness for organizations. Inhi Suh, IBM vice president of big data, stated that businesses should place a greater emphasis on analytics projects. In fact, big data analytic is an important step to extract knowledge from a huge amount of data. It is a competitive advantage for most companies.

NoSQL databases, MapReduce & Hadoop

According to Gupta and Jyoti (2014), “Big data analytics is the process of analysing big data to find hidden patterns, unknown correlations and other useful information that can be extracted to make better decisions”.Agrawal et al. (2011) described the multiple phases in the big data analysis which are Data Acquisition and Recording; Information Extraction and Cleaning; Data Integration, Aggregation, and Representation; Data Modeling and Analysis; and Interpretation. All these phases are crucial and high accuracy in each of these steps will lead to effective big data analytic. In this way, the promised benefits of big data will be achieved.

A wide variety of analytical techniques and technologies can be used to extract useful information from large collections of data. Such information helps companies to gain valuable insights to predict customer behaviour, effective marketing, increased revenue and so on. Maltby (2011) reviewed several literatures on big data analytics and introduced several techniques, such as Machine learning, Data mining, Text analytics, Crowdsourcing, Cluster analysis, Time series analysis, Network analysis, Predictive modelling, Association rule, and Regression, that can be used to extract information from a data set and transform it into an understandable structure for further use . In fact, using data analytic techniques depends on the research objectives/ questions, nature of the data, and the available technologies.

Visualization products

In addition, there are a wide variety of software products and technologies to facilitate big data analytics. EDWs, Visualization products, NoSQL databases, MapReduce & Hadoop, and cloud computing are examples of the more common technologies used in big data analytics. All these techniques and technologies cannot be used for every project or organization. Needs and potential of each organization should be evaluated in order to choosing the appropriate tools for big data analytic.

Studies indicates that data analysis is considerably more challenging than simply locating, identifying, understanding, and citing data. Many researchers believe that the most of the challenges and concerns with data is related to volume and velocity. However, a recent survey conducted by the creator of open source computational database management system on more than 100 data scientist indicates that variety of data sources (not just data volume & velocity) is the main challenge in analysing data. Furthermore, results of this study indicated that Hadoop cannot be a viable solution for some cases that require complex analytics.  It would seem that data analysis is a clear bottleneck in many applications. In line with this idea, Agrawal and his colleagues (2011) reported common challenges in big data analysis: Heterogeneity and Incompleteness of data, Scale, Timeliness, Privacy, error-handling, lack of structure, and visualization. It is recommended that the highlighted challenges should be addressed for effective data analysis.

By Mojgan Afshari

Marty

How cloud technologies improve innovation in the healthcare industry?

How cloud technologies improve innovation in the healthcare industry? The uptake of VPS hosting in the cloud within the heavily regulated healthcare industry has until ...
Kevin Ovalle Anderson Frank

How cloud-based business management can help an SMB go global

Global SMB Business Management Most companies today are familiar with the cloud; using software-as-a-service (SaaS) apps and customer relationship management (CRM) for years. However, many ...
Kokumai

Identity Assurance – Sufficient and Necessary Conditions

Identity Assurance It is not easy to define the 'sufficient condition' for describing a set of processes used to establish that a natural person is ...
Mark Kirstein

BitTitan Cloud Predictions and IT Migration Trends

IT Migration Trends The beginning of a new year is an ambitious time for people and businesses. Strategic initiatives are finalized, goals are set and ...
Jeremy Daniel

Find Competitive Advantage through AWS by Partnering With The Experts

Setting up your cloud configuration is too important to not involve the experts MediaTemple & CloudTweaks Thought Leadership Brand Series So many great business ideas ...
Steve Prentice

Episode 2: Coronavirus Phishing Emails and Work-from-Home Meetings

Coronavirus Phishing Emails What to watch out for as scammers exploit pandemic panic, and tips on how to attend meetings while working from home. Working ...
David Shearer

Looking Back – and Looking Forward to 2020

As we celebrate our thirtieth anniversary here at (ISC)², it’s incredible to look back at the changes our industry has been through. From advances in ...
Sergey lypchenko 

The Top 7 Latest DevOps Trends to Follow

DevOps Trends to Follow Awareness of the latest DevOps trends is important for companies which consider the integration of DevOps into their development processes as ...
David Gevorkian

Website Accessibility: Compliancy, Laws and Best Practices

Key to Making Your Website Accessible The internet has changed the education sector in so many ways. With e-learning, more people around the globe are ...
Kaylamatthews

What Amazon’s Kendra Means for the AI and Machine Learning Future

Amazon's Kendra Learning Future Most people feel a bit astounded when they type a query into Google and get relevant results in milliseconds. They're probably ...