Pinup: Qubole – The Growth Of Big Data as a Service

Pinup: Qubole – The Growth Of Big Data as a Service

Big Data as a Service

Qubole-BDaaS

In late 2013, the growth of ‘Big Data as a Service’ (BDaaS) was being tipped as one of 2014’s standout trends. It was speculated that BDaaS was even the natural replacement of Software as a Service (SaaS) – the SaaS approach to acquiring specialised enterprise applications had proved itself to be a valid technology model, so it made sense to market analysts that big data applications would follow a similar path.

The three principal benefits of BDaaS are similar to those of SaaS. Firstly, BDaaS solutions are highly scalable; it’s an important benefit because big data can become incredibly resource hungry in a very short space of time – more so than any other current technology can. As data repositories grow in size, so does the demand for storage space – the upshot being that hosting big data applications in-house can quickly lead to scalability issues.

Secondly, BDaaS offers improved security; the nature of big data is hard to monitor and audit and small companies would face massive resource overheads of trying to secure repositories. BDaaS transfers the responsibility for security the host.

Finally, quality and level of service. Big data and related technologies are still emerging, therefore, there is a significant skills shortage in the market. It means that developing big data applications in-house is not something most companies can consider. BDaaS offers the client company a way to procure ready made big data services that will be maintained and extended by the service host.

(Chart Image Source: BigData 50)

Big-Data-Investments

A startup that has embraced BDaaS is Qubole, and the quality of their service has already earned them an excellent reputation in the market place. The company was features in Jeff Vance’s ‘Big Data 50 – The Hottest Big Data Startup’s of 2014’ and won a DataWeek award for their ‘Presto as a Service’ offering. After receiving $7 million in funding from Lightspeed Ventures and Charles River Venture their growth has been explosive; they’ve already secured clients as varied as Pinterest, MediaMath, Nextdoor and Saavn, with more big tech industry names undoubtedly set to follow.

Pinterest data engineer Mohammed Shahangian recently enthused about the product in an interview, saying “Qubole has been a huge win for us. Qubole has proven to be stable at petabyte scale and has given us 30%-60% higher throughput than Amazon EMR. It has also made it extremely easy to onboard non-technical users”. It’s a glowing endorsement.

Their feature-rich big data platform was designed by the creators of Facebook’s and Apache Hive’s big data infrastructure, meaning their cloud-based big data service offers the same advanced capabilities as those used by large big data organisations. Its features include Hadoop as a Service, an intuitive GUI, optimised Hive, improved S3 performance and managed clusters – though arguably its biggest benefit to small businesses is the auto-scaling. The feature means that Qubole actively saves clients money by “spinning up users’ clusters when a job is started, automatically scaling or contracting them based on the workload, and spinning the servers back down once the job is done”.

For more information about Qubole’s pricing structure, head to their website to find out more.

By Daniel Price

If you are an exciting new startup in the Cloud, Big data, IoT, Wearable tech space and looking to be covered in the fantastic CloudTweaks community. Drop us a line and you may just be featured in our Pinup series

About Daniel Price

Daniel is a Manchester-born UK native who has abandoned cold and wet Northern Europe and currently lives on the Caribbean coast of Mexico. A former Financial Consultant, he now balances his time between writing articles for several industry-leading tech (CloudTweaks.com & MakeUseOf.com), sports, and travel sites and looking after his three dogs.

Find out more
View All Articles

2 Responses to Pinup: Qubole – The Growth Of Big Data as a Service

  1. This technology capability will become simple and performant enough that clients will deploy their own BDaaS on public clouds, private clouds, and hybrid clouds. The need for Qubole will shrink to the longtail until even the tail will deploy themselves. Look at past app platforms like Engine Yard and the like.

  2. Jim – would love to catchup. I almost wish I could agree with you. I wish I could go back to spending more time programming query engine internals and worrying less about our service. But what I have learnt in this business (and in prior life) is that running large distributed runtimes efficiently is hard. We are doing weekly pushes of new features and bug fixes without disrupting customers. That is hard. Building adaptive strategies into software that work reasonably well for the largest to the smallest customers *autmatically* – that’s hard. Being able to optimize workloads for customers *automatically* given their runtime history – that’s hard. Dealing with AWS and Azure and GCE and their quirks – and not losing sleep over it – that’s hard. Dealing with deployments all over the world at minimum operational cost – that’s hard.

    The comparison to running a web server farm (Engineyard) is naiive. Those are stateless. They aren’t coupled/distributed in the same manner. There’s just no comparison – and again in a way – I am glad if competitors think running Hadoop or other services is like running a web farm :-)

Comic
How Secure Is Your School Campus Network?

How Secure Is Your School Campus Network?

School Networks School related networks are one of the most attacked sectors today, coming in third worldwide to healthcare and retail. Because of the ever growing threat of cybercrime, IT professionals everywhere aren’t thinking in terms of “what if our network gets attacked?” Now, they think in terms of “when will our network be attacked?”…

IBM and VMware Expand Partnership to Enable Easy Hybrid Cloud Adoption

IBM and VMware Expand Partnership to Enable Easy Hybrid Cloud Adoption

IBM and VMware Expand Partnership More than 500 new clients, including Marriott International are now running VMware software on IBM Cloud since the strategic cloud partnership was announced;Introduction of VMware Cloud Foundation on IBM Cloud helps move existing apps to the cloud within hours; More than 4,000 IBM service professionals trained to help organizations extend…

Fully Autonomous Cars: How’s It REALLY Going To Work?

Fully Autonomous Cars: How’s It REALLY Going To Work?

Pros and Cons and What the Experts Think Science fiction meets reality, and modern civilization is excitedly looking forward to the ubiquity of self-driving cars. However, an omnipresence of fully autonomous cars won’t happen as quickly as even some hopeful experts anticipate. While the autonomous car pros versus the cons race (See infographic discovered via…

The Lighter Side Of The Cloud – Bottlenecking

The Lighter Side Of The Cloud – Bottlenecking

By David Fletcher Please feel free to share our comics via social media networks such as Twitter, Facebook, LinkedIn, Instagram, Pinterest. Clear attribution (Twitter example: via @cloudtweaks) to our original comic sources is greatly appreciated.

Recent Articles - Posted by
How To Humanize Your Data (And Why You Need To)

How To Humanize Your Data (And Why You Need To)

How To Humanize Your Data The modern enterprise is digital. It relies on accurate and timely data to support the information and process needs of its workforce and its customers. However, data suffers from a likability crisis. It’s as essential to us as oxygen, but because we don’t see it, we take it for granted.…

Data Breaches: Incident Response Planning – Part 1

Data Breaches: Incident Response Planning – Part 1

Incident Response Planning – Part 1 The topic of cybersecurity has become part of the boardroom agendas in the last couple of years, and not surprisingly — these days, it’s almost impossible to read news headlines without noticing yet another story about a data breach. As cybersecurity shifts from being a strictly IT issue to…

5 Things To Consider About Your Next Enterprise File Sharing Solution

5 Things To Consider About Your Next Enterprise File Sharing Solution

Enterprise File Sharing Solution Businesses have varying file sharing needs. Large, multi-regional businesses need to synchronize folders across a large number of sites, whereas small businesses may only need to support a handful of users in a single site. Construction or advertising firms require sharing and collaboration with very large (several Gigabytes) files. Financial services…

The Age of Data: The Era of Homo Digitus

The Age of Data: The Era of Homo Digitus

The Age of Data In our digital era data deluge – soaring amounts of data, is an overriding feature. That’s why it’s fitting to focus on the concept of Homo Digitus, which I first learned about about in“The creative destruction of medicine: How the digital revolution will create better health care,” by Eric Topol, and…

Cloud Infographic – Disaster Recovery

Cloud Infographic – Disaster Recovery

Disaster Recovery Business downtime can be detrimental without a proper disaster recovery plan in place. Only 6% of businesses that experience downtime without a plan will survive long term. Less than half of all businesses that experience a disaster are likely to reopen their doors. There are many causes of data loss and downtime —…

Disaster Recovery And The Cloud

Disaster Recovery And The Cloud

Disaster Recovery And The Cloud One of the least considered benefits of cloud computing in the average small or mid-sized business manager’s mind is the aspect of disaster recovery. Part of the reason for this is that so few small and mid-size businesses have ever contemplated the impact of a major disaster on their IT…

Protecting Devices From Data Breach: Identity of Things (IDoT)

Protecting Devices From Data Breach: Identity of Things (IDoT)

How to Identify and Authenticate in the Expanding IoT Ecosystem It is a necessity to protect IoT devices and their associated data. As the IoT ecosystem continues to expand, the need to create an identity to newly-connected things is becoming increasingly crucial. These ‘things’ can include anything from basic sensors and gateways to industrial controls…

Cloud Infographic – Cloud Public, Private & Hybrid Differences

Cloud Infographic – Cloud Public, Private & Hybrid Differences

Cloud Public, Private & Hybrid Differences Many people have heard of cloud computing. There is however a tremendous number of people who still cannot differentiate between Public, Private & Hybrid cloud offerings.  Here is an excellent infographic provided by the group at iWeb which goes into greater detail on this subject. Infographic source: iWeb

15 Cloud Data Performance Monitoring Companies

15 Cloud Data Performance Monitoring Companies

Cloud Data Performance Monitoring Companies (Updated: Originally Published Feb 9th, 2015) We have decided to put together a small list of some of our favorite cloud performance monitoring services. In this day and age it is extremely important to stay on top of critical issues as they arise. These services will accompany you in monitoring…

Cloud Infographic – The Data Scientist

Cloud Infographic – The Data Scientist

Data Scientist Report The amount of data in our world has been exploding in recent years. Managing big data has become an integral part of many businesses, generating billions of dollars of competitive innovations, productivity and job growth. Forecasting where the big data industry is going has become vital to corporate strategy. Enter the Data…