Cloud Computing Helps Decode German E. Coli Strain

Cloud Computing Helps Decode German E. Coli Strain

When a nasty strain of E. coli flooded hospitals in Germany this summer, it struck its victims with life-threatening complications far more often than most strains — and the search for an explanation began.

Over a feverish weekend after the rogue bacterium’s genome was sequenced, scientists from all over the world submitted the E. coli genome to rounds of rigorous study. Thanks to a unique Argonne-developed computer program and cloud computing testbed, researchers mapped the strain’s genes — and came a little closer to understanding the bacterium’s secrets.

A team of Argonne scientists near Chicago, Illinois, developed the Rapid Annotation using Subsystems Technology (RAST) program in 2007. The program, which is free and open to any scientist, is designed to make sense of the jumble of letters that makes up an organism’s DNA.

A genome is a long, incomprehensible string of letters in a four-letter alphabet: G, A, T, C. Sections of the string are divided into genes. Each one describes how to build a protein, and proteins build all of the parts of the cell.

If we can figure out what DNA codes for which protein, and what that protein does, then we can look at any bug and have an idea of what it can do,” explained Ross Overbeek, an Argonne computer scientist who helped design RAST.

“For example, bugs with multi-drug resistance often turn out to have little pumps that drain the drug out of the cell as fast as it comes in,” Overbeek said. “Once you know what those pumps look like, you can think about how to get around them.”

RAST matches sections of the new string with its enormous catalogue of previously sequenced genes and proteins. At the end it spits out an annotated genome with a sort of ‘Cliffs Notes’ to the organism’s probable genes and proteins.

When scientists announced they had sequenced the genome to the E. coli strain that plagued Europe on June 3, researchers from around the world began sending versions of the genome to RAST for annotation. They wanted to compare the new strain with past strains to tease out its origins and vulnerabilities.

Genomes can vary even within a strain,” Overbeek said. “You can get slightly different genomes in the same outbreak, even from the same patient. You compare genomes to see how the organism is mutating even as it’s wreaking havoc.”

RAST servers were already overwhelmed by a flush of genomes and the new requests began to pile up — reaching more than 200 genomes an hour at one point. Its operators wanted to prioritize the E. coli work, so they turned to a resource designed for just such a possibility.

Magellan is a test cloud computing project, run by the U.S. Department of Energy, designed to boost research by making additional servers available on demand for scientific computing. The program, partially funded by the Recovery Act, has two sites — one at the Argonne Leadership Computing Facility and one at the National Energy Research Scientific Computing Center in California — but is designed to give researchers across the nation access to computing power in times of need.

The Argonne team duplicated the RAST server on Magellan, rapidly increasing the available computing power. “Our system is designed to use clusters, so we engineered it so that a piece of Magellan became part of the cluster that we use for RAST,” explained Bob Olson, an Argonne computer scientist who maintains RAST.

It worked — so well that even more submissions poured in. Argonne and Virginia Tech teams worked around the clock that weekend to keep the servers running smoothly.

They found exactly what they were looking for,” Overbeek said. “The difference between this new strain and older ones came down to just a few genes. Apparently, the new strain included a combination of virulence factors present in other studied strains.”

The operation was a perfect case for Magellan, Olson said, because each genome submission is an independent problem. Simply adding more processors to handle the extra jobs is easy — unlike many other computations, which often must solve successive problems; processors must wait to start their jobs until another processor finishes.

Overbeek remembered the early days of annotating genomes in the mid-1990s, when it took four or five scientists more than a year to analyze just one genome. “Now we can spit them out in a few hours,” he said, and the team has already tested the next generation of RAST — a version so fast that it cuts the time for annotating a typical E. coli genome from eight hours to just 15 minutes.

RAST is really revolutionary,” Overbeek said. “It’s turned a problem that used to be insurmountable into one that is trivial.”

There is even an iPhone app to submit and receive genomes from RAST servers.

Developed at Argonne, RAST is funded through the U.S. National Institutes of Health and run by the Pathosystems Resource Integration Center (PATRIC) at the Virginia Bioinformatics Institute. PATRIC keeps a publicly available database of sequenced genome information.

Contribution By Louis Lerner/http://www.isgtw.org/

version of this story first appeared on the ANL website.

About CloudTweaks

Established in 2009, CloudTweaks is recognized as one of the leading authorities in connected technology information and services.

We embrace and instill thought leadership insights, relevant and timely news related stories, unbiased benchmark reporting as well as offer green/cleantech learning and consultive services around the world.

Our vision is to create awareness and to help find innovative ways to connect our planet in a positive eco-friendly manner.

In the meantime, you may connect with CloudTweaks by following and sharing our resources.

View All Articles

Sorry, comments are closed for this post.

E-Commerce Advances For Savvy Marketers

E-Commerce Advances For Savvy Marketers

Digital Marketing Platforms Advertising and marketing techniques have progressed rapidly in the last decade with both channel focus and the direction of content shifting considerably due primarily to advances in cloud technology. Gartner’s Magic Quadrant for Digital Commerce 2016 singles out a few ecommerce providers who are topping their sector in both ability to execute…

The Lighter Side Of The Cloud – Energy Battle

The Lighter Side Of The Cloud – Energy Battle

By David Fletcher Please feel free to share our comics via social media networks such as Twitter, Facebook, LinkedIn, Instagram, Pinterest. Clear attribution (Twitter example: via@cloudtweaks) to our original comic sources is greatly appreciated.

Recent Articles - Posted by
Cloud Comings and Goings

Cloud Comings and Goings

Cloud Power Amazon Web Services – the giant of cloud computing – is on track to do $10 Billion in revenue this year. Yet, rumors swirl that Apple may take a huge chunk of business away from them and Dropbox has definitely left AWS. Is something wrong at AWS? Wait, Salesforce.com (SFDC) – the granddaddy of…

Digital Transformation: Not Just For Large Enterprises Anymore

Digital Transformation: Not Just For Large Enterprises Anymore

Digital Transformation Digital transformation is the acceleration of business activities, processes, and operational models to fully embrace the changes and opportunities of digital technologies. The concept is not new; we’ve been talking about it in one way or another for decades: paperless office, BYOD, user experience, consumerization of IT – all of these were stepping…

Connecting the Digital Dots with the Internet of Things

Connecting the Digital Dots with the Internet of Things

The IoT Explosion In The Cross-Industry Category Gartner estimates that 6.4 billion Internet of Things (IoT) devices will be in use in 2016, nearly a third more than last year, and with attentive patrons such as Samsung, recently committing to invest $1.2 billion into IoT research in the US over four years, we can happily…

Using Big Data To Make Cities Smarter

Using Big Data To Make Cities Smarter

Using Big Data To Make Cities Smarter The city of the future is impeccably documented. Sensors are used to measure air quality, traffic patterns, and crowd movement. Emerging neighborhoods are quickly recognized, public safety threats are found via social networks, and emergencies are dealt with quicklier. Crowdsourcing reduces commuting times, provides people with better transportation…

10 Trending US Cities For Tech Jobs And Startups

10 Trending US Cities For Tech Jobs And Startups

10 Trending US Cities For Tech Jobs And Startups Traditionally actors headed for Hollywood while techies made a beeline for Silicon Valley. But times are changing, and with technological job opportunities expanding (Infographic), new hotspots are emerging that offer fantastic opportunities for tech jobs and startup companies in the industry. ZipRecruiter, an online recruitment and job…

Are Women Discriminated Against In The Tech Sector?

Are Women Discriminated Against In The Tech Sector?

Women Discriminated Against In Tech Sector It is no secret that the tech industry is considered sexist since most women are paid less than men; there are considerably fewer women in tech jobs; and generally men get promoted above women. Yet the irony is twofold. Firstly, there is an enormous demand for employees with skills…

4 Industries Being Transformed By The Internet of Things

4 Industries Being Transformed By The Internet of Things

Compelling IoT Industries Every year, more and more media organizations race to predict the trends that will come to shape the online landscape over the next twelve months. Many of these are wild and outlandish and should be consumed with a pinch of salt, yet others stand out for their sober and well-researched judgements. Online…

Cloud Infographic – The Data Scientist

Cloud Infographic – The Data Scientist

Data Scientist Report The amount of data in our world has been exploding in recent years. Managing big data has become an integral part of many businesses, generating billions of dollars of competitive innovations, productivity and job growth. Forecasting where the big data industry is going has become vital to corporate strategy. Enter the Data…

Digital Marketing Driven by Cloud, Big Data and IoT

Digital Marketing Driven by Cloud, Big Data and IoT

Digital Marketing Successful digital marketing campaigns are being driven largely by trending technologies, specifically the Internet of Things (IoT), Big Data, and The Cloud. These may be used for a huge number of marketing applications, from optimizing the performance of sports teams to improving science and research, even helping to aid law enforcement. Amazon Web…

How Big Data Is Influencing Web Design

How Big Data Is Influencing Web Design

How Big Data Is Influencing Web Design For all you non-techies… You’re probably wondering what big data is (I know I was….a few years back) so let’s get the definitions out of the way so we’re on the same page, okay? Big data is A LOT of data – really, it is. It is a…