The Cat Within – Google Image Recognition

Google can see cats

The unofficial star of the Internet is the cat. As a running joke concerning the ease by which time can be wasted online, cat videos and the grumpy cat are recurring stars. Google has now reinjected legitimacy into this cultural meme by using visuals of cats as a flagship for its new image recognition technology which uses artificial-intelligence software to interpret what is going on in a photograph, and then produce a descriptive caption.

Although such an activity seems relatively easy for a human, the ability to separate out the shapes and colors in a picture and place them into context requires an enormous amount of processing power, not only to determine what the objects represent in a picture, but the circumstances surrounding why they are there at all.

The Cat Connection

The cat connection comes from one of the more recent Google projects, in which, in 2012, a joint Google/Stanford team showed millions of images from YouTube videos to a computer that then taught itself how to recognize cats in the images.

According to Google’s own blog, the model for image recognition comes from adapting language processing applications, such as those that would convert a French phrase into a vector representation, which could then be translated into German. Vector recognition is described in a separate Google blog as follows: “[The computer] understands that Paris and France are related the same way Berlin and Germany are (capital and country), and not the same way Madrid and Italy are. [From this] it can learn the concept of capital cities, just by reading lots of news articles — with no human supervision.” The adaptation of vector representation is then blended with image processing software to essentially determine what is a cat and also what is not a cat.

Google concedes that the process is as yet far from perfect. Many of the photographs are misinterpreted, from mildly to wildly as the computers struggle to connect what they see with what actually occurs in the human world.

However as the image of their November 17 2014 post shows, “Two pizzas sitting on top of a stove top oven” is a fair assessment of what is going on in this image.

The potential for image recognition of this sort is limitless. Some immediate uses range from describing images to the visually impaired to faster and more accurate placement on Google Maps by reading house numbers.

By Steve Prentice

Will Crump

The Key to a Successful M&A = Data

Successful M&A = Data Data is often the single point of failure for many organizations. Divestitures, privatization, leveraged buyouts, and management buyouts are all on the rise, but data too often remains an afterthought, rather ...
Mark Banfield

A Seamless Customer Experience Is Essential to Success in Today’s Digital Economy

Implement A Seamless Customer Experience The need for digital interaction has never seemed more critical than it does today. As the coronavirus continues to spread, citizens around the world are being asked to hunker down ...
David Friend

Cloud 2.0 will not be Ushered in by AWS or other Cloud Giants

Cloud 2.0 Trends Amazon, Google, and Microsoft are all pursuing similar business strategies: they want it all. ‘It,’ in this case, means the entire IT infrastructure in their cloud. Furthermore, they want you to buy ...
Sebastian Grady

Leveraging Hybrid IT Now to Power Digital Transformation 

Leveraging Hybrid IT Summary: Cloud is a dominant force in enterprise software today. Global market turbulence is forcing some companies to accelerate moving parts of IT to the cloud sooner than expected to adapt to ...
Fahim Kahn

The 5 Biggest Hybrid Cloud Management Challenges—And How to Overcome Them

Hybrid Cloud Management Challenges The benefits of the cloud—reduced costs, greater IT flexibility, and more—are well-established. But now many organizations are moving to hybrid cloud management platforms. While hybrid clouds do offer a greater level ...
Episode 4: The Power of Regulatory Compliant Cloud: A European Case Study

Episode 4: The Power of Regulatory Compliant Cloud: A European Case Study

An interview with Johan Christenson, CEO of CityNetwork With the world focusing on the big three hyperscalers, there is still room – and much necessity for – more local cloud providers who are better suited ...