The Cat Within – Google Image Recognition

Google can see cats

The unofficial star of the Internet is the cat. As a running joke concerning the ease by which time can be wasted online, cat videos and the grumpy cat are recurring stars. Google has now reinjected legitimacy into this cultural meme by using visuals of cats as a flagship for its new image recognition technology which uses artificial-intelligence software to interpret what is going on in a photograph, and then produce a descriptive caption.

Although such an activity seems relatively easy for a human, the ability to separate out the shapes and colors in a picture and place them into context requires an enormous amount of processing power, not only to determine what the objects represent in a picture, but the circumstances surrounding why they are there at all.

The Cat Connection

The cat connection comes from one of the more recent Google projects, in which, in 2012, a joint Google/Stanford team showed millions of images from YouTube videos to a computer that then taught itself how to recognize cats in the images.

According to Google’s own blog, the model for image recognition comes from adapting language processing applications, such as those that would convert a French phrase into a vector representation, which could then be translated into German. Vector recognition is described in a separate Google blog as follows: “[The computer] understands that Paris and France are related the same way Berlin and Germany are (capital and country), and not the same way Madrid and Italy are. [From this] it can learn the concept of capital cities, just by reading lots of news articles — with no human supervision.” The adaptation of vector representation is then blended with image processing software to essentially determine what is a cat and also what is not a cat.

Google concedes that the process is as yet far from perfect. Many of the photographs are misinterpreted, from mildly to wildly as the computers struggle to connect what they see with what actually occurs in the human world.

However as the image of their November 17 2014 post shows, “Two pizzas sitting on top of a stove top oven” is a fair assessment of what is going on in this image.

The potential for image recognition of this sort is limitless. Some immediate uses range from describing images to the visually impaired to faster and more accurate placement on Google Maps by reading house numbers.

By Steve Prentice

Holiday Access.png
Viral Infection Wearabletech
Data Fallout.png
Disaster Plan.png
Dana Gardner
Just as cloud computing initially seeped into organizations under the cloak of shadow IT, application programming interface (API) adoption has often followed an organic, inexact, and unaudited path. IT leaders know they’re benefiting from APIs -- ...
Derrek Schutman
Implementing Digital Capabilities Successfully Building robust digital capabilities can deliver huge benefits to Digital Service Providers (DSPs). A recent TMForum survey shows that building digital capabilities (including digitization of customer experience and operations), is the ...
Dmitry Chekalin
How Much Should a Modern Website Cost? A website is a valuable instrument for growing your business. Your website presents your brand to users. Also, it compels your prospects to become your customers. So, how ...
Manoj Kalyanaraman
Counting on the Cloud in 2022 As we close out the year and approach 2022, the new year offers an opportunity to address burgeoning activity on the horizon. The last 18 months have brought significant ...
Kamal Maggon
Mining Business Value Traditional industries like mining have been slow to adapt to changing IP technology.  Of course, coal and other mining types have adopted new technologies starting with mechanical drills powered by pistons, then ...