Skip to main content
  1. Home
  2. Computing
  3. Social Media
  4. News

Facebook opens up its image-recognition AI software to everyone

Add as a preferred source on Google

The AI research division at Facebook is open sourcing its image recognition software with the aim of advancing the tech so it can one day be applied to live video. Facebook’s DeepMask, SharpMask, and MultiPathNet software is now available to everyone on GitHub.

Facebook previously laid out its image-recognition systems in a number of research papers, which are also being made available to the public along with its demos. At present, the company’s algorithms work in conjunction with its MultiPathNet convolutional neural networks — an AI that is fed huge amounts of data until it can autonomously recognize other data — allowing Facebook to understand an image based on each pixel it contains.

Recommended Videos

In order to classify and label the objects in an image, Facebook couples its DeepMask segmentation framework with its SharpMask segment refinement module. The final stage in Facebook’s machine vision system utilizes its MultiPathNet deep learning AI to label each object in the photo.

According to Facebook, AI machine vision software has progressed in leaps and bounds over the past few years, allowing the type of image classification that didn’t even exist a short while ago. Facebook claims that open sourcing the software is critical to its advancement.

Example images scanned by Facebook's complete image-recognition system
Example images scanned by Facebook’s complete image-recognition system Image used with permission by copyright holder

Deep learning techniques are springing up all over the big blue behemoth. The AI powers Facebook’s (controversial) facial-recognition feature, manages curation on its News Feed, and is even utilized within its digital assistant for Messenger.

This isn’t the first time Facebook has open sourced its AI. In fact, the company is somewhat of a trailblazer when it comes to sharing its tech. In December, Facebook submitted its state-of-the-art computer server dedicated to AI to the Open Compute Project — a group consisting of tech giants, such as Apple and Microsoft, that share the designs of their respective computer infrastructures.

Facebook is already predicting the future use cases for the image-recognition tech. The company reveals that it could potentially help it to build upon its existing AI generated image descriptions for the visually impaired.

“Currently, visually impaired users browsing photos on Facebook only hear the name of the person who shared the photo, followed by the term “photo,” when they come upon an image in their News Feed,” writes Piotr Dollar, research scientist at Facebook AI Research (FAIR), in a blog post. “Instead we aim to offer richer descriptions, such as ‘Photo contains beach, trees, and three smiling people.’”

Additionally, Facebook claims that its next challenge is to apply its image-recognition techniques to video, “where objects are moving, interacting, and changing over time,” and even Facebook Live broadcasts. “Real-time classification could help surface relevant and important Live videos on Facebook, while applying more refined techniques to detect scenes, objects, and actions over space and time could one day allow for real-time narration,” Dollar adds.

Saqib Shah
Saqib Shah is a Twitter addict and film fan with an obsessive interest in pop culture trends. In his spare time he can be…
Sony shows off AI-touched Xperia 1 VIII camera samples. It’s an epic self-own that I can’t digest
Sony built the Xperia 1 series for people who know what a histogram looks like. Xperia Intelligence appears to have been built for everyone else, and the sample images make that tension impossible to ignore.
Sony aggressive AI photography featured.

Sony has a camera legacy that most brands, regardless of whether they make cameras or smartphones, dream of. The company rewrote what full-frame sensors could do with its Alpha series. 

That particular rendering of skin tones, that restraint with saturation, the commitment to accurate white balance; the company’s color science is precisely why cinematographers, videographers, and photographers like me, in the consumer tech space, swear by its color science and camera hardware. 

Read more
Razer’s new Blade 18 gets Arrow Lake refresh and a modest $3,999.99 starting price
For $3,999.99, you get the base model with Nvidia RTX 5070 Ti. A 5090 variant is available, too.
Razer Blade 18.

Razer has officially unveiled the 2026 Blade 18 today, and at the heart of all three configurations is an Intel Arrow Lake processor. 

I’m talking about the Core Ultra 9 290HX Plus, which features 24 cores, up to 5.5GHz clock speed (with boost), 36MB cache, and an onboard NPU that delivers up to 13 TOPS of compute power. 

Read more
Windows 11 will clean up its own driver mess so you don’t have to
Say goodbye to the nightmare of hunting down broken drivers after a bad Windows update.
Surface laptop on wooden table

It seems that Microsoft is keeping up its promise of making Windows 11 better. After introducing a new low-latency mode that speeds up app launches and an update that fixes the RAM memory leak issue, the tech giant is testing a new feature that addresses one of its most prominent problems. 

The new feature is called Cloud-Initiated Driver Recovery, and it can automatically roll back a broken driver that was pushed to your PC through Windows Update. 

Read more