Skip to main content
  1. Home
  2. Phones
  3. Mobile
  4. News

HuggingSnap app serves Apple’s best AI tool, with a convenient twist

Add as a preferred source on Google
HuggingSnap recognizing contents on a table.
Nadeem Sarwar / DigitalTrends

Machine learning platform, Hugging Face, has released an iOS app that will make sense of the world around you as seen by your iPhone’s camera. Just point it at a scene, or click a picture, and it will deploy an AI to describe it, identify objects, perform translation, or pull text-based details.

Named HuggingSnap, the app takes a multi-model approach to understanding the scene around you as an input, and it’s now available for free on the App Store. It is powered by SmolVLM2, an open AI model that can handle text, image, and video as input formats.

Recommended Videos

The overarching goal of the app is to let people learn about the objects and scenery around them, including plant and animal recognition. The idea is not too different from Visual Intelligence on iPhones, but HuggingSnap has a crucial leg-up over its Apple rival.

It doesn’t require internet to work

SmolVLM2 running in an iPhone

All it needs is an iPhone running iOS 18 and you’re good to go. The UI of HuggingSnap is not too different from what you get with Visual Intelligence. But there’s a fundamental difference here.

Apple relies on ChatGPT for Visual Intelligence to work. That’s because Siri is currently not capable of acting like a generative AI tool, such as ChatGPT or Google’s Gemini, both of which have their own knowledge bank. Instead, it offloads all such user requests and queries to ChatGPT.

That requires an internet connection since ChatGPT can’t work in offline mode. HuggingSnap, on the other hand, works just fine. Moreover, an offline approach means no user data ever leaves your phone, which is always a welcome change from a privacy perspective. 

What can you do with HuggingSnap?

HuggingSnap identifying perfume bottle.
Nadeem Sarwar / DigitalTrends

HuggingSnap is powered by the SmolVLM2 model developed by Hugging Face. So, what can this model running the show behind this app accomplish? Well, a lot. Aside from answering questions based on what it sees through an iPhone’s camera, it can also process images picked from your phone’s gallery.

For example, show it a picture of any historical monument, and ask it to give you travel suggestions. It can understand the stuff appearing on a graph, or make sense of an electricity bill’s picture and answer queries based on the details it has picked up from the document.

It has a lightweight architecture and is particularly well-suited for on-device applications of AI. On benchmarks, it performs better than Google’s competing open PaliGemma (3B) model and rubs shoulders with Alibaba’s rival Qwen AI model with vision capabilities.

Running HuggingSnap app on iPhone.
Nadeem Sarwar / DigitalTrends

The biggest advantage is that it requires less system resources to run, which is particularly important in the context of smartphones. Interestingly, the popular VLC media player is also using the same SmolVLM2 model to provide video descriptions, letting users search through a video using natural language prompts.

It can also intelligently extract the most important highlight moments from a video. “Designed for efficiency, SmolVLM can answer questions about images, describe visual content, create stories grounded on multiple images, or function as a pure language model without visual inputs,” says the app’s GitHub repository.

Nadeem Sarwar
Nadeem is the Managing Editor at Digital Trends.
Apple could get a taste of sub-nanometer chips in 2029
TSMC is reportedly looking toward the sub-1nm era, with a new report pointing to a 2029 trial production target.
TSMC 12-inch silicon wafer.

Apple is often the first to the starting line when it comes to shrinking silicon, and its partnership with TSMC is a key reason behind that lead. While we are currently settling into the 2nm era, the roadmap for what comes next is already coming into focus. A new report reveals TSMC is eyeing the sub-1nm milestone with a target for trial production as early as 2029.

TSMC's silicon roadmap leading to sub-1nm chips

Read more
Casely is recalling nearly half a million power banks over a fire hazard. Here’s how to check if you’re affected
Casely Power Pod recall reissued after a fatality and an in-flight fire
casely-power-bank-recalled

If you own a power bank, you need to check if it’s a faulty model. Casely has issued a recall for about 429,200 units of the Casely Power Pods through the US Consumer Product Safety Commission. The lithium-ion battery inside can overheat and ignite, posing a serious fire and burn risk.

Why has the Casely power bank been recalled?

Read more
Chinese repair shops have apparently figured out how to fix ugly dents on iPhones
In China, dents, scratches, and all the damage in between can be wiped from your iPhone 17 Pro and Pro Max, and the results are seriously impressive.
Rear shell of iPhone 17 Pro.

With the iPhone 17 Pro and Pro Max, Apple switched back to an aluminum frame owing to the material’s better thermal conductivity and lightweight nature. While this practical change makes sense, the aluminum is softer than titanium, which also means it shows damage more readily. 

Drop it from a decent height onto a hard surface, and it will show. But here's the thing. Aluminum is also much more forgiving to repair. Unlike titanium, it responds well to skilled hands, and Chinese repair shops are making the most of this fact, doing some crazy repairs.

Read more