Skip to main content
  1. Home
  2. Computing
  3. News

IBM is cutting deep-learning processing times from days down to hours

Add as a preferred source on Google

Deep learning uses algorithms inspired by the way human brains operate to put computers to work on tasks too big for organic gray matter. On Monday, IBM announced that a new record for the performance of a large neural network working with a large data set.

The company’s new deep-learning software brings together more than 256 graphics processing units across 64 IBM Power systems. The speed improvements brought about by the research come as a result of better communication between the array of GPUs.

Recommended Videos

Faster GPUs provide the necessary muscle to take on the kind of large scale problems today’s deep-learning systems are capable of tackling. However, the faster the components are, the more difficult it is to ensure that they are all working together as one cohesive unit.

As individual GPUs work on a particular problem, they share their learning with the other processors that make up the system. Conventional software is not capable of keeping up with the speed of current GPU technology, which means that time is wasted as they wait around for one another’s results.

Hillery Hunter, IBM’s director of systems acceleration and memory, compared the situation to the well-known parable of the blind men and the elephant. The company’s distributed deep-learning project has resulted in an API that developers can be used in conjunction with deep-learning frameworks to scale to multiple servers, making sure that their GPUs remain synchronized.

IBM recorded image recognition accuracy of 33.8 percent on a test run using 7.5 million images from the ImageNet-22K database. The previous best-published result was 29.8 percent, which was posted by Microsoft in October 2014 — in the past, accuracy has typically edged forward at a rate of about one percent in new implementations, so an improvement of four percent is considered to be a very good result.

Crucially, IBM’s system managed to achieve this in seven hours; the process that allowed Microsoft to set the previous record took 10 days to complete.

“Speed and scalability, which means higher accuracy, means that we can quickly retrain an AI model after there is a new cyber-security hack or a new fraud situation,” Hunter told Digital Trends. “Waiting for days or weeks to retrain the model is not practical — so being able to train accurately and within hours makes a big difference.”

These massive improvements in terms of speed, combined with advances in terms of accuracy make IBM’s distributed deep-learning software a major boon for anyone working with this technology. A technical preview of the API is available now as part of the company’s PowerAI enterprise deep-learning software.

Brad Jones
Brad is an English-born writer currently splitting his time between Edinburgh and Pennsylvania. You can find him on Twitter…
Microsoft Edge is about to get more frequent updates, but don’t expect more features
Starting with Edge 152 on August 27, Microsoft is cutting its release cycle in half, with smaller but more frequent updates for Stable channel users.
Microsoft Edge illustration official

Microsoft is accelerating updates to its Edge browser, switching from a monthly release schedule to a biweekly one. The change takes effect with Edge 152, due on August 27, and puts the browser on the same cadence as Google Chrome.

More updates, not more features

Read more
What makes a laptop good for both work and entertainment?
Computer, Electronics, Laptop

This post is brought to you in paid partnership with HP.

The HP OmniBook X Flip is designed as an all‑day AI PC that adapts seamlessly from productivity to entertainment without switching devices.

Read more
Your Windows 11 PC can now natively run AI workloads, even if it lacks the Copilot+ badge
Windows 11 laptop on a table

For the better part of a year, Microsoft has been telling us that the future of AI on Windows belongs to Copilot+ PCs. If you wanted Microsoft’s most advanced local AI features, you needed a machine with a dedicated Neural Processing Unit (NPU). That was the deal. Now, Microsoft appears to be rewriting the rules.

According to updated documentation, Windows 11’s local Language Model APIs can now run on non-Copilot+ PCs, provided they have an Nvidia GeForce RTX 30-series GPU (or newer) with at least 6GB of VRAM. On the surface, this sounds like a developer-focused update. In reality, it could be one of the most significant shifts in Microsoft’s AI PC strategy since Copilot+ PCs launched last year. More importantly, it raises a question that has been lingering ever since the AI PC era began: Did we really need NPUs for all of this in the first place?

Read more