IBM Plans to Reduce AI Training Time & Power Consumption with a New Processor

Endroid March 28, 2016

Researchers from the IBM T.J. Watson Research Center plans to develop a processor that will reduce the training time and power consumption of an AI.

In the paper entitled “Acceleration of Deep Neural Network Training with Resistive Cross-Point Devices,” IBM researchers Tayfun Gokmen and Yurii Vlasov wrote that training Deep Neural Networks (DNNs) proved to be time-consuming – requiring days of training – and a data center requiring thousands of machines.

DNNs can be trained to function similar to humans like playing board games such as Go. In the past two years, DNNs have demonstrated significant commercial success in speech recognition, visual object detection, pattern extraction and other capacities. Google’s AI AlphaGo, for instance, was able to defeat top human Go players.

“Training of large DNNs, however, is universally considered as time-consuming and computationally intensive task that demands datacenter-scale computational resources recruited for many days,” Gokmen and Vlasov said.

To counter this time and power consumption problems, the IBM researchers proposed a concept called resistive processing unit (RPU), a processor that can “potentially accelerate DNN training by orders of magnitude while using much less power.”

The IBM researchers added that a single RPU chip can speed up training of an AI by 30,000 times while providing power efficiency of 84,000 GigaOps/s/W.

According to Gokmen and Vlasov, with a single RPU accelerator, problems that currently require days of AI training using datacenter-size cluster with thousands of machines can be solved within hours.

“A system consisted of a cluster of RPU accelerators will be able to tackle Big Data problems with trillions of parameters that are impossible to address today…,” the IBM researchers added.

Examples of problems referred to by Gokmen and Vlasov include natural speech recognition, translating a particular language to all world languages, real-time analytics of large streams of scientific and business data, integration and analysis of data coming from a vast number of IoT (Internet of Things) sensors.

Related News

Fundamental Research Labs Raises $33M to Build Cross-Vertical AI Agents

From Meta’s Massive Offers to Anthropic’s Sky-High Valuation — Is There a Ceiling for AI?

Meta Courts AI Talent While Anthropic Eyes $170 Billion Valuation: Is the Bubble Nearing?

You may have missed

Authorities Seize BlackSuit Ransomware Gang’s Servers in International Operation

Instagram Now Requires 1,000 Followers to Use Live Feature

Truecaller’s Call-Recording Feature Won’t Work on iPhones from September 30

Fundamental Research Labs Raises $33M to Build Cross-Vertical AI Agents

Endroid got you covered 👋Get news about crypto, AI and tech straight to your mail box

Sign up to receive awesome content in your inbox, every week.

Related News

Fundamental Research Labs Raises $33M to Build Cross-Vertical AI Agents

From Meta’s Massive Offers to Anthropic’s Sky-High Valuation — Is There a Ceiling for AI?

Meta Courts AI Talent While Anthropic Eyes $170 Billion Valuation: Is the Bubble Nearing?

You may have missed

Authorities Seize BlackSuit Ransomware Gang’s Servers in International Operation

Instagram Now Requires 1,000 Followers to Use Live Feature

Truecaller’s Call-Recording Feature Won’t Work on iPhones from September 30

Fundamental Research Labs Raises $33M to Build Cross-Vertical AI Agents

Endroid got you covered 👋
Get news about crypto, AI and tech straight to your mail box