Shifting Up to a New Level of Edge AI

By Roni Sadeh

VP Technologies

CEVA, Inc.

November 22, 2021

The evolutionary phase of a technology is a familiar place, and AI, the prime technology of recent years, is no different. In AI, each new generation adds more MACs, multiple layers of quantization, this feature, that feature, all to chase improved TOPS/W.

Within this environment, we have successfully introduced AI-at-the-edge solutions to various markets through our CEVA NeuPro platform. Now users want more, and sometimes evolution alone is not enough.

At first, users emphasized ease of use to help them introduce this new AI-at-the-edge technology. But as their expertise in advanced AI techniques deepens, they want access to every possible way to build differentiation into their products and to exceed current state-of-the-art approaches by an order of magnitude. Getting there quickly isn’t possible through evolution; revolutionary improvements are necessary. Their priority has shifted from ease of use to maximal algorithmic flexibility at maximum throughput and minimal power.

Measuring Up

TOPS/W is a nice marketing number, but it’s too crude to be useful in real applications. In visual inference, for example, frames per second per watt (FPS/W) is a much more meaningful metric. The value of a good score in this context is easy to understand. Detecting a pedestrian or car ahead, or a car passing from behind, requires a quick response. There’s little time to brake or steer away, and neither action is instantaneous. An inference engine must be able to sustain a minimum of 100 FPS, and at the lowest possible power, because it is only one of many sensor/AI systems around the car. That demands much higher FPS/W for competitive power.
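To make the metric concrete, here is a minimal Python sketch of the arithmetic behind FPS/W and the 100 FPS requirement. The function names, the two engine configurations, and the 108 km/h vehicle speed are illustrative assumptions, not CEVA figures or measurements.

```python
def fps_per_watt(fps, power_watts):
    """Frames per second delivered for each watt consumed."""
    return fps / power_watts


def frame_budget_ms(fps):
    """Time available to process one frame, in milliseconds."""
    return 1000.0 / fps


def distance_per_frame_m(speed_kmh, fps):
    """Distance a vehicle travels during one frame period, in meters."""
    speed_m_per_s = speed_kmh / 3.6  # convert km/h to m/s
    return speed_m_per_s / fps


# Hypothetical engines: both reach 100 FPS, at different power draws.
engine_a = fps_per_watt(fps=100, power_watts=2.0)
engine_b = fps_per_watt(fps=100, power_watts=0.5)

if __name__ == "__main__":
    print(f"Engine A: {engine_a:.0f} FPS/W")
    print(f"Engine B: {engine_b:.0f} FPS/W")
    print(f"Frame budget at 100 FPS: {frame_budget_ms(100):.1f} ms")
    print(f"Travel per frame at 108 km/h: {distance_per_frame_m(108, 100):.2f} m")
```

Under these assumed numbers, both engines meet the frame-rate requirement but differ fourfold in FPS/W. A single frame period at 100 FPS is 10 ms, during which a car at 108 km/h travels roughly 0.3 m.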

The market opportunity is unquestionable. Automotive and telecom applications are expected to be the biggest contributors to market growth, and in automotive, intelligent imaging continues to be strong. So, incidentally, does the “many cameras” trend in mobile phones. In fact, the imaging pipeline in such cameras has started to replace conventional algorithms with neural nets for de-noising, image stabilization, super-resolution and other novel functions, all running at 60 FPS in a very constrained energy envelope.

What a Major Advance Requires

There are some interesting things happening around analog AI and spiking neural nets, but product makers don’t want to jump too far away from what they are sure can scale to volume today. That constraint still leaves a lot of algorithm potential, but now product builders want access to all those algorithms with much more flexibility to squeeze out maximum performance at minimum power.

The list of optimization possibilities is long: a wide range of quantization options; Winograd support; sparsity optimization to skip multiplications by zero; data type diversity in activations and weights across a range of bit-sizes; vector processing capability in parallel with neural multiplies; data compression to reduce loading time for weights and activations; matrix decomposition support, delivering up to a 50:1 acceleration over a reference network; and support for next-generation NN architectures, like transformers and 3D convolution.
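Two of the items in this list, sparsity optimization and Winograd support, both work by cutting the number of multiplications. The Python sketch below illustrates the idea in one dimension only. The function names and sample values are hypothetical, and the sketch makes no claim about how any particular accelerator implements these techniques.

```python
def sparse_dot(weights, activations):
    """Dot product that skips zero weights.

    Returns (result, number of multiplications actually performed).
    """
    total = 0
    multiplies = 0
    for w, a in zip(weights, activations):
        if w == 0:
            continue  # a multiply by zero contributes nothing, so skip it
        total += w * a
        multiplies += 1
    return total, multiplies


def direct_conv_f23(d, g):
    """Two outputs of a 3-tap filter over 4 inputs: 6 multiplications."""
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


def winograd_f23(d, g):
    """Same two outputs via Winograd F(2,3): 4 multiplications.

    The filter terms u0..u3 depend only on g, so they can be
    precomputed once per filter and reused for every input tile.
    """
    u0 = g[0]
    u1 = (g[0] + g[1] + g[2]) / 2
    u2 = (g[0] - g[1] + g[2]) / 2
    u3 = g[2]
    m1 = (d[0] - d[2]) * u0
    m2 = (d[1] + d[2]) * u1
    m3 = (d[2] - d[1]) * u2
    m4 = (d[1] - d[3]) * u3
    return [m1 + m2 + m3, m2 - m3 - m4]


if __name__ == "__main__":
    # Assumed sample data: three of the five weights are zero.
    print(sparse_dot([0, 2, 0, 0, 3], [5, 1, 7, 9, 2]))
    print(direct_conv_f23([1, 2, 3, 4], [1.0, 2.0, 3.0]))
    print(winograd_f23([1, 2, 3, 4], [1.0, 2.0, 3.0]))
```

With the sample data, the sparse dot product needs 2 multiplications instead of 5, and the Winograd form reproduces the direct result with 4 multiplications per tile instead of 6.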

A Call to Action

Product builders, now with more experience in AI, know what they want to build and how to build it. What they need is a platform offering all the neural net component algorithms they already understand, to construct that optimal solution for their product.

This is a dream list of algorithms and optimizations to deliver the true breakthrough capability, throughput, and low power that advanced edge AI needs. But why only a dream? Advanced product builders are no longer satisfied with incremental improvements in AI. They now expect platforms aligned with their greatly improved understanding of possibilities.

To learn more about CEVA’s work in edge AI, click HERE.

Roni Sadeh has more than 20 years of experience in processor and accelerator design, focusing in recent years on AI-related software and hardware solutions for audio, speech, and computer vision applications, and on designing the next generation of AI accelerators scalable to hundreds of TOPS.

Roni holds a B.Sc. in Aeronautical Engineering from Technion University.

Embedded Computing Design
