VeriSilicon’s Ultra-Low Energy NPU Delivers 40+ TOPS for On-Device LLM Inference in Mobile Applications

VeriSilicon has unveiled an ultra-low-energy Neural Network Processing Unit (NPU) IP designed for on-device inference of large language models (LLMs), delivering more than 40 TOPS. The architecture targets the growing demand for generative AI on platforms such as AI phones and AI PCs while meeting stringent energy-efficiency requirements. The NPU supports mixed-precision computation, sparsity optimization, and parallel processing, improving memory management and reducing latency for responsive AI applications. It runs a range of AI algorithms and models, including Stable Diffusion and LLaMA-7B, and integrates with VeriSilicon’s other processing IPs to form complete AI solutions. The NPU also supports popular AI frameworks such as TensorFlow Lite, ONNX, and PyTorch, easing deployment. VeriSilicon’s focus on ultra-low-energy NPU development positions the company as a key player in the evolution of mobile devices into personal AI servers.
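The announcement does not document VeriSilicon’s toolchain, but since ONNX and PyTorch support are named, a typical deployment path would start with exporting a model to ONNX for an NPU vendor’s compiler to consume. The following is a minimal sketch of that generic first step; the module, input shape, and output path are hypothetical placeholders, not part of VeriSilicon’s actual flow.

```python
# Illustrative sketch only: a generic PyTorch -> ONNX export, a common first
# step when targeting an NPU through an ONNX-based toolchain. All names here
# (TinyBlock, tiny_block.onnx) are hypothetical stand-ins.
import torch
import torch.nn as nn

class TinyBlock(nn.Module):
    """Stand-in module; a real workload such as LLaMA-7B would be loaded instead."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(64, 64)

    def forward(self, x):
        return torch.relu(self.proj(x))

model = TinyBlock().eval()
example_input = torch.randn(1, 64)

# Export to ONNX; a vendor compiler would typically consume this file and
# apply its own passes (e.g., mixed-precision quantization, sparsity).
torch.onnx.export(
    model,
    example_input,
    "tiny_block.onnx",  # hypothetical output path
    input_names=["input"],
    output_names=["output"],
    opset_version=17,
)
```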
