Naigang Wang

Title

RSM, Manager, AI acceleration algorithm and framework

Publications

Advancing Fluorescence Light Detection and Ranging in Scattering Media with a Physics-Guided Mixture-of-Experts and Evidential Critics
- Ismail Erbas, Ferhat Demirkiran, et al.
- 2025
- NeurIPS 2025
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging
- Ismail Erbas, Vikas Pandey, et al.
- 2024
- NeurIPS 2024
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
- Mohammed Nowaz Rabbani Chowdhury, Meng Wang, et al.
- 2024
- ICML 2024
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths
- Ximeng Sun, Rameswar Panda, et al.
- 2024
- WACV 2024
Deep Compression of Pre-trained Transformer Models
- Naigang Wang, Chi-Chun Liu, et al.
- 2022
- NeurIPS 2022
A 7-nm Four-Core Mixed-Precision AI Chip with 26.2-TFLOPS Hybrid-FP8 Training, 104.9-TOPS INT4 Inference, and Workload-Aware Throttling
- Sae Kyu Lee, Ankur Agrawal, et al.
- 2021
- IEEE JSSC
4-bit quantization of LSTM-based speech recognition models
- Andrea Fasoli, Chia-Yu Chen, et al.
- 2021
- INTERSPEECH 2021
Hardware-Aware Neural Architecture Search: Survey and Taxonomy
- Hadjer Benmeziane, Kaoutar El Maghraoui, et al.
- 2021
- IJCAI 2021
RaPiD: AI Accelerator for Ultra-Low Precision Training and Inference
- Swagath Venkataramani, Vijayalakshmi Srinivasan, et al.
- 2021
- ISCA 2021
A 7nm 4-Core AI Chip with 25.6TFLOPS Hybrid FP8 Training, 102.4TOPS INT4 Inference and Workload-Aware Throttling
- Ankur Agrawal, Saekyu Lee, et al.
- 2021
- ISSCC 2021

Blog posts

Ultra-low-precision training of deep neural networks
Technical note
Naigang Wang
09 May 2019
- AI
8-bit precision for training deep learning systems
Research
Naigang Wang
03 Dec 2018
- AI
- AI Hardware

Top collaborators

Swagath Venkataramani

Principal Research Scientist, AIU Architecture and Compilers

Kaoutar El Maghraoui

Principal Research Scientist and Manager, AIU Spyre Model Enablement, AI Hardware Center

Matthew Ziegler

Principal Research Scientist

Karthik Swaminathan

Senior Research Scientist, Efficient and Resilient Systems

Patents

Mixed Precision Capable Hardware For Tuning A Machine Learning Model

Very Low Precision Floating Point Representation For Deep Learning Acceleration

Magnetic Inductor Stacks With Multilayer Isolation Layers

Laminated Magnetic Inductor Stack With High Frequency Peak Quality Factor

Stress Management For Thick Magnetic Film Inductors

Magnetic Inductor With Multiple Magnetic Layer Thicknesses

Providing Supply Voltage To A Dynamic Internal Power Supply Node

Planar Solenoid Inductors With Antiferromagnetic Pinned Cores

Resonant Clock Circuit With Magnetic Shield