WebNN NPU Compute Measurement
Estimate effective NPU compute available to the browser by running a WebNN NPU-only matrix multiplication workload
How to Use
- Choose the NPU-only backend, compute workload size, and measurement duration
- Click "Measure NPU" to build the WebNN graph and run warmup batches
- Review measured NPU TOPS, peak TOPS, sustained TOPS, average latency, and iteration count
- NPU-only mode does not fall back to GPU or CPU; if NPU context creation fails, no NPU compute result is reported
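The no-fallback behavior in the last step can be sketched as follows. This is a minimal illustration, not the tool's actual code: `createNpuOnlyContext`, `MLLike`, and `NpuContextResult` are hypothetical names, and the only WebNN call assumed is `createContext` with a `deviceType` option, which depends on the browser's WebNN support. The `ml` object is passed in (for example `navigator.ml` in a browser) so the function can be exercised without a real NPU.

```typescript
// Minimal structural type for the part of WebNN used here, so the sketch
// does not depend on any particular type-definition package.
interface MLLike {
  createContext(options: { deviceType: "cpu" | "gpu" | "npu" }): Promise<unknown>;
}

// Hypothetical result shape: either a context or an error message.
type NpuContextResult =
  | { ok: true; context: unknown }
  | { ok: false; error: string };

// Request an NPU context and report failure instead of trying another device.
async function createNpuOnlyContext(ml: MLLike | undefined): Promise<NpuContextResult> {
  if (!ml) {
    return { ok: false, error: "WebNN (navigator.ml) is not available" };
  }
  try {
    const context = await ml.createContext({ deviceType: "npu" });
    return { ok: true, context };
  } catch (e) {
    // No retry with "gpu" or "cpu": a failed NPU request stays a failure.
    const reason = e instanceof Error ? e.message : String(e);
    return { ok: false, error: `NPU context creation failed: ${reason}` };
  }
}
```

Returning a tagged result rather than throwing makes it straightforward for a caller to show an error message and skip the measurement entirely when `ok` is false.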
Examples
- NPU-only compute measurement
Input: Backend: NPU Only | Workload: Compute | Duration: 20s
Output: Measured NPU Compute 8.42 TOPS | Peak 9.10 TOPS | Sustained 8.01 TOPS
- CPU/GPU baseline
Input: Backend: CPU or GPU Baseline | Workload: Balanced | Duration: 10s
Output: Compare against NPU-only mode to see browser scheduling differences
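The Duration input in these examples bounds a timed run that follows the warmup batches. A minimal sketch of such a loop is shown below, assuming a hypothetical `runBatch` callback that executes one batch on the chosen backend. The names `measure` and `Measurement` are illustrative, and the tool's real loop, including how it derives peak and sustained TOPS, is not described on this page, so only iteration count and average latency are computed here.

```typescript
interface Measurement {
  iterations: number;
  elapsedMs: number;
  averageLatencyMs: number;
}

// Run `runBatch` a few times untimed (warmup), then repeatedly until at least
// `durationMs` has elapsed, and summarize the timed part.
// `now` is injectable so the loop can be tested with a fake clock; in a
// browser, performance.now() would give finer resolution than Date.now().
async function measure(
  runBatch: () => Promise<void>,
  warmupBatches: number,
  durationMs: number,
  now: () => number = () => Date.now(),
): Promise<Measurement> {
  for (let i = 0; i < warmupBatches; i++) {
    await runBatch(); // untimed warmup batch
  }
  const start = now();
  let iterations = 0;
  let elapsedMs = 0;
  do {
    await runBatch();
    iterations++;
    elapsedMs = now() - start;
  } while (elapsedMs < durationMs);
  return { iterations, elapsedMs, averageLatencyMs: elapsedMs / iterations };
}
```

The loop always runs at least one timed batch, so `averageLatencyMs` is never a division by zero even for very short durations.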
FAQ
- Is this the vendor-rated NPU peak TOPS?
- No. This is effective browser-visible TOPS for the WebNN graph, including browser, driver, scheduling, and small synchronization overhead.
- How does it make sure the NPU is used?
- NPU-only mode requests deviceType=npu and no longer falls back to GPU or CPU. If the browser cannot create an NPU context, the tool reports an error instead of another backend result.
- Why use matrix multiplication?
- Dense matmul layers are common neural-network operators and map well to AI accelerators. The tool computes operations per inference as 2 × batch × hidden × hidden × layers, where the factor 2 counts each multiply-accumulate as one multiply plus one add, then divides the operations executed by the measured execution time to report TOPS.
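The formula in the last answer can be worked through in a few lines. The sketch below is illustrative only: `opsPerInference` and `effectiveTops` are hypothetical names, and the batch, hidden, layer, iteration, and timing values are made-up inputs chosen to keep the arithmetic easy to follow, not figures from the tool.

```typescript
// Operations for one inference through a stack of square dense layers:
// each layer multiplies a (batch x hidden) input by a (hidden x hidden)
// weight matrix, and each multiply-accumulate counts as 2 operations.
function opsPerInference(batch: number, hidden: number, layers: number): number {
  return 2 * batch * hidden * hidden * layers;
}

// Convert an operation count and elapsed wall-clock time into TOPS
// (tera-operations per second, 1 TOPS = 1e12 operations per second).
function effectiveTops(totalOps: number, elapsedSeconds: number): number {
  if (elapsedSeconds <= 0) {
    throw new RangeError("elapsedSeconds must be positive");
  }
  return totalOps / elapsedSeconds / 1e12;
}

// Hypothetical workload: batch 64, hidden size 4096, 8 layers,
// 1000 inferences completed in 2.0 seconds.
const ops = opsPerInference(64, 4096, 8);
const tops = effectiveTops(ops * 1000, 2.0);
console.log(ops, tops.toFixed(3)); // prints: 17179869184 8.590
```

Because the elapsed time is wall-clock time around the whole run, any browser, driver, or scheduling overhead lowers the result, which is why the figure is an effective rather than vendor-rated number.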