Question 1

Is this the vendor-rated NPU peak TOPS?

Accepted Answer

No. This is effective browser-visible TOPS for the WebNN graph, including browser, driver, scheduling, and small synchronization overhead.

Question 2

How does it make sure the NPU is used?

Accepted Answer

NPU-only mode requests deviceType=npu and no longer falls back to GPU or CPU. If the browser cannot create an NPU context, the tool reports an error instead of another backend result.

Question 3

Why use matrix multiplication?

Accepted Answer

Dense matmul layers are common neural-network operators and map well to AI accelerators. The tool computes per-inference operations as 2 × batch × hidden × hidden × layers, then divides by measured execution time to report TOPS.

WebNN NPU Compute Measurement

Category

How to Use

Examples

FAQ

Related tools