I'm curious about why you use chunk during the calculation of min and max, why not just calculate? And can't we quantize weights and bias of BN?
I'm curious about why you use chunk during the calculation of min and max, why not just calculate? And can't we quantize weights and bias of BN?