Identify a binary weight or binary weight and activation subnetwork within a randomly initialized network by only pruning and binarizing the network.
Why do you think that https://github.com/microsoft/Swin-Transformer is a good alternative to biprop