Pruning

Make neural network smaller by removing synapses and neurons

주로 불필요한 가중치를 제거(0으로 만듦)해서 네트워크를 가지치기 하는 것

$$ \arg \min_{W_p} L(x; W_p) \\ \text{subject to } ||W_p||_0 < N $$

Pruning Granularity

Fine-grained / Unstructured

Coarse-grained/Structured/Pattern-based