We propose a new technique, Singular Vector Canonical Correlation Analysis (SVCCA), a tool for quickly comparing two representations in a way that is both to affine transform (allowing comparison between different layers and) and fast to compute (allowing more comparisons to be calculated than previous methods). We deploy this tool to measure the intrinsic of layers, showing in some cases needless over-parameterization; probe learning dynamics throughout training, finding that networks converge final representations from the bottom up; to show where class-specific in networks is formed; and to suggest new training regimes that save computation and overfit less. Code: : //github. com/google/svcca/
Raghu et al. (Mon,) studied this question.