name: inverse
layout: true
class:

---
class: middle, inverse

.right[ [github](https://github.com/facundoq/rotational_variance) ]

***

.frontimage[ ]

.left[
#### Facundo Quiroga¹, Jordina Torrents-Barrena²,
#### Laura Lanzarini¹, Dómenec Puig-Valls²

# Measuring transformation (in)variances
# in Neural Networks

***

.foot[
[.logo[]](http://weblidi.info.unlp.edu.ar/) ¹ **III-LIDI** Instituto de Investigación y Desarrollo en Informática, **Universidad Nacional de La Plata**, *Argentina*

[.logo[]](http://deim.urv.cat/~rivi) ² **IRCV** Intelligent Robotics and Computer Vision Group, **Universitat Rovira i Virgili**, *Spain*
]
]

---
class: inverse,center

# Brain vs Rotation (180°)

---
class: inverse,center

# Brain vs Rotation (90°)

---
class: inverse,center

# Brain vs Rotation (0°)

Mercedes Sosa

---
class: inverse,center

# How do we encode
# rotations?
.left[
* In our brains?
* In a Neural Network?
]

---
class: inverse,center

# Neural Networks

.mainimage90[]

.left[
* Properties
  * Structured by layers/activations
  * Complex, distributed representations
]

---
class: inverse,center

# Neural Networks vs Rotations

.mainimage80[ ]

---

# *Invariant* Neural Networks

* A network $f$ is invariant to a set of transformations $T = [t\_1, t\_2, \dots, t\_n]$ if, $\forall x$:
$$f(t\_1(x)) = f(t\_2(x)) = \dots = f(t\_n(x))$$
* For example, if $T = [R0, R90, R180]$:

.mainimage50[ ]

---

# *Invariant* Neural Networks - Current approaches

1. Correct the rotation with another model .imagemodels[]
2. Invariant by *design* (30+ models proposed in the last few years) .imagemodels[]
3. Invariant by *training*, via data augmentation .imagemodels[]
4. Hybrids

---

# *Invariant* Neural Networks - Evaluation

.rotationvsaccuracy[
.greenbox[] *Variant* model, .graybox[] *Invariant* model
]

1. Accuracy vs rotation angle of the image
   * **Black-box** measure
   * Standard practice
2. [Lenc and Vedaldi 2014](https://arxiv.org/abs/1411.5908) measure *equivariance*
   * A generalization of *invariance*
   * Hard to use; relies on various assumptions
   * Rarely used (8 citations)

---

.floatright[ [tf playground](https://playground.tensorflow.org) ]

# Our approach - Motivation

.mainimage90[]

* Limitations of previous approaches:
  * They disregard internal representations
  * They give no insight into the efficiency of the encoding

---
class: inverse
layout: true

---

# Our approach - Motivation

.heatmap_sample[ ]

.mainimage60[]

1. Measure invariance in **each activation**
2. Invariance is hard to measure directly
   * Measure **variance** instead

---

# Proposed measure $V$ - Setup

* Given:
  * An activation function $a(x)$
  * Samples $X = [x\_1, \dots, x\_n]$
  * Transformations $T = [t\_1, \dots, t\_m]$
* Measure the variance $V$ of a single activation $a$

--

.matrix[ ]

* Activation value matrix $A$:
$$ A = \begin{bmatrix} a(t\_1(x\_1)) & \cdots & a(t\_m(x\_1)) \\\ \vdots & \ddots & \vdots \\\ a(t\_1(x\_n)) & \cdots & a(t\_m(x\_n)) \end{bmatrix} $$

--

* Two sources of variance:
  * Samples
  * Transformations

---

# Proposed measure $V$ - Definition

.matrix2[ ]

$$ A = \begin{bmatrix} a(t\_1(x\_1)) & \cdots & a(t\_m(x\_1)) \\\ \vdots & \ddots & \vdots \\\ a(t\_1(x\_n)) & \cdots & a(t\_m(x\_n)) \end{bmatrix} $$

--

* Variance over transformations (desired):
$$V\_t = \text{mean, over rows of } A \text{, of the variance across columns}$$

--

* Variance over samples (normalization):
$$V\_x = \text{mean, over columns of } A \text{, of the variance across rows}$$

--

* Normalized variance (single activation):
$$ V = \frac{V\_t}{V\_x} $$

---
class:
layout: true

---

# Results - Datasets and Networks

* We measured various datasets/networks
* Here we show only:
  * The MNIST dataset .sampleimage[ ] .sampleimage[ ]
  * A simple CNN .image70[ ]

---

# Results - Invariance per layer - mnist/simple cnn

.center[
.image60[ ]

* Lower layers have more invariance
* Variant models are similar, except in the last layers
]

---

# Results - Invariance per class - mnist/simple cnn

| | |
|:-------------------------:|:-------------------------:|
| | |
| Variant | Invariant |

* Class-conditional invariance

---
class: inverse
layout: true

---

.pull-left[
# Conclusion

* $V$ is a fine-grained, activation-level (in)variance measure
* Works with arbitrary transformations
* Works with arbitrary networks
* Gives insight into the internal invariance of Neural Networks
]
.pull-right[
# Future work

* Variants of $V$
  * Other normalizations (WIP)
  * ANOVA (WIP)
  * Information theory
* $V$ for specific layers
  * Convolutional layers (not shown)
  * Recurrent networks
* $V$ during training (WIP)
  * Transfer learning
  * Random networks
* $V$ as a library
  * PyTorch/TensorFlow/TensorBoard (WIP)
]
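
---

# Proposed measure $V$ - NumPy sketch

An illustrative sketch of $V$ for a single activation, not the released implementation; the function name `normalized_variance` and the `eps` guard against $V\_x = 0$ are additions for this example:

```python
import numpy as np

def normalized_variance(A, eps=1e-12):
    """Normalized variance V for one activation.

    A[i, j] = a(t_j(x_i)): rows index samples x_1..x_n,
    columns index transformations t_1..t_m.
    """
    A = np.asarray(A, dtype=float)
    # V_t: variance across transformations (columns), averaged over samples.
    v_t = A.var(axis=1).mean()
    # V_x: variance across samples (rows), averaged over transformations.
    v_x = A.var(axis=0).mean()
    # eps avoids division by zero for constant activations (not in the paper).
    return v_t / (v_x + eps)

# Invariant activation: each row is constant across transformations -> V ~ 0.
A_invariant = np.array([[1.0, 1.0, 1.0],
                        [2.0, 2.0, 2.0]])
print(normalized_variance(A_invariant))

# Variant activation: the value changes with the transformation -> V large.
A_variant = np.array([[0.0, 1.0, 2.0],
                      [0.5, 1.5, 2.5]])
print(normalized_variance(A_variant))
```

To measure a whole network, $V$ is computed in this way for every activation and then aggregated per layer, as in the per-layer results above.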