Optimization Approach to a Block of Layers and Derivative Free Optimization


I present a new view to the output activations of a block of layers in deep neural networks. In particular, we view the output activation of a linear operator, convolutional or fully connected, followed by a non-linearity, and followed by another linear operator as an approximate solution to a certain convex optimization problem. We show that replacing layers with optimization layers as solvers improve performance in different settings.

Jul 7, 2019 12:00 AM — Jul 9, 2019 12:00 AM
Odessa, Ukraine
Adel Bibi
Adel Bibi
Senior Research Fellow

My research interests include machine learning, computer vision, and optimization.