Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Luca Franceschi

Learning Discrete Structures for Graph Neural Networks

May 17, 2019
Luca Franceschi, Mathias Niepert, Massimiliano Pontil, Xiao He

Figure 1 for Learning Discrete Structures for Graph Neural Networks

Figure 2 for Learning Discrete Structures for Graph Neural Networks

Figure 3 for Learning Discrete Structures for Graph Neural Networks

Figure 4 for Learning Discrete Structures for Graph Neural Networks

Graph neural networks (GNNs) are a popular class of machine learning models whose major advantage is their ability to incorporate a sparse and discrete dependency structure between data points. Unfortunately, GNNs can only be used when such a graph-structure is available. In practice, however, real-world graphs are often noisy and incomplete or might not be available at all. With this work, we propose to jointly learn the graph structure and the parameters of graph convolutional networks (GCNs) by approximately solving a bilevel program that learns a discrete probability distribution on the edges of the graph. This allows one to apply GCNs not only in scenarios where the given graph is incomplete or corrupted but also in those where a graph is not available. We conduct a series of experiments that analyze the behavior of the proposed method and demonstrate that it outperforms related methods by a significant margin.

* To appear as a conference paper at ICML 2019, code at https://github.com/lucfra/LDS

Via

Access Paper or Ask Questions

Fast and Continuous Foothold Adaptation for Dynamic Locomotion through CNNs

Feb 15, 2019
Octavio Villarreal, Victor Barasuol, Marco Camurri, Luca Franceschi, Michele Focchi, Massimiliano Pontil, Darwin G. Caldwell, Claudio Semini

Figure 1 for Fast and Continuous Foothold Adaptation for Dynamic Locomotion through CNNs

Figure 2 for Fast and Continuous Foothold Adaptation for Dynamic Locomotion through CNNs

Figure 3 for Fast and Continuous Foothold Adaptation for Dynamic Locomotion through CNNs

Figure 4 for Fast and Continuous Foothold Adaptation for Dynamic Locomotion through CNNs

Legged robots can outperform wheeled machines for most navigation tasks across unknown and rough terrains. For such tasks, visual feedback is a fundamental asset to provide robots with terrain-awareness. However, robust dynamic locomotion on difficult terrains with real-time performance guarantees remains a challenge. We present here a real-time, dynamic foothold adaptation strategy based on visual feedback. Our method adjusts the landing position of the feet in a fully reactive manner, using only on-board computers and sensors. The correction is computed and executed continuously along the swing phase trajectory of each leg. To efficiently adapt the landing position, we implement a self-supervised foothold classifier based on a Convolutional Neural Network (CNN). Our method results in an up to 200 times faster computation with respect to the full-blown heuristics. Our goal is to react to visual stimuli from the environment, bridging the gap between blind reactive locomotion and purely vision-based planning strategies. We assess the performance of our method on the dynamic quadruped robot HyQ, executing static and dynamic gaits (at speeds up to 0.5 m/s) in both simulated and real scenarios; the benefit of safe foothold adaptation is clearly demonstrated by the overall robot behavior.

* 9 pages, 11 figures. Accepted to RA-L + ICRA 2019, January 2019

Via

Access Paper or Ask Questions

Bilevel Programming for Hyperparameter Optimization and Meta-Learning

Jul 03, 2018
Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, Massimilano Pontil

Figure 1 for Bilevel Programming for Hyperparameter Optimization and Meta-Learning

Figure 2 for Bilevel Programming for Hyperparameter Optimization and Meta-Learning

Figure 3 for Bilevel Programming for Hyperparameter Optimization and Meta-Learning

Figure 4 for Bilevel Programming for Hyperparameter Optimization and Meta-Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by taking into explicit account the optimization dynamics for the inner objective. Depending on the specific setting, the outer variables take either the meaning of hyperparameters in a supervised learning problem or parameters of a meta-learner. We provide sufficient conditions under which solutions of the approximate problem converge to those of the exact problem. We instantiate our approach for meta-learning in the case of deep learning where representation layers are treated as hyperparameters shared across a set of training episodes. In experiments, we confirm our theoretical findings, present encouraging results for few-shot learning and contrast the bilevel approach against classical approaches for learning-to-learn.

* ICML 2018; code for replicating experiments at https://github.com/prolearner/hyper-representation, main package (Far-HO) at https://github.com/lucfra/FAR-HO

Via

Access Paper or Ask Questions

Far-HO: A Bilevel Programming Package for Hyperparameter Optimization and Meta-Learning

Jun 13, 2018
Luca Franceschi, Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo, Paolo Frasconi

Figure 1 for Far-HO: A Bilevel Programming Package for Hyperparameter Optimization and Meta-Learning

Figure 2 for Far-HO: A Bilevel Programming Package for Hyperparameter Optimization and Meta-Learning

In (Franceschi et al., 2018) we proposed a unified mathematical framework, grounded on bilevel programming, that encompasses gradient-based hyperparameter optimization and meta-learning. We formulated an approximate version of the problem where the inner objective is solved iteratively, and gave sufficient conditions ensuring convergence to the exact problem. In this work we show how to optimize learning rates, automatically weight the loss of single examples and learn hyper-representations with Far-HO, a software package based on the popular deep learning framework TensorFlow that allows to seamlessly tackle both HO and ML problems.

* This submission is a reduced version of (Franceschi et al., arXiv:1806.04910) which has been accepted at the main ICML 2018 conference. In this paper we illustrate the software framework, material that could not be included in the conference paper

Via

Access Paper or Ask Questions

A Bridge Between Hyperparameter Optimization and Larning-to-learn

Feb 04, 2018
Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil

Figure 1 for A Bridge Between Hyperparameter Optimization and Larning-to-learn

Figure 2 for A Bridge Between Hyperparameter Optimization and Larning-to-learn

Figure 3 for A Bridge Between Hyperparameter Optimization and Larning-to-learn

Figure 4 for A Bridge Between Hyperparameter Optimization and Larning-to-learn

We consider a class of a nested optimization problems involving inner and outer objectives. We observe that by taking into explicit account the optimization dynamics for the inner objective it is possible to derive a general framework that unifies gradient-based hyperparameter optimization and meta-learning (or learning-to-learn). Depending on the specific setting, the variables of the outer objective take either the meaning of hyperparameters in a supervised learning problem or parameters of a meta-learner. We show that some recently proposed methods in the latter setting can be instantiated in our framework and tackled with the same gradient-based algorithms. Finally, we discuss possible design patterns for learning-to-learn and present encouraging preliminary experiments for few-shot learning.

* NIPS 2017 workshop on Meta-learning (http://metalearning.ml/)

Via

Access Paper or Ask Questions

Forward and Reverse Gradient-Based Hyperparameter Optimization

Dec 12, 2017
Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil

Figure 1 for Forward and Reverse Gradient-Based Hyperparameter Optimization

Figure 2 for Forward and Reverse Gradient-Based Hyperparameter Optimization

Figure 3 for Forward and Reverse Gradient-Based Hyperparameter Optimization

Figure 4 for Forward and Reverse Gradient-Based Hyperparameter Optimization

We study two procedures (reverse-mode and forward-mode) for computing the gradient of the validation error with respect to the hyperparameters of any iterative learning algorithm such as stochastic gradient descent. These procedures mirror two methods of computing gradients for recurrent neural networks and have different trade-offs in terms of running time and space requirements. Our formulation of the reverse-mode procedure is linked to previous work by Maclaurin et al. [2015] but does not require reversible dynamics. The forward-mode procedure is suitable for real-time hyperparameter updates, which may significantly speed up hyperparameter optimization on large datasets. We present experiments on data cleaning and on learning task interactions. We also present one large-scale experiment where the use of previous gradient-based methods would be prohibitive.

* Franceschi, L., Donini, M., Frasconi, P. & Pontil, M.. (2017). Forward and Reverse Gradient-Based Hyperparameter Optimization. Proceedings of the 34th International Conference on Machine Learning, in PMLR 70:1165-1173
* - Posted the ICML Camera Ready version. - Added a link to a newer package implementation of the algorithms

Via

Access Paper or Ask Questions