Alert button
Picture for Roman Schutski

Roman Schutski

Alert button

TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning

Apr 30, 2022
Konstantin Sozykin, Andrei Chertkov, Roman Schutski, Anh-Huy Phan, Andrzej Cichocki, Ivan Oseledets

Figure 1 for TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning
Figure 2 for TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning
Figure 3 for TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning
Figure 4 for TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning

We present a novel procedure for optimization based on the combination of efficient quantized tensor train representation and a generalized maximum matrix volume principle. We demonstrate the applicability of the new Tensor Train Optimizer (TTOpt) method for various tasks, ranging from minimization of multidimensional functions to reinforcement learning. Our algorithm compares favorably to popular evolutionary-based methods and outperforms them by the number of function evaluations or execution time, often by a significant margin.

* 20 pages, 8 figures 
Viaarxiv icon

Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics

Oct 18, 2019
Taras Khakhulin, Roman Schutski, Ivan Oseledets

Figure 1 for Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics
Figure 2 for Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics
Figure 3 for Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics
Figure 4 for Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics

We propose a Reinforcement Learning based approach to approximately solve the Tree Decomposition (TD)problem. TD is a combinatorial problem, which is central to the analysis of graph minor structure and computational complexity, as well as in the algorithms of probabilistic inference, register allocation, and other practical tasks. Recently, it has been shown that combinatorial problems can be successively solved by learned heuristics. However, the majority of existing works do not address the question of the generalization of learning-based solutions. Our model is based on the graph convolution neural network (GCN) for learning graph representations. We show that the agent builton GCN and trained on a single graph using an Actor-Critic method can efficiently generalize to real-world TD problem instances. We establish that our method successfully generalizes from small graphs, where TD can be found by exact algorithms, to large instances of practical interest, while still having very low time-to-solution. On the other hand, the agent-based approach surpasses all greedy heuristics by the quality of the solution.

* 8 pages, 7 figures 
Viaarxiv icon