Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yinan Zhang

PuzzleFlex: kinematic motion of chains with loose joints

Jun 20, 2019
Samuel Lensgraf, Karim Itani, Yinan Zhang, Zezhou Sun, Yijia Wu, Alberto Quattrini Li, Bo Zhu, Emily Whiting, Weifu Wang, Devin Balkcom

Figure 1 for PuzzleFlex: kinematic motion of chains with loose joints

Figure 2 for PuzzleFlex: kinematic motion of chains with loose joints

Figure 3 for PuzzleFlex: kinematic motion of chains with loose joints

Figure 4 for PuzzleFlex: kinematic motion of chains with loose joints

This paper presents a method of computing free motions of a planar assembly of rigid bodies connected by loose joints. Joints are modeled using local distance constraints, which are then linearized with respect to configuration space velocities, yielding a linear programming formulation that allows analysis of systems with thousands of rigid bodies. Potential applications include analysis of collections of modular robots, structural stability perturbation analysis, tolerance analysis for mechanical systems,and formation control of mobile robots.

Via

Access Paper or Ask Questions

Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Mar 19, 2019
Yong Liu, Yinan Zhang, Qiong Wu, Chunyan Miao, Lizhen Cui, Binqiang Zhao, Yin Zhao, Lu Guan

Figure 1 for Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Figure 2 for Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Figure 3 for Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Figure 4 for Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Interactive recommendation that models the explicit interactions between users and the recommender system has attracted a lot of research attentions in recent years. Most previous interactive recommendation systems only focus on optimizing recommendation accuracy while overlooking other important aspects of recommendation quality, such as the diversity of recommendation results. In this paper, we propose a novel recommendation model, named \underline{D}iversity-promoting \underline{D}eep \underline{R}einforcement \underline{L}earning (D$^2$RL), which encourages the diversity of recommendation results in interaction recommendations. More specifically, we adopt a Determinantal Point Process (DPP) model to generate diverse, while relevant item recommendations. A personalized DPP kernel matrix is maintained for each user, which is constructed from two parts: a fixed similarity matrix capturing item-item similarity, and the relevance of items dynamically learnt through an actor-critic reinforcement learning framework. We performed extensive offline experiments as well as simulated online experiments with real world datasets to demonstrate the effectiveness of the proposed model.

Via

Access Paper or Ask Questions

Towards Physically Safe Reinforcement Learning under Supervision

Jan 19, 2019
Yinan Zhang, Devin Balkcom, Haoxiang Li

Figure 1 for Towards Physically Safe Reinforcement Learning under Supervision

Figure 2 for Towards Physically Safe Reinforcement Learning under Supervision

Figure 3 for Towards Physically Safe Reinforcement Learning under Supervision

Figure 4 for Towards Physically Safe Reinforcement Learning under Supervision

This paper addresses the question of how a previously available control policy $\pi_s$ can be used as a supervisor to more quickly and safely train a new learned control policy $\pi_L$ for a robot. A weighted average of the supervisor and learned policies is used during trials, with a heavier weight initially on the supervisor, in order to allow safe and useful physical trials while the learned policy is still ineffective. During the process, the weight is adjusted to favor the learned policy. As weights are adjusted, the learned network must compensate so as to give safe and reasonable outputs under the different weights. A pioneer network is introduced that pre-learns a policy that performs similarly to the current learned policy under the planned next step for new weights; this pioneer network then replaces the currently learned network in the next set of trials. Experiments in OpenAI Gym demonstrate the effectiveness of the proposed method.

Via

Access Paper or Ask Questions

Sampling Clustering

Jun 21, 2018
Ching Tarn, Yinan Zhang, Ye Feng

We propose an efficient graph-based divisive cluster analysis approach called sampling clustering. It constructs a lite informative dendrogram by recursively dividing a graph into subgraphs. In each recursive call, a graph is sampled first with a set of vertices being removed to disconnect latent clusters, then condensed by adding edges to the remaining vertices to avoid graph fragmentation caused by vertex removals. We also present some sampling and condensing methods and discuss the effectiveness in this paper. Our implementations run in linear time and achieve outstanding performance on various types of datasets. Experimental results show that they outperform state-of-the-art clustering algorithms with significantly less computing resources requirements.

Via

Access Paper or Ask Questions