Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuan Shen

Generalized Expectation Maximization Framework for Blind Image Super Resolution

May 23, 2023

Yuxiao Li, Zhiming Wang, Yuan Shen

Figure 1 for Generalized Expectation Maximization Framework for Blind Image Super Resolution

Figure 2 for Generalized Expectation Maximization Framework for Blind Image Super Resolution

Figure 3 for Generalized Expectation Maximization Framework for Blind Image Super Resolution

Figure 4 for Generalized Expectation Maximization Framework for Blind Image Super Resolution

Abstract:Learning-based methods for blind single image super resolution (SISR) conduct the restoration by a learned mapping between high-resolution (HR) images and their low-resolution (LR) counterparts degraded with arbitrary blur kernels. However, these methods mostly require an independent step to estimate the blur kernel, leading to error accumulation between steps. We propose an end-to-end learning framework for the blind SISR problem, which enables image restoration within a unified Bayesian framework with either full- or semi-supervision. The proposed method, namely SREMN, integrates learning techniques into the generalized expectation-maximization (GEM) algorithm and infers HR images from the maximum likelihood estimation (MLE). Extensive experiments show the superiority of the proposed method with comparison to existing work and novelty in semi-supervised learning.

Via

Access Paper or Ask Questions

A Deep Learning Approach for Generating Soft Range Information from RF Data

May 23, 2023

Yuxiao Li, Santiago Mazuelas, Yuan Shen

Abstract:Radio frequency (RF)-based techniques are widely adopted for indoor localization despite the challenges in extracting sufficient information from measurements. Soft range information (SRI) offers a promising alternative for highly accurate localization that gives all probable range values rather than a single estimate of distance. We propose a deep learning approach to generate accurate SRI from RF measurements. In particular, the proposed approach is implemented by a network with two neural modules and conducts the generation directly from raw data. Extensive experiments on a case study with two public datasets are conducted to quantify the efficiency in different indoor localization tasks. The results show that the proposed approach can generate highly accurate SRI, and significantly outperforms conventional techniques in both non-line-of-sight (NLOS) detection and ranging error mitigation.

* 021 IEEE Globecom Workshops (GC Wkshps), Madrid, Spain, 2021, pp. 1-5
* Published in: 2021 IEEE Globecom Workshops (GC Wkshps)

Via

Access Paper or Ask Questions

Efficient Planar Pose Estimation via UWB Measurements

Sep 15, 2022

Haodong Jiang, Wentao Wang, Yuan Shen, Xinghan Li, Xiaoqiang Ren, Biqiang Mu, Junfeng Wu

Figure 1 for Efficient Planar Pose Estimation via UWB Measurements

Figure 2 for Efficient Planar Pose Estimation via UWB Measurements

Figure 3 for Efficient Planar Pose Estimation via UWB Measurements

Figure 4 for Efficient Planar Pose Estimation via UWB Measurements

Abstract:State estimation is an essential part of autonomous systems. Integrating the Ultra-Wideband(UWB) technique has been shown to correct the long-term estimation drift and bypass the complexity of loop closure detection. However, few works on robotics adopt UWB as a stand-alone state estimation solution. The primary purpose of this work is to investigate planar pose estimation using only UWB range measurements and study the estimator's statistical efficiency. We prove the excellent property of a two-step scheme, which says that we can refine a consistent estimator to be asymptotically efficient by one step of Gauss-Newton iteration. Grounded on this result, we design the GN-ULS estimator and evaluate it through simulations and collected datasets. GN-ULS attains millimeter and sub-degree level accuracy on our static datasets and attains centimeter and degree level accuracy on our dynamic datasets, presenting the possibility of using only UWB for real-time state estimation.

* We change the organization of the paper, add one author, and correct several typos

Via

Access Paper or Ask Questions

Injecting Image Details into CLIP's Feature Space

Aug 31, 2022

Zilun Zhang, Cuifeng Shen, Yuan Shen, Huixin Xiong, Xinyu Zhou

Figure 1 for Injecting Image Details into CLIP's Feature Space

Figure 2 for Injecting Image Details into CLIP's Feature Space

Figure 3 for Injecting Image Details into CLIP's Feature Space

Figure 4 for Injecting Image Details into CLIP's Feature Space

Abstract:Although CLIP-like Visual Language Models provide a functional joint feature space for image and text, due to the limitation of the CILP-like model's image input size (e.g., 224), subtle details are lost in the feature representation if we input high-resolution images (e.g., 2240). In this work, we introduce an efficient framework that can produce a single feature representation for a high-resolution image that injects image details and shares the same semantic space as the original CLIP. In the framework, we train a feature fusing model based on CLIP features extracted from a carefully designed image patch method that can cover objects of any scale, weakly supervised by image-agnostic class prompted queries. We validate our framework by retrieving images from class prompted queries on the real world and synthetic datasets, showing significant performance improvement on these tasks. Furthermore, to fully demonstrate our framework's detail retrieval ability, we construct a CLEVR-like synthetic dataset called CLVER-DS, which is fully annotated and has a controllable object scale.

Via

Access Paper or Ask Questions

CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

Jul 08, 2022

Yuan Shen, Niviru Wijayaratne, Pranav Sriram, Aamir Hasan, Peter Du, Katherine Driggs-Campbell

Figure 1 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

Figure 2 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

Figure 3 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

Figure 4 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

Abstract:The task of driver attention prediction has drawn considerable interest among researchers in robotics and the autonomous vehicle industry. Driver attention prediction can play an instrumental role in mitigating and preventing high-risk events, like collisions and casualties. However, existing driver attention prediction models neglect the distraction state and intention of the driver, which can significantly influence how they observe their surroundings. To address these issues, we present a new driver attention dataset, CoCAtt (Cognitive-Conditioned Attention). Unlike previous driver attention datasets, CoCAtt includes per-frame annotations that describe the distraction state and intention of the driver. In addition, the attention data in our dataset is captured in both manual and autopilot modes using eye-tracking devices of different resolutions. Our results demonstrate that incorporating the above two driver states into attention modeling can improve the performance of driver attention prediction. To the best of our knowledge, this work is the first to provide autopilot attention data. Furthermore, CoCAtt is currently the largest and the most diverse driver attention dataset in terms of autonomy levels, eye tracker resolutions, and driving scenarios. CoCAtt is available for download at https://cocatt-dataset.github.io.

* Supplementary Material for the main paper, "CoCAtt: A Cognitive-Conditioned Driver Attention Dataset". Accepted at ITSC2022

Via

Access Paper or Ask Questions

Learning Dynamic View Synthesis With Few RGBD Cameras

Apr 22, 2022

Shengze Wang, YoungJoong Kwon, Yuan Shen, Qian Zhang, Andrei State, Jia-Bin Huang, Henry Fuchs

Figure 1 for Learning Dynamic View Synthesis With Few RGBD Cameras

Figure 2 for Learning Dynamic View Synthesis With Few RGBD Cameras

Figure 3 for Learning Dynamic View Synthesis With Few RGBD Cameras

Figure 4 for Learning Dynamic View Synthesis With Few RGBD Cameras

Abstract:There have been significant advancements in dynamic novel view synthesis in recent years. However, current deep learning models often require (1) prior models (e.g., SMPL human models), (2) heavy pre-processing, or (3) per-scene optimization. We propose to utilize RGBD cameras to remove these limitations and synthesize free-viewpoint videos of dynamic indoor scenes. We generate feature point clouds from RGBD frames and then render them into free-viewpoint videos via a neural renderer. However, the inaccurate, unstable, and incomplete depth measurements induce severe distortions, flickering, and ghosting artifacts. We enforce spatial-temporal consistency via the proposed Cycle Reconstruction Consistency and Temporal Stabilization module to reduce these artifacts. We introduce a simple Regional Depth-Inpainting module that adaptively inpaints missing depth values to render complete novel views. Additionally, we present a Human-Things Interactions dataset to validate our approach and facilitate future research. The dataset consists of 43 multi-view RGBD video sequences of everyday activities, capturing complex interactions between human subjects and their surroundings. Experiments on the HTI dataset show that our method outperforms the baseline per-frame image fidelity and spatial-temporal consistency. We will release our code, and the dataset on the website soon.

Via

Access Paper or Ask Questions

Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Feb 24, 2022

Yuanfan Xu, Jincheng Yu, Jiahao Tang, Jiantao Qiu, Jian Wang, Yuan Shen, Yu Wang, Huazhong Yang

Figure 1 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Figure 2 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Figure 3 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Figure 4 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Abstract:Autonomous exploration and mapping of unknown terrains employing single or multiple robots is an essential task in mobile robotics and has therefore been widely investigated. Nevertheless, given the lack of unified data sets, metrics, and platforms to evaluate the exploration approaches, we develop an autonomous robot exploration benchmark entitled Explore-Bench. The benchmark involves various exploration scenarios and presents two types of quantitative metrics to evaluate exploration efficiency and multi-robot cooperation. Explore-Bench is extremely useful as, recently, deep reinforcement learning (DRL) has been widely used for robot exploration tasks and achieved promising results. However, training DRL-based approaches requires large data sets, and additionally, current benchmarks rely on realistic simulators with a slow simulation speed, which is not appropriate for training exploration strategies. Hence, to support efficient DRL training and comprehensive evaluation, the suggested Explore-Bench designs a 3-level platform with a unified data flow and $12 \times$ speed-up that includes a grid-based simulator for fast evaluation and efficient training, a realistic Gazebo simulator, and a remotely accessible robot testbed for high-accuracy tests in physical environments. The practicality of the proposed benchmark is highlighted with the application of one DRL-based and three frontier-based exploration approaches. Furthermore, we analyze the performance differences and provide some insights about the selection and design of exploration methods. Our benchmark is available at https://github.com/efc-robot/Explore-Bench.

* To be published in IEEE International Conference on Robotics and Automation (ICRA), May 23-27, 2022

Via

Access Paper or Ask Questions

Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Jan 25, 2022

Hongyu Song, Jincheng Yu, Jiantao Qiu, Zhixiao Sun, Kuijun Lang, Qing Luo, Yuan Shen, Yu Wang

Figure 1 for Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Figure 2 for Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Figure 3 for Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Figure 4 for Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Abstract:For scenes such as floods and earthquakes, the disaster area is large, and rescue time is tight. Multi-UAV exploration is more efficient than a single UAV. Existing UAV exploration work is modeled as a Coverage Path Planning (CPP) task to achieve full coverage of the area in the presence of obstacles. However, the endurance capability of UAV is limited, and the rescue time is urgent. Thus, even using multiple UAVs cannot achieve complete disaster area coverage in time. Therefore, in this paper we propose a multi-Agent Endurance-limited CPP (MAEl-CPP) problem based on a priori heatmap of the disaster area, which requires the exploration of more valuable areas under limited energy. Furthermore, we propose a path planning algorithm for the MAEl-CPP problem, by ranking the possible disaster areas according to their importance through satellite or remote aerial images and completing path planning according to the importance level. Experimental results show that our proposed algorithm is at least twice as effective as the existing method in terms of search efficiency.

Via

Access Paper or Ask Questions

CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

Nov 23, 2021

Yuan Shen, Niviru Wijayaratne, Pranav Sriram, Aamir Hasan, Peter Du, Katie Driggs-Campbell

Figure 1 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

Figure 2 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

Figure 3 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

Figure 4 for CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

* 10 pages, 5 figures

Via

Access Paper or Ask Questions

Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Nov 14, 2021

Yuzi Yan, Xiaoxiang Li, Xinyou Qiu, Jiantao Qiu, Jian Wang, Yu Wang, Yuan Shen

Figure 1 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Figure 2 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Figure 3 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Figure 4 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Abstract:Multi-agent formation as well as obstacle avoidance is one of the most actively studied topics in the field of multi-agent systems. Although some classic controllers like model predictive control (MPC) and fuzzy control achieve a certain measure of success, most of them require precise global information which is not accessible in harsh environments. On the other hand, some reinforcement learning (RL) based approaches adopt the leader-follower structure to organize different agents' behaviors, which sacrifices the collaboration between agents thus suffering from bottlenecks in maneuverability and robustness. In this paper, we propose a distributed formation and obstacle avoidance method based on multi-agent reinforcement learning (MARL). Agents in our system only utilize local and relative information to make decisions and control themselves distributively. Agent in the multi-agent system will reorganize themselves into a new topology quickly in case that any of them is disconnected. Our method achieves better performance regarding formation error, formation convergence rate and on-par success rate of obstacle avoidance compared with baselines (both classic control methods and another RL-based method). The feasibility of our method is verified by both simulation and hardware implementation with Ackermann-steering vehicles.

Via

Access Paper or Ask Questions