Picture for Nan Jiang

Nan Jiang

Faculty of Information Technology, Beijing University of Technology, Beijing, China, Beijing Key Laboratory of Trusted Computing, Beijing, China, National Engineering Laboratory for Critical Technologies of Information Security Classified Protection, Beijing, China

Explaining RL Decisions with Trajectories

Add code
May 06, 2023
Figure 1 for Explaining RL Decisions with Trajectories
Figure 2 for Explaining RL Decisions with Trajectories
Figure 3 for Explaining RL Decisions with Trajectories
Figure 4 for Explaining RL Decisions with Trajectories
Viaarxiv icon

Adversarial Model for Offline Reinforcement Learning

Add code
Feb 21, 2023
Viaarxiv icon

Offline Learning in Markov Games with General Function Approximation

Add code
Feb 06, 2023
Viaarxiv icon

Reinforcement Learning in Low-Rank MDPs with Density Features

Add code
Feb 04, 2023
Viaarxiv icon

KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair

Add code
Feb 03, 2023
Viaarxiv icon

CHAIRS: Towards Full-Body Articulated Human-Object Interaction

Add code
Dec 20, 2022
Viaarxiv icon

Semantics-Aware Remote Estimation via Information Bottleneck-Inspired Type Based Multiple Access

Add code
Dec 19, 2022
Figure 1 for Semantics-Aware Remote Estimation via Information Bottleneck-Inspired Type Based Multiple Access
Figure 2 for Semantics-Aware Remote Estimation via Information Bottleneck-Inspired Type Based Multiple Access
Figure 3 for Semantics-Aware Remote Estimation via Information Bottleneck-Inspired Type Based Multiple Access
Viaarxiv icon

Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless

Add code
Dec 12, 2022
Figure 1 for Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless
Figure 2 for Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless
Figure 3 for Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless
Figure 4 for Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless
Viaarxiv icon

Learning Combinatorial Structures via Markov Random Fields with Sampling through Lovász Local Lemma

Add code
Dec 02, 2022
Viaarxiv icon

ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data

Add code
Nov 08, 2022
Viaarxiv icon