Picture for Shiji Song

Shiji Song

Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

Add code
Oct 30, 2023
Viaarxiv icon

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

Add code
Oct 06, 2023
Viaarxiv icon

Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation

Add code
Oct 06, 2023
Figure 1 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 2 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 3 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 4 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Viaarxiv icon

Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction

Add code
Oct 01, 2023
Figure 1 for Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction
Figure 2 for Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction
Figure 3 for Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction
Figure 4 for Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction
Viaarxiv icon

Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning

Add code
Sep 04, 2023
Viaarxiv icon

DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Add code
Sep 04, 2023
Figure 1 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 2 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 3 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Figure 4 for DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Viaarxiv icon

Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance

Add code
Sep 04, 2023
Figure 1 for Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Figure 2 for Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Figure 3 for Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Figure 4 for Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Viaarxiv icon

Latency-aware Unified Dynamic Networks for Efficient Image Recognition

Add code
Sep 02, 2023
Viaarxiv icon

Computation-efficient Deep Learning for Computer Vision: A Survey

Add code
Aug 27, 2023
Viaarxiv icon

Learning Specialized Activation Functions for Physics-informed Neural Networks

Add code
Aug 08, 2023
Viaarxiv icon