Brian M. Sadler

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

Jun 16, 2024
Via arXiv

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

Apr 20, 2024
Via arXiv

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

Mar 18, 2024
Via arXiv

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

Mar 06, 2024
Via arXiv

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks

Feb 09, 2024
Via arXiv

Factor Graph Processing for Dual-Blind Deconvolution at ISAC Receiver

Oct 22, 2023
Via arXiv

An Invitation to Hypercomplex Phase Retrieval: Theory and Applications

Oct 20, 2023
Via arXiv

Index-Modulated Metasurface Transceiver Design using Reconfigurable Intelligent Surfaces for 6G Wireless Networks

Oct 04, 2023
Via arXiv

Octonion Phase Retrieval

Aug 30, 2023
Via arXiv

Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation

Jun 09, 2023
Via arXiv