Picture for Zipeng Dai

Zipeng Dai

CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems

Add code
Jun 25, 2024
Viaarxiv icon

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Add code
Dec 29, 2023
Figure 1 for HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
Figure 2 for HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
Figure 3 for HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
Figure 4 for HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
Viaarxiv icon

Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training

Add code
Sep 02, 2022
Figure 1 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 2 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 3 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Figure 4 for Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training
Viaarxiv icon

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

Add code
Jun 06, 2022
Figure 1 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 2 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 3 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Figure 4 for Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
Viaarxiv icon