Yunjian Xu

ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning

May 29, 2025
Via arXiv

Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs

May 19, 2025
Via arXiv

GAS: Generative Auto-bidding with Post-training Search

Dec 22, 2024
Via arXiv