Picture for Hung Le

Hung Le

Pick

SPaRFT: Self-Paced Reinforcement Fine-Tuning for Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning

Add code
Jul 28, 2025
Viaarxiv icon

Hybrid Cross-domain Robust Reinforcement Learning

Add code
May 29, 2025
Viaarxiv icon

Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer

Add code
May 14, 2025
Viaarxiv icon

Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models

Add code
Apr 03, 2025
Viaarxiv icon

Sparse Mixture of Experts as Unified Competitive Learning

Add code
Mar 29, 2025
Viaarxiv icon

S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning

Add code
Mar 29, 2025
Figure 1 for S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
Figure 2 for S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
Figure 3 for S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
Figure 4 for S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
Viaarxiv icon

On the effectiveness of discrete representations in sparse mixture of experts

Add code
Nov 28, 2024
Figure 1 for On the effectiveness of discrete representations in sparse mixture of experts
Figure 2 for On the effectiveness of discrete representations in sparse mixture of experts
Figure 3 for On the effectiveness of discrete representations in sparse mixture of experts
Figure 4 for On the effectiveness of discrete representations in sparse mixture of experts
Viaarxiv icon

CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models

Add code
Nov 07, 2024
Figure 1 for CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Figure 2 for CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Figure 3 for CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Figure 4 for CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Viaarxiv icon

Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning

Add code
Oct 14, 2024
Figure 1 for Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Figure 2 for Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Figure 3 for Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Figure 4 for Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Viaarxiv icon