Picture for Dong Huang

Dong Huang

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Add code
May 29, 2025
Viaarxiv icon

Structured Agent Distillation for Large Language Model

Add code
May 20, 2025
Viaarxiv icon

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Add code
May 19, 2025
Viaarxiv icon

M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation

Add code
May 07, 2025
Viaarxiv icon

Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks

Add code
Mar 26, 2025
Viaarxiv icon

State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning

Add code
Mar 26, 2025
Viaarxiv icon

Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective

Add code
Jan 08, 2025
Viaarxiv icon

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Add code
Jan 08, 2025
Figure 1 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 2 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 3 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 4 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Viaarxiv icon

Effi-Code: Unleashing Code Efficiency in Language Models

Add code
Oct 14, 2024
Viaarxiv icon

HE-Nav: A High-Performance and Efficient Navigation System for Aerial-Ground Robots in Cluttered Environments

Add code
Oct 07, 2024
Viaarxiv icon