Picture for Dong Huang

Dong Huang

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Add code
May 29, 2025
Viaarxiv icon

Structured Agent Distillation for Large Language Model

Add code
May 20, 2025
Viaarxiv icon

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Add code
May 19, 2025
Viaarxiv icon

M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation

Add code
May 07, 2025
Viaarxiv icon

Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks

Add code
Mar 26, 2025
Viaarxiv icon

State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning

Add code
Mar 26, 2025
Viaarxiv icon

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Add code
Jan 08, 2025
Figure 1 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 2 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 3 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Figure 4 for RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Viaarxiv icon

Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective

Add code
Jan 08, 2025
Viaarxiv icon

Effi-Code: Unleashing Code Efficiency in Language Models

Add code
Oct 14, 2024
Viaarxiv icon

HE-Nav: A High-Performance and Efficient Navigation System for Aerial-Ground Robots in Cluttered Environments

Add code
Oct 07, 2024
Viaarxiv icon