Picture for Heng Huang

Heng Huang

The University of Texas at Arlington

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

Add code
Nov 06, 2025
Viaarxiv icon

Trade-off in Estimating the Number of Byzantine Clients in Federated Learning

Add code
Oct 06, 2025
Figure 1 for Trade-off in Estimating the Number of Byzantine Clients in Federated Learning
Figure 2 for Trade-off in Estimating the Number of Byzantine Clients in Federated Learning
Figure 3 for Trade-off in Estimating the Number of Byzantine Clients in Federated Learning
Figure 4 for Trade-off in Estimating the Number of Byzantine Clients in Federated Learning
Viaarxiv icon

Zeroth-Order Methods for Stochastic Nonconvex Nonsmooth Composite Optimization

Add code
Oct 06, 2025
Viaarxiv icon

Achieve Performatively Optimal Policy for Performative Reinforcement Learning

Add code
Oct 06, 2025
Figure 1 for Achieve Performatively Optimal Policy for Performative Reinforcement Learning
Viaarxiv icon

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Add code
Sep 09, 2025
Viaarxiv icon

Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality

Add code
Aug 24, 2025
Figure 1 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 2 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 3 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 4 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Viaarxiv icon

Cost-Aware Contrastive Routing for LLMs

Add code
Aug 17, 2025
Figure 1 for Cost-Aware Contrastive Routing for LLMs
Figure 2 for Cost-Aware Contrastive Routing for LLMs
Figure 3 for Cost-Aware Contrastive Routing for LLMs
Figure 4 for Cost-Aware Contrastive Routing for LLMs
Viaarxiv icon

A Watermark for Auto-Regressive Image Generation Models

Add code
Jun 13, 2025
Figure 1 for A Watermark for Auto-Regressive Image Generation Models
Figure 2 for A Watermark for Auto-Regressive Image Generation Models
Figure 3 for A Watermark for Auto-Regressive Image Generation Models
Figure 4 for A Watermark for Auto-Regressive Image Generation Models
Viaarxiv icon

ARGUS: Hallucination and Omission Evaluation in Video-LLMs

Add code
Jun 09, 2025
Viaarxiv icon

CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems

Add code
May 26, 2025
Viaarxiv icon