Picture for Heng Huang

Heng Huang

The University of Texas at Arlington

Trade-off in Estimating the Number of Byzantine Clients in Federated Learning

Add code
Oct 06, 2025
Viaarxiv icon

Achieve Performatively Optimal Policy for Performative Reinforcement Learning

Add code
Oct 06, 2025
Figure 1 for Achieve Performatively Optimal Policy for Performative Reinforcement Learning
Viaarxiv icon

Zeroth-Order Methods for Stochastic Nonconvex Nonsmooth Composite Optimization

Add code
Oct 06, 2025
Viaarxiv icon

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Add code
Sep 09, 2025
Viaarxiv icon

Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality

Add code
Aug 24, 2025
Figure 1 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 2 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 3 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Figure 4 for Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Viaarxiv icon

Cost-Aware Contrastive Routing for LLMs

Add code
Aug 17, 2025
Figure 1 for Cost-Aware Contrastive Routing for LLMs
Figure 2 for Cost-Aware Contrastive Routing for LLMs
Figure 3 for Cost-Aware Contrastive Routing for LLMs
Figure 4 for Cost-Aware Contrastive Routing for LLMs
Viaarxiv icon

A Watermark for Auto-Regressive Image Generation Models

Add code
Jun 13, 2025
Figure 1 for A Watermark for Auto-Regressive Image Generation Models
Figure 2 for A Watermark for Auto-Regressive Image Generation Models
Figure 3 for A Watermark for Auto-Regressive Image Generation Models
Figure 4 for A Watermark for Auto-Regressive Image Generation Models
Viaarxiv icon

ARGUS: Hallucination and Omission Evaluation in Video-LLMs

Add code
Jun 09, 2025
Viaarxiv icon

CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems

Add code
May 26, 2025
Viaarxiv icon

Learning to Reason via Mixture-of-Thought for Logical Reasoning

Add code
May 21, 2025
Viaarxiv icon