Picture for Yuhui Wang

Yuhui Wang

JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees

Add code
Mar 24, 2026
Viaarxiv icon

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization

Add code
Mar 16, 2026
Viaarxiv icon

AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks

Add code
Feb 18, 2026
Viaarxiv icon

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

Add code
Feb 05, 2026
Viaarxiv icon

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

Add code
Feb 05, 2026
Viaarxiv icon

RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models

Add code
Feb 04, 2026
Viaarxiv icon

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment

Add code
Jan 04, 2026
Viaarxiv icon

A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data

Add code
Dec 16, 2025
Viaarxiv icon

Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio

Add code
Nov 14, 2025
Viaarxiv icon

LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon