Picture for Yuhui Wang

Yuhui Wang

LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation

Add code
Jun 04, 2025
Viaarxiv icon

Self-Destructive Language Model

Add code
May 18, 2025
Viaarxiv icon

AutoRAN: Weak-to-Strong Jailbreaking of Large Reasoning Models

Add code
May 16, 2025
Viaarxiv icon

Directly Forecasting Belief for Reinforcement Learning with Delays

Add code
May 01, 2025
Viaarxiv icon

Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k

Add code
Mar 12, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

GraphRAG under Fire

Add code
Jan 23, 2025
Viaarxiv icon

Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks

Add code
Dec 14, 2024
Figure 1 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 2 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 3 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 4 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Viaarxiv icon

RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Add code
Oct 25, 2024
Figure 1 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 2 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 3 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 4 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Viaarxiv icon

Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning

Add code
Jun 12, 2024
Viaarxiv icon