Picture for Hongning Wang

Hongning Wang

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation

Add code
Jun 02, 2026
Viaarxiv icon

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

Add code
Jun 02, 2026
Viaarxiv icon

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

Add code
May 27, 2026
Viaarxiv icon

SkillEvolver: Skill Learning as a Meta-Skill

Add code
May 11, 2026
Viaarxiv icon

HoWToBench: Holistic Evaluation for LLM's Capability in Human-level Writing using Tree of Writing

Add code
Apr 21, 2026
Viaarxiv icon

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Add code
Apr 13, 2026
Viaarxiv icon

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Add code
Mar 05, 2026
Viaarxiv icon

RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models

Add code
Feb 28, 2026
Viaarxiv icon

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

Add code
Feb 28, 2026
Viaarxiv icon

Grounding LLMs in Scientific Discovery via Embodied Actions

Add code
Feb 24, 2026
Viaarxiv icon