Picture for Zhuang Li

Zhuang Li

NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning

Add code
Jun 03, 2026
Viaarxiv icon

RDGen: Demonstration Generation for High-Quality Robot Learning via Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

How Robust Are Large Language Models for Clinical Numeracy? An Empirical Study on Numerical Reasoning Abilities in Clinical Contexts

Add code
Apr 13, 2026
Viaarxiv icon

Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans

Add code
Nov 16, 2025
Viaarxiv icon

DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement

Add code
Jun 18, 2025
Viaarxiv icon

QQSUM: A Novel Task and Model of Quantitative Query-Focused Summarization for Review-based Product Question Answering

Add code
Jun 04, 2025
Viaarxiv icon

TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis

Add code
May 30, 2025
Figure 1 for TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis
Figure 2 for TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis
Figure 3 for TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis
Figure 4 for TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis
Viaarxiv icon

EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions

Add code
May 29, 2025
Figure 1 for EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Figure 2 for EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Figure 3 for EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Figure 4 for EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Viaarxiv icon

LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews

Add code
Apr 15, 2025
Viaarxiv icon

RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars

Add code
Feb 20, 2025
Viaarxiv icon