Picture for Zongxia Li

Zongxia Li

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Add code
Aug 27, 2025
Viaarxiv icon

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Add code
Aug 07, 2025
Viaarxiv icon

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation

Add code
Jun 18, 2025
Viaarxiv icon

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

Add code
May 02, 2025
Viaarxiv icon

Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators

Add code
Mar 09, 2025
Figure 1 for Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators
Figure 2 for Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators
Figure 3 for Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators
Figure 4 for Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators
Viaarxiv icon

Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs

Add code
Feb 20, 2025
Viaarxiv icon

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey

Add code
Jan 04, 2025
Figure 1 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 2 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 3 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 4 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Viaarxiv icon

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement

Add code
Sep 28, 2024
Viaarxiv icon

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?

Add code
Jun 15, 2024
Figure 1 for Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Figure 2 for Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Figure 3 for Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Figure 4 for Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Viaarxiv icon

PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation

Add code
Feb 17, 2024
Figure 1 for PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation
Figure 2 for PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation
Figure 3 for PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation
Figure 4 for PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation
Viaarxiv icon