Picture for Zhuohang Li

Zhuohang Li

WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Add code
Feb 09, 2026
Viaarxiv icon

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

GR-Dexter Technical Report

Add code
Dec 30, 2025
Viaarxiv icon

What Really is a Member? Discrediting Membership Inference via Poisoning

Add code
Jun 06, 2025
Figure 1 for What Really is a Member? Discrediting Membership Inference via Poisoning
Figure 2 for What Really is a Member? Discrediting Membership Inference via Poisoning
Figure 3 for What Really is a Member? Discrediting Membership Inference via Poisoning
Figure 4 for What Really is a Member? Discrediting Membership Inference via Poisoning
Viaarxiv icon

Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark

Add code
Apr 24, 2025
Viaarxiv icon

SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable

Add code
Mar 13, 2025
Viaarxiv icon

Towards Statistical Factuality Guarantee for Large Vision-Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Automatic Prompt Optimization via Heuristic Search: A Survey

Add code
Feb 26, 2025
Viaarxiv icon

Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation

Add code
Oct 10, 2024
Figure 1 for Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation
Figure 2 for Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation
Figure 3 for Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation
Figure 4 for Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation
Viaarxiv icon

Exploring User-level Gradient Inversion with a Diffusion Prior

Add code
Sep 11, 2024
Figure 1 for Exploring User-level Gradient Inversion with a Diffusion Prior
Figure 2 for Exploring User-level Gradient Inversion with a Diffusion Prior
Figure 3 for Exploring User-level Gradient Inversion with a Diffusion Prior
Figure 4 for Exploring User-level Gradient Inversion with a Diffusion Prior
Viaarxiv icon