Picture for Wengyu Zhang

Wengyu Zhang

Removal of Hallucination on Hallucination: Debate-Augmented RAG

Add code
May 24, 2025
Viaarxiv icon

MolGround: A Benchmark for Molecular Grounding

Add code
Apr 01, 2025
Viaarxiv icon

Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)

Add code
Feb 18, 2025
Figure 1 for Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)
Figure 2 for Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)
Figure 3 for Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)
Figure 4 for Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)
Viaarxiv icon

PolySmart @ TRECVid 2024 Video-To-Text

Add code
Dec 23, 2024
Figure 1 for PolySmart @ TRECVid 2024 Video-To-Text
Figure 2 for PolySmart @ TRECVid 2024 Video-To-Text
Figure 3 for PolySmart @ TRECVid 2024 Video-To-Text
Figure 4 for PolySmart @ TRECVid 2024 Video-To-Text
Viaarxiv icon

Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings

Add code
Jul 30, 2024
Figure 1 for Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings
Figure 2 for Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings
Figure 3 for Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings
Figure 4 for Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings
Viaarxiv icon

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

Add code
Jul 23, 2024
Figure 1 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 2 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 3 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 4 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Viaarxiv icon

A Survey on Personalized Content Synthesis with Diffusion Models

Add code
May 09, 2024
Viaarxiv icon

Generative Active Learning for Image Synthesis Personalization

Add code
Mar 22, 2024
Viaarxiv icon

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning

Add code
Mar 22, 2024
Viaarxiv icon