Picture for Xiaofeng Zhang

Xiaofeng Zhang

Bias Analysis in Unconditional Image Generative Models

Add code
Jun 10, 2025
Viaarxiv icon

CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics

Add code
Jun 10, 2025
Viaarxiv icon

AdaToken-3D: Dynamic Spatial Gating for Efficient 3D Large Multimodal-Models Reasoning

Add code
May 19, 2025
Viaarxiv icon

LensNet: An End-to-End Learning Framework for Empirical Point Spread Function Modeling and Lensless Imaging Reconstruction

Add code
May 03, 2025
Viaarxiv icon

Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach

Add code
Mar 17, 2025
Viaarxiv icon

PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving

Add code
Dec 02, 2024
Viaarxiv icon

Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

Add code
Nov 15, 2024
Figure 1 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 2 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 3 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 4 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Viaarxiv icon

DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark

Add code
Nov 05, 2024
Viaarxiv icon

High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer

Add code
Oct 30, 2024
Viaarxiv icon

GiVE: Guiding Visual Encoder to Perceive Overlooked Information

Add code
Oct 26, 2024
Viaarxiv icon