Haiyun Guo

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval

Aug 06, 2025
Via arXiv

Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation

May 22, 2025
Via arXiv

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability

Mar 13, 2025
Via arXiv

Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence

Dec 18, 2024
Via arXiv

Monocular Lane Detection Based on Deep Learning: A Survey

Nov 26, 2024
Via arXiv

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models

Nov 09, 2024
Via arXiv

WaveMo: Learning Wavefront Modulations to See Through Scattering

Apr 11, 2024
Via arXiv

Continual Instruction Tuning for Large Multimodal Models

Nov 27, 2023
Via arXiv

FPM-INR: Fourier ptychographic microscopy image stack reconstruction using implicit neural representations

Oct 31, 2023
Via arXiv

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection

Mar 26, 2023
Via arXiv