Picture for Yaqi Xie

Yaqi Xie

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

Add code
May 23, 2025
Viaarxiv icon

Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation

Add code
May 21, 2025
Viaarxiv icon

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Add code
Feb 10, 2025
Viaarxiv icon

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation

Add code
Nov 03, 2024
Figure 1 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 2 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 3 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 4 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Viaarxiv icon

LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation

Add code
Nov 01, 2024
Figure 1 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 2 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 3 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 4 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Viaarxiv icon

Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models

Add code
Oct 16, 2024
Figure 1 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 2 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 3 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 4 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Viaarxiv icon

VC Theory for Inventory Policies

Add code
Apr 17, 2024
Viaarxiv icon

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

Add code
Apr 05, 2024
Viaarxiv icon

MUGC: Machine Generated versus User Generated Content Detection

Add code
Mar 28, 2024
Viaarxiv icon