
Jinqiao Wang

Foundation Model Research Center, Institute of Automation, Chinese Academy of Sciences, objecteye.Inc

AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection

Aug 08, 2025
Via arXiv

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval

Aug 06, 2025
Via arXiv

Scaling Linear Attention with Sparse State Expansion

Jul 22, 2025
Via arXiv

MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing

Jul 02, 2025
Via arXiv

VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories?

Jun 13, 2025
Via arXiv

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

May 27, 2025
Via arXiv

MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection

Apr 17, 2025
Via arXiv

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability

Mar 13, 2025
Via arXiv

LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning

Mar 11, 2025
Via arXiv

Synthetic Data is an Elegant GIFT for Continual Vision-Language Models

Mar 06, 2025
Via arXiv