Picture for Ming Tang

Ming Tang

Foundation Model Research Center, Institute of Automation, Chinese Academy of Sciences

AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection

Add code
Aug 08, 2025
Viaarxiv icon

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval

Add code
Aug 06, 2025
Viaarxiv icon

Real-Time Distributed Optical Fiber Vibration Recognition via Extreme Lightweight Model and Cross-Domain Distillation

Add code
Jul 28, 2025
Viaarxiv icon

FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference

Add code
Jul 03, 2025
Viaarxiv icon

MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing

Add code
Jul 02, 2025
Viaarxiv icon

SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

Add code
May 30, 2025
Viaarxiv icon

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

Add code
May 27, 2025
Viaarxiv icon

One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models

Add code
May 18, 2025
Viaarxiv icon

iMacSR: Intermediate Multi-Access Supervision and Regularization in Training Autonomous Driving Models

Add code
May 01, 2025
Viaarxiv icon

FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving

Add code
May 01, 2025
Viaarxiv icon