Picture for Yun Fu

Yun Fu

IIR-VLM: In-Context Instance-level Recognition for Large Vision-Language Models

Add code
Jan 20, 2026
Viaarxiv icon

Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning

Add code
Jan 07, 2026
Viaarxiv icon

Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models

Add code
Dec 28, 2025
Viaarxiv icon

SimpleCall: A Lightweight Image Restoration Agent in Label-Free Environments with MLLM Perceptual Feedback

Add code
Dec 21, 2025
Viaarxiv icon

LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation

Add code
Dec 18, 2025
Viaarxiv icon

AdaSports-Traj: Role- and Domain-Aware Adaptation for Multi-Agent Trajectory Modeling in Sports

Add code
Sep 19, 2025
Viaarxiv icon

MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation Learning

Add code
Sep 19, 2025
Figure 1 for MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation Learning
Figure 2 for MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation Learning
Figure 3 for MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation Learning
Figure 4 for MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation Learning
Viaarxiv icon

Dense Video Understanding with Gated Residual Tokenization

Add code
Sep 18, 2025
Viaarxiv icon

Out-of-Sight Trajectories: Tracking, Fusion, and Prediction

Add code
Sep 18, 2025
Viaarxiv icon

ExpertGen: Training-Free Expert Guidance for Controllable Text-to-Face Generation

Add code
May 22, 2025
Viaarxiv icon