Picture for Weiyao Lin

Weiyao Lin

From Priors to Perception: Grounding Video-LLMs in Physical Reality

Add code
May 06, 2026
Viaarxiv icon

UnGAP: Uncertainty-Guided Affine Prompting for Real-Time Crack Segmentation

Add code
May 04, 2026
Viaarxiv icon

Object-Attribute-Relation Model Driven Adaptive Hierarchical Transmission for Multimodal Semantic Communication

Add code
Apr 09, 2026
Viaarxiv icon

VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding

Add code
Jan 25, 2026
Viaarxiv icon

Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version)

Add code
Sep 16, 2025
Viaarxiv icon

ProgD: Progressive Multi-scale Decoding with Dynamic Graphs for Joint Multi-agent Motion Forecasting

Add code
Sep 11, 2025
Viaarxiv icon

MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment

Add code
Aug 08, 2025
Viaarxiv icon

ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression

Add code
Jun 18, 2025
Figure 1 for ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression
Figure 2 for ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression
Figure 3 for ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression
Figure 4 for ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression
Viaarxiv icon

CogStream: Context-guided Streaming Video Question Answering

Add code
Jun 12, 2025
Figure 1 for CogStream: Context-guided Streaming Video Question Answering
Figure 2 for CogStream: Context-guided Streaming Video Question Answering
Figure 3 for CogStream: Context-guided Streaming Video Question Answering
Figure 4 for CogStream: Context-guided Streaming Video Question Answering
Viaarxiv icon

Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning

Add code
Jun 09, 2025
Figure 1 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 2 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 3 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 4 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Viaarxiv icon