Wenqiang Zhang

Tsinghua University

PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving

Jul 18, 2024

From Efficient Multimodal Models to World Models: A Survey

Jun 27, 2024

Seeking Certainty In Uncertainty: Dual-Stage Unified Framework Solving Uncertainty in Dynamic Facial Expression Recognition

Jun 24, 2024

Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution

Jun 24, 2024

OUS: Scene-Guided Dynamic Facial Expression Recognition

May 29, 2024

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation

May 01, 2024

De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts

Mar 28, 2024

MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution

Mar 26, 2024

Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction

Mar 16, 2024

OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning

Mar 14, 2024