Picture for Qian Zhu

Qian Zhu

A Pragmatic VLA Foundation Model

Add code
Jan 26, 2026
Viaarxiv icon

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents

Add code
Jan 16, 2026
Viaarxiv icon

AudioCIL: A Python Toolbox for Audio Class-Incremental Learning with Multiple Scenes

Add code
Dec 16, 2024
Viaarxiv icon

SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks

Add code
Oct 10, 2024
Viaarxiv icon

AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations

Add code
Sep 09, 2024
Figure 1 for AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations
Figure 2 for AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations
Figure 3 for AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations
Figure 4 for AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations
Viaarxiv icon

Pedestrian Attribute Recognition: A New Benchmark Dataset and A Large Language Model Augmented Framework

Add code
Aug 19, 2024
Viaarxiv icon

Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition

Add code
Apr 27, 2024
Figure 1 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 2 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 3 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 4 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Viaarxiv icon

The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model

Add code
Dec 05, 2023
Figure 1 for The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model
Figure 2 for The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model
Figure 3 for The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model
Figure 4 for The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model
Viaarxiv icon

Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion

Add code
Apr 21, 2022
Figure 1 for Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion
Figure 2 for Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion
Figure 3 for Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion
Figure 4 for Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion
Viaarxiv icon

Crop Height and Plot Estimation from Unmanned Aerial Vehicles using 3D LiDAR

Add code
Oct 30, 2019
Figure 1 for Crop Height and Plot Estimation from Unmanned Aerial Vehicles using 3D LiDAR
Figure 2 for Crop Height and Plot Estimation from Unmanned Aerial Vehicles using 3D LiDAR
Figure 3 for Crop Height and Plot Estimation from Unmanned Aerial Vehicles using 3D LiDAR
Figure 4 for Crop Height and Plot Estimation from Unmanned Aerial Vehicles using 3D LiDAR
Viaarxiv icon