Picture for Cairong Zhao

Cairong Zhao

A Comprehensive Evaluation on Quantization Techniques for Large Language Models

Add code
Jul 23, 2025
Viaarxiv icon

One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Add code
Jul 10, 2025
Viaarxiv icon

Text-promptable Object Counting via Quantity Awareness Enhancement

Add code
Jul 09, 2025
Viaarxiv icon

A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding

Add code
Jul 09, 2025
Viaarxiv icon

Perception Activator: An intuitive and portable framework for brain cognitive exploration

Add code
Jul 03, 2025
Viaarxiv icon

Toward Rich Video Human-Motion2D Generation

Add code
Jun 17, 2025
Viaarxiv icon

TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Add code
May 22, 2025
Viaarxiv icon

Exploring Interpretability for Visual Prompt Tuning with Hierarchical Concepts

Add code
Mar 08, 2025
Viaarxiv icon

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion

Add code
Oct 31, 2024
Figure 1 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 2 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 3 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Figure 4 for DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Viaarxiv icon

Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection

Add code
Sep 30, 2024
Figure 1 for Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
Figure 2 for Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
Figure 3 for Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
Figure 4 for Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
Viaarxiv icon