Nicu Sebe

ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis

Apr 18, 2025
Via arXiv

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

Apr 14, 2025
Via arXiv

RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism

Apr 09, 2025
Via arXiv

Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding

Mar 20, 2025
Via arXiv

Multi-focal Conditioned Latent Diffusion for Person Image Synthesis

Mar 19, 2025
Via arXiv

Safe Vision-Language Models via Unsafe Weights Manipulation

Mar 14, 2025
Via arXiv

NullFace: Training-Free Localized Face Anonymization

Mar 11, 2025
Via arXiv

CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP

Mar 05, 2025
Via arXiv

Fully-Geometric Cross-Attention for Point Cloud Registration

Feb 12, 2025
Via arXiv

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

Feb 04, 2025
Via arXiv