Federico Tombari

Text-Conditioned Resampler For Long Form Video Understanding

Dec 19, 2023
Via arXiv

LIME: Localized Image Editing via Attention Regularization in Diffusion Models

Dec 14, 2023
Via arXiv

CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models

Dec 11, 2023
Via arXiv

Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis

Dec 04, 2023
Via arXiv

DNS SLAM: Dense Neural Semantic-Informed SLAM

Nov 30, 2023
Via arXiv

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Nov 27, 2023
Via arXiv

HACD: Hand-Aware Conditional Diffusion for Monocular Hand-Held Object Reconstruction

Nov 23, 2023
Via arXiv

3D Compression Using Neural Fields

Nov 21, 2023
Via arXiv

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

Nov 18, 2023
Via arXiv

SILC: Improving Vision Language Pretraining with Self-Distillation

Oct 20, 2023
Via arXiv