Picture for Farid Boussaid

Farid Boussaid

LatentMove: Towards Complex Human Movement Video Generation

Add code
May 28, 2025
Viaarxiv icon

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Add code
May 23, 2025
Viaarxiv icon

Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis

Add code
Mar 05, 2025
Viaarxiv icon

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

Add code
Nov 13, 2024
Figure 1 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 2 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 3 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 4 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Viaarxiv icon

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Add code
Aug 22, 2024
Figure 1 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 2 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 3 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 4 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Viaarxiv icon

Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions

Add code
Jul 27, 2024
Figure 1 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 2 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 3 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 4 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Viaarxiv icon

Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation

Add code
Mar 02, 2024
Figure 1 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 2 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 3 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 4 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Viaarxiv icon

Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models

Add code
Feb 27, 2024
Figure 1 for Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Figure 2 for Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Figure 3 for Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Figure 4 for Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Viaarxiv icon

Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

Add code
Sep 10, 2023
Viaarxiv icon

MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

Add code
Aug 06, 2023
Viaarxiv icon