Picture for Didier Stricker

Didier Stricker

3D Spatial Understanding in MLLMs: Disambiguation and Evaluation

Add code
Dec 09, 2024
Figure 1 for 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation
Figure 2 for 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation
Figure 3 for 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation
Figure 4 for 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation
Viaarxiv icon

Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection

Add code
Dec 06, 2024
Figure 1 for Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
Figure 2 for Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
Viaarxiv icon

Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction

Add code
Nov 29, 2024
Figure 1 for Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
Figure 2 for Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
Figure 3 for Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
Figure 4 for Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
Viaarxiv icon

Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation

Add code
Nov 26, 2024
Figure 1 for Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation
Figure 2 for Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation
Figure 3 for Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation
Figure 4 for Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation
Viaarxiv icon

MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation

Add code
Nov 26, 2024
Viaarxiv icon

AnonyNoise: Anonymizing Event Data with Smart Noise to Outsmart Re-Identification and Preserve Privacy

Add code
Nov 25, 2024
Viaarxiv icon

SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network

Add code
Oct 02, 2024
Figure 1 for SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network
Figure 2 for SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network
Figure 3 for SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network
Figure 4 for SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network
Viaarxiv icon

Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies

Add code
Sep 30, 2024
Viaarxiv icon

Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations

Add code
Sep 30, 2024
Viaarxiv icon

Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts

Add code
Sep 25, 2024
Figure 1 for Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts
Figure 2 for Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts
Figure 3 for Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts
Figure 4 for Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts
Viaarxiv icon