Picture for Matthieu Cord

Matthieu Cord

SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization

Add code
Oct 06, 2025
Viaarxiv icon

RAP: 3D Rasterization Augmented End-to-End Planning

Add code
Oct 05, 2025
Viaarxiv icon

IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation

Add code
Sep 04, 2025
Figure 1 for IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation
Figure 2 for IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation
Figure 3 for IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation
Figure 4 for IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation
Viaarxiv icon

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Add code
Apr 10, 2025
Figure 1 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 2 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 3 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 4 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Viaarxiv icon

VaViM and VaVAM: Autonomous Driving through Video Generative Modeling

Add code
Feb 21, 2025
Viaarxiv icon

GaussRender: Learning 3D Occupancy with Gaussian Rendering

Add code
Feb 07, 2025
Figure 1 for GaussRender: Learning 3D Occupancy with Gaussian Rendering
Figure 2 for GaussRender: Learning 3D Occupancy with Gaussian Rendering
Figure 3 for GaussRender: Learning 3D Occupancy with Gaussian Rendering
Figure 4 for GaussRender: Learning 3D Occupancy with Gaussian Rendering
Viaarxiv icon

Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting

Add code
Jan 08, 2025
Figure 1 for Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting
Figure 2 for Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting
Figure 3 for Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting
Figure 4 for Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting
Viaarxiv icon

Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment

Add code
Jan 06, 2025
Viaarxiv icon

PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting

Add code
Dec 09, 2024
Viaarxiv icon

GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers

Add code
Nov 23, 2024
Figure 1 for GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Figure 2 for GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Figure 3 for GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Figure 4 for GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Viaarxiv icon