Sayan Deb Sarkar

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Feb 13, 2026
Via arXiv

CrossOver: 3D Scene Cross-Modal Alignment

Feb 20, 2025
Via arXiv

HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language

May 28, 2023
Via arXiv

SGAligner: 3D Scene Alignment with Scene Graphs

Apr 28, 2023
Via arXiv

HO-3D_v3: Improving the Accuracy of Hand-Object Annotations of the HO-3D Dataset

Jul 02, 2021
Via arXiv

HandsFormer: Keypoint Transformer for Monocular 3D Pose Estimation of Hands and Object in Interaction

Apr 29, 2021
Via arXiv

Monte Carlo Scene Search for 3D Scene Understanding

Mar 30, 2021
Via arXiv