Picture for Pritam Sarkar

Pritam Sarkar

VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

Add code
May 13, 2025
Figure 1 for VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
Figure 2 for VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
Figure 3 for VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
Figure 4 for VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
Viaarxiv icon

Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization

Add code
Apr 16, 2025
Figure 1 for Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Figure 2 for Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Figure 3 for Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Figure 4 for Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Viaarxiv icon

Mitigating Object Hallucination via Data Augmented Contrastive Tuning

Add code
May 28, 2024
Viaarxiv icon

Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation

Add code
Aug 25, 2023
Figure 1 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 2 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 3 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 4 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Viaarxiv icon

Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

Add code
Jun 03, 2023
Viaarxiv icon

XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning

Add code
Dec 12, 2022
Viaarxiv icon

AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work

Add code
May 13, 2022
Figure 1 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 2 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 3 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 4 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Viaarxiv icon

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity

Add code
Nov 14, 2021
Figure 1 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 2 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 3 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 4 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Viaarxiv icon

CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG

Add code
Sep 30, 2020
Figure 1 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 2 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 3 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 4 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Viaarxiv icon

Self-supervised ECG Representation Learning for Emotion Recognition

Add code
Feb 04, 2020
Figure 1 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 2 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 3 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 4 for Self-supervised ECG Representation Learning for Emotion Recognition
Viaarxiv icon