Picture for Sauradip Nag

Sauradip Nag

RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation

Add code
Sep 18, 2025
Viaarxiv icon

Cora: Correspondence-aware image editing using few step diffusion

Add code
May 29, 2025
Viaarxiv icon

In-2-4D: Inbetweening from Two Single-View Images to 4D Generation

Add code
Apr 11, 2025
Figure 1 for In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
Figure 2 for In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
Figure 3 for In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
Figure 4 for In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
Viaarxiv icon

Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization

Add code
Feb 11, 2025
Viaarxiv icon

SMITE: Segment Me In TimE

Add code
Oct 24, 2024
Viaarxiv icon

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Add code
Mar 14, 2024
Viaarxiv icon

Adaptive-Labeling for Enhancing Remote Sensing Cloud Understanding

Add code
Nov 09, 2023
Viaarxiv icon

DiffSED: Sound Event Detection with Denoising Diffusion

Add code
Aug 16, 2023
Viaarxiv icon

Actor-agnostic Multi-label Action Recognition with Multi-modal Query

Add code
Aug 08, 2023
Figure 1 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 2 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 3 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 4 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Viaarxiv icon

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

Add code
Mar 27, 2023
Viaarxiv icon