Picture for Anjan Dutta

Anjan Dutta

SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

Add code
Mar 30, 2024
Figure 1 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 2 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 3 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 4 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Viaarxiv icon

DeNetDM: Debiasing by Network Depth Modulation

Add code
Mar 28, 2024
Figure 1 for DeNetDM: Debiasing by Network Depth Modulation
Figure 2 for DeNetDM: Debiasing by Network Depth Modulation
Figure 3 for DeNetDM: Debiasing by Network Depth Modulation
Viaarxiv icon

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Add code
Mar 14, 2024
Figure 1 for OmniCount: Multi-label Object Counting with Semantic-Geometric Priors
Figure 2 for OmniCount: Multi-label Object Counting with Semantic-Geometric Priors
Figure 3 for OmniCount: Multi-label Object Counting with Semantic-Geometric Priors
Figure 4 for OmniCount: Multi-label Object Counting with Semantic-Geometric Priors
Viaarxiv icon

Learning Conditional Invariances through Non-Commutativity

Add code
Feb 18, 2024
Viaarxiv icon

CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis

Add code
Dec 04, 2023
Viaarxiv icon

Transitivity Recovering Decompositions: Interpretable and Robust Fine-Grained Relationships

Add code
Oct 24, 2023
Viaarxiv icon

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

Add code
Sep 29, 2023
Figure 1 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 2 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 3 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 4 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Viaarxiv icon

Actor-agnostic Multi-label Action Recognition with Multi-modal Query

Add code
Aug 08, 2023
Figure 1 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 2 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 3 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Figure 4 for Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Viaarxiv icon

Data-Free Sketch-Based Image Retrieval

Add code
Mar 14, 2023
Figure 1 for Data-Free Sketch-Based Image Retrieval
Figure 2 for Data-Free Sketch-Based Image Retrieval
Figure 3 for Data-Free Sketch-Based Image Retrieval
Figure 4 for Data-Free Sketch-Based Image Retrieval
Viaarxiv icon

Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval

Add code
Oct 19, 2022
Figure 1 for Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval
Figure 2 for Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval
Figure 3 for Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval
Figure 4 for Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval
Viaarxiv icon