Picture for Anoop Cherian

Anoop Cherian

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Add code
Jun 22, 2024
Viaarxiv icon

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Add code
Apr 25, 2024
Figure 1 for TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Figure 2 for TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Figure 3 for TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Figure 4 for TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Viaarxiv icon

Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection

Add code
Dec 17, 2023
Viaarxiv icon

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Add code
Sep 30, 2023
Figure 1 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 2 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 3 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 4 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Viaarxiv icon

Pixel-Grounded Prototypical Part Networks

Add code
Sep 25, 2023
Figure 1 for Pixel-Grounded Prototypical Part Networks
Figure 2 for Pixel-Grounded Prototypical Part Networks
Figure 3 for Pixel-Grounded Prototypical Part Networks
Figure 4 for Pixel-Grounded Prototypical Part Networks
Viaarxiv icon

Active Sparse Conversations for Improved Audio-Visual Embodied Navigation

Add code
Jun 06, 2023
Figure 1 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 2 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 3 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 4 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Viaarxiv icon

HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions

Add code
Apr 01, 2023
Viaarxiv icon

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

Add code
Mar 27, 2023
Viaarxiv icon

Are Deep Neural Networks SMARTer than Second Graders?

Add code
Jan 05, 2023
Figure 1 for Are Deep Neural Networks SMARTer than Second Graders?
Figure 2 for Are Deep Neural Networks SMARTer than Second Graders?
Figure 3 for Are Deep Neural Networks SMARTer than Second Graders?
Figure 4 for Are Deep Neural Networks SMARTer than Second Graders?
Viaarxiv icon

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation

Add code
Oct 29, 2022
Viaarxiv icon