
Sachit Menon

CAViAR: Critic-Augmented Video Agentic Reasoning

Sep 09, 2025
Via arXiv

MINERVA: Evaluating Complex Video Reasoning

May 01, 2025
Via arXiv

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Jun 20, 2024
Via arXiv

Generating Illustrated Instructions

Dec 07, 2023
Via arXiv

ViperGPT: Visual Inference via Python Execution for Reasoning

Mar 14, 2023
Via arXiv

Affective Faces for Goal-Driven Dyadic Communication

Jan 26, 2023
Via arXiv

Doubly Right Object Recognition: A Why Prompt for Visual Rationales

Dec 12, 2022
Via arXiv

Task Bias in Vision-Language Models

Dec 08, 2022
Via arXiv

Visual Classification via Description from Large Language Models

Oct 13, 2022
Via arXiv

Forget-me-not! Contrastive Critics for Mitigating Posterior Collapse

Jul 19, 2022
Via arXiv