Picture for Jiaxin Ge

Jiaxin Ge

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Add code
May 29, 2025
Viaarxiv icon

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Add code
Apr 17, 2025
Viaarxiv icon

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

Add code
Mar 19, 2025
Viaarxiv icon

Enough Coin Flips Can Make LLMs Act Bayesian

Add code
Mar 06, 2025
Viaarxiv icon

AutoPresent: Designing Structured Visuals from Scratch

Add code
Jan 01, 2025
Figure 1 for AutoPresent: Designing Structured Visuals from Scratch
Figure 2 for AutoPresent: Designing Structured Visuals from Scratch
Figure 3 for AutoPresent: Designing Structured Visuals from Scratch
Figure 4 for AutoPresent: Designing Structured Visuals from Scratch
Viaarxiv icon

Training Task Experts through Retrieval Based Distillation

Add code
Jul 07, 2024
Figure 1 for Training Task Experts through Retrieval Based Distillation
Figure 2 for Training Task Experts through Retrieval Based Distillation
Figure 3 for Training Task Experts through Retrieval Based Distillation
Figure 4 for Training Task Experts through Retrieval Based Distillation
Viaarxiv icon

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation

Add code
May 27, 2024
Figure 1 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 2 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 3 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 4 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Viaarxiv icon

Iterative Prompt Relabeling for diffusion model with RLDF

Add code
Dec 23, 2023
Viaarxiv icon

Recursive Visual Programming

Add code
Dec 04, 2023
Figure 1 for Recursive Visual Programming
Figure 2 for Recursive Visual Programming
Figure 3 for Recursive Visual Programming
Figure 4 for Recursive Visual Programming
Viaarxiv icon

From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation

Add code
Nov 21, 2023
Figure 1 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 2 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 3 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 4 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Viaarxiv icon