Picture for Jiaxin Ge

Jiaxin Ge

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Add code
Jan 23, 2026
Viaarxiv icon

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

Add code
Jan 16, 2026
Viaarxiv icon

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Add code
May 29, 2025
Viaarxiv icon

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Add code
Apr 17, 2025
Viaarxiv icon

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

Add code
Mar 19, 2025
Viaarxiv icon

Enough Coin Flips Can Make LLMs Act Bayesian

Add code
Mar 06, 2025
Viaarxiv icon

AutoPresent: Designing Structured Visuals from Scratch

Add code
Jan 01, 2025
Figure 1 for AutoPresent: Designing Structured Visuals from Scratch
Figure 2 for AutoPresent: Designing Structured Visuals from Scratch
Figure 3 for AutoPresent: Designing Structured Visuals from Scratch
Figure 4 for AutoPresent: Designing Structured Visuals from Scratch
Viaarxiv icon

Training Task Experts through Retrieval Based Distillation

Add code
Jul 07, 2024
Figure 1 for Training Task Experts through Retrieval Based Distillation
Figure 2 for Training Task Experts through Retrieval Based Distillation
Figure 3 for Training Task Experts through Retrieval Based Distillation
Figure 4 for Training Task Experts through Retrieval Based Distillation
Viaarxiv icon

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation

Add code
May 27, 2024
Figure 1 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 2 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 3 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Figure 4 for Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation
Viaarxiv icon

Iterative Prompt Relabeling for diffusion model with RLDF

Add code
Dec 23, 2023
Figure 1 for Iterative Prompt Relabeling for diffusion model with RLDF
Figure 2 for Iterative Prompt Relabeling for diffusion model with RLDF
Figure 3 for Iterative Prompt Relabeling for diffusion model with RLDF
Figure 4 for Iterative Prompt Relabeling for diffusion model with RLDF
Viaarxiv icon