Jieyu Zhang

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

May 29, 2025

H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from Videos

May 17, 2025

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

Apr 30, 2025

Nemotron-Research-Tool-N1: Tool-Using Language Models with Reinforced Reasoning

Apr 25, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Mar 30, 2025

Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming

Dec 11, 2024

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Dec 11, 2024

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Dec 10, 2024

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Dec 09, 2024

EcoAct: Economic Agent Determines When to Register What Action

Nov 03, 2024