Zongyu Lin

Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision

May 21, 2025

FLARE: Robot Learning with Implicit World Modeling

May 21, 2025

DreamGen: Unlocking Generalization in Robot Learning through Neural Trajectories

May 19, 2025

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Mar 18, 2025

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Feb 04, 2025

STIV: Scalable Text and Image Conditioned Video Generation

Dec 10, 2024

MM-Ego: Towards Building Egocentric Multimodal LLMs

Oct 09, 2024

VDebugger: Harnessing Execution Feedback for Debugging Visual Programs

Jun 19, 2024

SparseCL: Sparse Contrastive Learning for Contradiction Retrieval

Jun 15, 2024

VideoPhy: Evaluating Physical Commonsense for Video Generation

Jun 05, 2024