Zhiyuan Hu

BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum

May 27, 2025
Via arXiv

DeepInverse: A Python package for solving imaging inverse problems with deep learning

May 26, 2025
Via arXiv

Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding

May 19, 2025
Via arXiv

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

May 15, 2025
Via arXiv

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Apr 22, 2025
Via arXiv

JudgeLRM: Large Reasoning Models as a Judge

Mar 31, 2025
Via arXiv

Ride-Sourcing Vehicle Rebalancing with Service Accessibility Guarantees via Constrained Mean-Field Reinforcement Learning

Mar 31, 2025
Via arXiv

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Jan 21, 2025
Via arXiv

Multi-Scale Contrastive Learning for Video Temporal Grounding

Dec 10, 2024
Via arXiv

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Sep 04, 2024
Via arXiv