He Zhang

Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion

Mar 16, 2026

GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies

Mar 15, 2026

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

Feb 23, 2026

From Subtle to Significant: Prompt-Driven Self-Improving Optimization in Test-Time Graph OOD Detection

Feb 19, 2026

ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI

Feb 16, 2026

A Brain-inspired Embodied Intelligence for Fluid and Fast Reflexive Robotics Control

Jan 21, 2026

DexterCap: An Affordable and Automated System for Capturing Dexterous Hand-Object Manipulation

Jan 09, 2026

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Dec 19, 2025

Fault2Flow: An AlphaEvolve-Optimized Human-in-the-Loop Multi-Agent System for Fault-to-Workflow Automation

Nov 17, 2025

MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

Nov 13, 2025