Heng Yu, Tsinghua University

AnyLift: Scaling Motion Reconstruction from Internet Videos via 2D Diffusion

Apr 20, 2026
Via arXiv

Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations

Mar 01, 2026
Via arXiv

Compress, Cross and Scale: Multi-Level Compression Cross Networks for Efficient Scaling in Recommender Systems

Feb 12, 2026
Via arXiv

ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body

Dec 16, 2025
Via arXiv

SocialGen: Modeling Multi-Human Social Interaction with Language Models

Mar 28, 2025
Via arXiv

MagicGeo: Training-Free Text-Guided Geometric Diagram Generation

Feb 19, 2025
Via arXiv

DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset

Jan 21, 2025
Via arXiv

End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework

Dec 23, 2024
Via arXiv

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Jun 11, 2024
Via arXiv

WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD

May 03, 2024
Via arXiv