Picture for Zhongzheng Ren

Zhongzheng Ren

RefDecoder: Enhancing Visual Generation with Conditional Video Decoding

Add code
May 14, 2026
Viaarxiv icon

MolmoAct2: Action Reasoning Models for Real-world Deployment

Add code
May 04, 2026
Viaarxiv icon

WildDet3D: Scaling Promptable 3D Detection in the Wild

Add code
Apr 09, 2026
Viaarxiv icon

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Add code
Apr 09, 2026
Viaarxiv icon

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Add code
Feb 22, 2026
Viaarxiv icon

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Add code
Jan 15, 2026
Viaarxiv icon

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

Add code
Sep 27, 2024
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Add code
Jun 15, 2024
Viaarxiv icon

GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

Add code
Apr 11, 2024
Viaarxiv icon