Zigang Geng

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Jul 29, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

May 21, 2025

Equivariant Image Modeling

Mar 24, 2025

Tokenize Image as a Set

Mar 20, 2025

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Sep 07, 2023

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Aug 08, 2023

Human Pose as Compositional Tokens

Mar 21, 2023

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token

Jan 05, 2023

Revealing the Dark Secrets of Masked Image Modeling

May 27, 2022

Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression

Apr 06, 2021