Picture for Gao Huang

Gao Huang

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Add code
May 07, 2025
Figure 1 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 2 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 3 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 4 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Viaarxiv icon

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Add code
Apr 18, 2025
Viaarxiv icon

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Add code
Apr 18, 2025
Figure 1 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 2 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 3 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 4 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Viaarxiv icon

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Add code
Apr 17, 2025
Figure 1 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 2 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 3 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 4 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Viaarxiv icon

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Add code
Apr 09, 2025
Viaarxiv icon

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Add code
Mar 13, 2025
Viaarxiv icon

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Add code
Mar 05, 2025
Figure 1 for Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Figure 2 for Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Figure 3 for Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Figure 4 for Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Viaarxiv icon

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding

Add code
Feb 26, 2025
Viaarxiv icon

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Add code
Feb 25, 2025
Viaarxiv icon

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Add code
Dec 20, 2024
Viaarxiv icon