Gao Huang

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Apr 18, 2025
Via arXiv

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Apr 18, 2025
Via arXiv

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Apr 17, 2025
Via arXiv

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Apr 09, 2025
Via arXiv

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Mar 13, 2025
Via arXiv

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Mar 05, 2025
Via arXiv

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding

Feb 26, 2025
Via arXiv

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Feb 25, 2025
Via arXiv

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Dec 20, 2024
Via arXiv

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

Dec 15, 2024
Via arXiv