Picture for Gang Yu

Gang Yu

Department of Biomedical Engineering, School of Basic Medical Sciences, Central South University, Changsha, China

SERL: Self-Examining Reinforcement Learning on Open-Domain

Add code
Nov 18, 2025
Viaarxiv icon

SenseRay-3D: Generalizable and Physics-Informed Framework for End-to-End Indoor Propagation Modeling

Add code
Nov 15, 2025
Viaarxiv icon

Step-Audio-EditX Technical Report

Add code
Nov 05, 2025
Figure 1 for Step-Audio-EditX Technical Report
Figure 2 for Step-Audio-EditX Technical Report
Figure 3 for Step-Audio-EditX Technical Report
Figure 4 for Step-Audio-EditX Technical Report
Viaarxiv icon

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Add code
Oct 26, 2025
Viaarxiv icon

WithAnyone: Towards Controllable and ID Consistent Image Generation

Add code
Oct 16, 2025
Viaarxiv icon

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Add code
Oct 10, 2025
Viaarxiv icon

pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation

Add code
Sep 19, 2025
Viaarxiv icon

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Add code
Aug 12, 2025
Figure 1 for Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Figure 2 for Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Figure 3 for Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Figure 4 for Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Viaarxiv icon

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning

Add code
Aug 08, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Figure 1 for Step-Audio 2 Technical Report
Figure 2 for Step-Audio 2 Technical Report
Figure 3 for Step-Audio 2 Technical Report
Figure 4 for Step-Audio 2 Technical Report
Viaarxiv icon