Picture for Hongxia Yang

Hongxia Yang

InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models

Add code
Sep 26, 2025
Viaarxiv icon

InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios

Add code
Sep 26, 2025
Viaarxiv icon

InfiMed-Foundation: Pioneering Advanced Multimodal Medical Models with Compute-Efficient Pre-Training and Multi-Stage Fine-Tuning

Add code
Sep 26, 2025
Viaarxiv icon

InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities

Add code
Aug 07, 2025
Viaarxiv icon

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Add code
Aug 06, 2025
Viaarxiv icon

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Add code
May 29, 2025
Viaarxiv icon

Infi-Med: Low-Resource Medical MLLMs with Robust Reasoning Evaluation

Add code
May 29, 2025
Viaarxiv icon

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

Add code
May 20, 2025
Viaarxiv icon

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

Add code
May 20, 2025
Viaarxiv icon