Hui Xiong

Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs

May 15, 2025
Via arXiv

GVPO: Group Variance Policy Optimization for Large Language Model Post-Training

Apr 28, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

Multimodal 3D Genome Pre-training

Apr 12, 2025
Via arXiv

TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning

Apr 11, 2025
Via arXiv

3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting

Apr 02, 2025
Via arXiv

Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding

Mar 17, 2025
Via arXiv

Cognitive Disentanglement for Referring Multi-Object Tracking

Mar 14, 2025
Via arXiv

Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

Mar 13, 2025
Via arXiv

From Understanding to Excelling: Template-Free Algorithm Design through Structural-Functional Co-Evolution

Mar 13, 2025
Via arXiv