Picture for Hui Xiong

Hui Xiong

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents

Add code
Feb 26, 2026
Viaarxiv icon

Turning Semantics into Topology: LLM-Driven Attribute Augmentation for Collaborative Filtering

Add code
Feb 24, 2026
Viaarxiv icon

VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model

Add code
Feb 10, 2026
Viaarxiv icon

AutoFly: Vision-Language-Action Model for UAV Autonomous Navigation in the Wild

Add code
Feb 10, 2026
Viaarxiv icon

Towards Generalizable Reasoning: Group Causal Counterfactual Policy Optimization for LLM Reasoning

Add code
Feb 06, 2026
Viaarxiv icon

On the Plasticity and Stability for Post-Training Large Language Models

Add code
Feb 06, 2026
Viaarxiv icon

Causal Front-Door Adjustment for Robust Jailbreak Attacks on LLMs

Add code
Feb 05, 2026
Viaarxiv icon

PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers

Add code
Feb 03, 2026
Viaarxiv icon

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

A Brain-inspired Embodied Intelligence for Fluid and Fast Reflexive Robotics Control

Add code
Jan 21, 2026
Viaarxiv icon