Rong Xiao

Rethinking Practical and Efficient Quantization Calibration for Vision-Language Models

Feb 08, 2026
Via arXiv

Open-Text Aerial Detection: A Unified Framework For Aerial Visual Grounding And Detection

Feb 08, 2026
Via arXiv

PIO-FVLM: Rethinking Training-Free Visual Token Reduction for VLM Acceleration from an Inference-Objective Perspective

Feb 05, 2026
Via arXiv

Delving into Muon and Beyond: Deep Analysis and Extensions

Feb 04, 2026
Via arXiv

SimpleGPT: Improving GPT via A Simple Normalization Strategy

Feb 01, 2026
Via arXiv

LLM-I2I: Boost Your Small Item2Item Recommendation Model with Large Language Model

Dec 25, 2025
Via arXiv

UVLM: Benchmarking Video Language Model for Underwater World Understanding

Jul 03, 2025
Via arXiv

Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning

Jun 09, 2025
Via arXiv

MiniMax-Remover: Taming Bad Noise Helps Video Object Removal

May 30, 2025
Via arXiv

Taming Transformer Without Using Learning Rate Warmup

May 28, 2025
Via arXiv