Picture for Kaiyuan Li

Kaiyuan Li

VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

Add code
Mar 24, 2026
Viaarxiv icon

RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

Add code
Mar 21, 2026
Viaarxiv icon

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Add code
Jan 29, 2026
Viaarxiv icon

VQL: An End-to-End Context-Aware Vector Quantization Attention for Ultra-Long User Behavior Modeling

Add code
Aug 23, 2025
Viaarxiv icon

Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization

Add code
May 28, 2025
Viaarxiv icon

ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts

Add code
May 15, 2025
Viaarxiv icon

CHIME: A Compressive Framework for Holistic Interest Modeling

Add code
Apr 09, 2025
Viaarxiv icon

BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation

Add code
Apr 09, 2025
Figure 1 for BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation
Figure 2 for BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation
Figure 3 for BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation
Figure 4 for BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation
Viaarxiv icon

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Add code
Apr 06, 2025
Viaarxiv icon

PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models

Add code
Feb 20, 2025
Figure 1 for PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models
Figure 2 for PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models
Figure 3 for PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models
Figure 4 for PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models
Viaarxiv icon