Kaiyuan Li

Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization

May 28, 2025

ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts

May 15, 2025

CHIME: A Compressive Framework for Holistic Interest Modeling

Apr 09, 2025

BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation

Apr 09, 2025

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Apr 06, 2025

PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models

Feb 20, 2025

Understanding and Evaluating Hallucinations in 3D Visual Language Models

Feb 18, 2025

CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants

Nov 14, 2024

WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making

Nov 08, 2024

FedASTA: Federated adaptive spatial-temporal attention for traffic flow prediction

May 21, 2024