Baiyang Song

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Mar 24, 2026
Via arXiv

KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs

Feb 03, 2026
Via arXiv

Grounded Chain-of-Thought for Multimodal Large Language Models

Mar 17, 2025
Via arXiv

KALE-LM: Unleash The Power Of AI For Science Via Knowledge And Logic Enhanced Large Model

Sep 27, 2024
Via arXiv