Ce Zhang

SiLVR: A Simple Language-based Video Reasoning Framework

May 30, 2025
Via arXiv

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

May 28, 2025
Via arXiv

HAMburger: Accelerating LLM Inference via Token Smashing

May 26, 2025
Via arXiv

InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

May 23, 2025
Via arXiv

Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation

May 21, 2025
Via arXiv

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

May 05, 2025
Via arXiv

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Apr 18, 2025
Via arXiv

Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation

Apr 17, 2025
Via arXiv

MLKV: Efficiently Scaling up Large Embedding Model Training with Disk-based Key-Value Storage

Apr 02, 2025
Via arXiv

BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

Mar 26, 2025
Via arXiv