Picture for Zhen Li

Zhen Li

LMO, CELESTE, HEC Paris

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Viaarxiv icon

Multi-Sourced Compositional Generalization in Visual Question Answering

Add code
May 29, 2025
Viaarxiv icon

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

Respond to Change with Constancy: Instruction-tuning with LLM for Non-I.I.D. Network Traffic Classification

Add code
May 27, 2025
Viaarxiv icon

ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation

Add code
May 24, 2025
Viaarxiv icon

MAGE: A Multi-task Architecture for Gaze Estimation with an Efficient Calibration Module

Add code
May 22, 2025
Viaarxiv icon

InfiJanice: Joint Analysis and In-situ Correction Engine for Quantization-Induced Math Degradation in Large Language Models

Add code
May 16, 2025
Viaarxiv icon

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Add code
Apr 10, 2025
Viaarxiv icon

OmniCaptioner: One Captioner to Rule Them All

Add code
Apr 09, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon