Picture for Linjie Li

Linjie Li

Synthetic Visual Genome

Add code
Jun 09, 2025
Viaarxiv icon

Audio-Aware Large Language Models as Judges for Speaking Styles

Add code
Jun 06, 2025
Viaarxiv icon

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Add code
Jun 05, 2025
Viaarxiv icon

Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT

Add code
May 30, 2025
Viaarxiv icon

Are Unified Vision-Language Models Necessary: Generalization Across Understanding and Generation

Add code
May 29, 2025
Viaarxiv icon

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Add code
May 26, 2025
Viaarxiv icon

Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning

Add code
May 26, 2025
Viaarxiv icon

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Add code
May 13, 2025
Viaarxiv icon

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Add code
Apr 24, 2025
Viaarxiv icon

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Add code
Apr 10, 2025
Viaarxiv icon