Picture for Ji Li

Ji Li

InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation

Add code
May 14, 2026
Viaarxiv icon

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Add code
May 12, 2026
Viaarxiv icon

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Add code
Apr 16, 2026
Viaarxiv icon

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Add code
Mar 26, 2026
Viaarxiv icon

SpecForge: A Flexible and Efficient Open-Source Training Framework for Speculative Decoding

Add code
Mar 19, 2026
Viaarxiv icon

Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers

Add code
Mar 11, 2026
Viaarxiv icon

Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective

Add code
Mar 01, 2026
Viaarxiv icon

Improving MLLMs in Embodied Exploration and Question Answering with Human-Inspired Memory Modeling

Add code
Feb 17, 2026
Viaarxiv icon

DiffPlace: Street View Generation via Place-Controllable Diffusion Model Enhancing Place Recognition

Add code
Feb 12, 2026
Viaarxiv icon

ReLayout: Versatile and Structure-Preserving Design Layout Editing via Relation-Aware Design Reconstruction

Add code
Feb 01, 2026
Viaarxiv icon