Picture for Lei Li

Lei Li

Carnegie Mellon University

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Add code
Feb 09, 2026
Viaarxiv icon

FSP-Diff: Full-Spectrum Prior-Enhanced DualDomain Latent Diffusion for Ultra-Low-Dose Spectral CT Reconstruction

Add code
Feb 08, 2026
Viaarxiv icon

PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting

Add code
Jan 31, 2026
Viaarxiv icon

RASST: Fast Cross-modal Retrieval-Augmented Simultaneous Speech Translation

Add code
Jan 30, 2026
Viaarxiv icon

Building Digital Twins of Different Human Organs for Personalized Healthcare

Add code
Jan 16, 2026
Viaarxiv icon

PEMNet: Towards Autonomous and Enhanced Environment-Aware Mobile Networks

Add code
Jan 16, 2026
Viaarxiv icon

A one-step generation model with a Single-Layer Transformer: Layer number re-distillation of FreeFlow

Add code
Jan 14, 2026
Viaarxiv icon

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Add code
Jan 08, 2026
Viaarxiv icon