Picture for Shufan Li

Shufan Li

From Masks to Worlds: A Hitchhiker's Guide to World Models

Add code
Oct 23, 2025
Viaarxiv icon

PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction

Add code
Jun 18, 2025
Viaarxiv icon

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Add code
May 22, 2025
Viaarxiv icon

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Add code
Mar 15, 2025
Viaarxiv icon

MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants

Add code
Dec 17, 2024
Viaarxiv icon

OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Add code
Dec 02, 2024
Figure 1 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 2 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 3 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 4 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Viaarxiv icon

Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

Add code
Nov 25, 2024
Figure 1 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 2 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 3 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 4 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Viaarxiv icon

SegLLM: Multi-round Reasoning Segmentation

Add code
Oct 24, 2024
Figure 1 for SegLLM: Multi-round Reasoning Segmentation
Figure 2 for SegLLM: Multi-round Reasoning Segmentation
Figure 3 for SegLLM: Multi-round Reasoning Segmentation
Figure 4 for SegLLM: Multi-round Reasoning Segmentation
Viaarxiv icon

PopAlign: Population-Level Alignment for Fair Text-to-Image Generation

Add code
Jun 28, 2024
Viaarxiv icon

Aligning Diffusion Models by Optimizing Human Utility

Add code
Apr 06, 2024
Figure 1 for Aligning Diffusion Models by Optimizing Human Utility
Figure 2 for Aligning Diffusion Models by Optimizing Human Utility
Figure 3 for Aligning Diffusion Models by Optimizing Human Utility
Figure 4 for Aligning Diffusion Models by Optimizing Human Utility
Viaarxiv icon