Picture for Si Li

Si Li

Towards Deeper Emotional Reflection: Crafting Affective Image Filters with Generative Priors

Add code
Dec 19, 2025
Viaarxiv icon

STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative

Add code
Dec 13, 2025
Viaarxiv icon

PolarAnything: Diffusion-based Polarimetric Image Synthesis

Add code
Jul 24, 2025
Viaarxiv icon

Audio-Sync Video Generation with Multi-Stream Temporal Control

Add code
Jun 09, 2025
Viaarxiv icon

Affective Image Editing: Shaping Emotional Factors via Text Descriptions

Add code
May 24, 2025
Viaarxiv icon

M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?

Add code
Mar 27, 2025
Figure 1 for M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Figure 2 for M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Figure 3 for M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Figure 4 for M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Viaarxiv icon

VIRES: Video Instance Repainting with Sketch and Text Guidance

Add code
Nov 26, 2024
Figure 1 for VIRES: Video Instance Repainting with Sketch and Text Guidance
Figure 2 for VIRES: Video Instance Repainting with Sketch and Text Guidance
Figure 3 for VIRES: Video Instance Repainting with Sketch and Text Guidance
Figure 4 for VIRES: Video Instance Repainting with Sketch and Text Guidance
Viaarxiv icon

Smart Audit System Empowered by LLM

Add code
Oct 10, 2024
Figure 1 for Smart Audit System Empowered by LLM
Figure 2 for Smart Audit System Empowered by LLM
Figure 3 for Smart Audit System Empowered by LLM
Figure 4 for Smart Audit System Empowered by LLM
Viaarxiv icon

L-C4: Language-Based Video Colorization for Creative and Consistent Color

Add code
Oct 07, 2024
Figure 1 for L-C4: Language-Based Video Colorization for Creative and Consistent Color
Figure 2 for L-C4: Language-Based Video Colorization for Creative and Consistent Color
Figure 3 for L-C4: Language-Based Video Colorization for Creative and Consistent Color
Figure 4 for L-C4: Language-Based Video Colorization for Creative and Consistent Color
Viaarxiv icon

Frequency-regularized Neural Representation Method for Sparse-view Tomographic Reconstruction

Add code
Sep 22, 2024
Viaarxiv icon