Picture for Lei Zhang

Lei Zhang

Sid

Prompt-Free Conditional Diffusion for Multi-object Image Augmentation

Add code
Jul 08, 2025
Figure 1 for Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
Figure 2 for Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
Figure 3 for Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
Figure 4 for Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
Viaarxiv icon

One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution

Add code
Jun 18, 2025
Figure 1 for One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
Figure 2 for One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
Figure 3 for One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
Figure 4 for One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
Viaarxiv icon

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Add code
Jun 11, 2025
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon

Training Superior Sparse Autoencoders for Instruct Models

Add code
Jun 09, 2025
Viaarxiv icon

Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing

Add code
Jun 08, 2025
Figure 1 for Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing
Figure 2 for Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing
Figure 3 for Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing
Figure 4 for Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing
Viaarxiv icon

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Add code
Jun 05, 2025
Figure 1 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 2 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 3 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 4 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Viaarxiv icon

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Add code
Jun 04, 2025
Viaarxiv icon

MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM

Add code
May 30, 2025
Viaarxiv icon

Iterative Corpus Refinement for Materials Property Prediction Based on Scientific Texts

Add code
May 27, 2025
Viaarxiv icon