Wei Liu

Peter

VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control

Dec 30, 2024

Diving into Self-Evolving Training for Multimodal Reasoning

Dec 23, 2024

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

Dec 19, 2024

A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options

Dec 14, 2024

Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm

Dec 14, 2024

A Decade of Deep Learning: A Survey on The Magnificent Seven

Dec 13, 2024

STIV: Scalable Text and Image Conditioned Video Generation

Dec 10, 2024

KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models

Dec 07, 2024

Mix-Modality Person Re-Identification: A New and Practical Paradigm

Dec 06, 2024

Exploring the Generalization Capabilities of AID-based Bi-level Optimization

Nov 25, 2024