Guo-Jun Qi

S2AFormer: Strip Self-Attention for Efficient Vision Transformer

May 28, 2025

Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models

May 15, 2025

Self-Guidance: Boosting Flow and Diffusion Generation on Their Own

Dec 08, 2024

Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation

Dec 02, 2024

SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation

Nov 26, 2024

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Aug 07, 2024

Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction

May 30, 2024

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

May 28, 2024

PoseAnimate: Zero-shot high fidelity pose controllable character animation

Apr 30, 2024

BARET : Balanced Attention based Real image Editing driven by Target-text Inversion

Dec 09, 2023