Yizhou Yu

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Jul 02, 2025
Via arXiv

Aerial Vision-and-Language Navigation with Grid-based View Selection and Map Construction

Mar 14, 2025
Via arXiv

OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Feb 27, 2025
Via arXiv

SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis

Jan 12, 2025
Via arXiv

Real-time One-Step Diffusion-based Expressive Portrait Videos Generation

Dec 18, 2024
Via arXiv

SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation

Dec 16, 2024
Via arXiv

Enhanced MRI Representation via Cross-series Masking

Dec 10, 2024
Via arXiv

SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks

Sep 15, 2024
Via arXiv

Autoregressive Sequence Modeling for 3D Medical Image Representation

Sep 13, 2024
Via arXiv

LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

Aug 05, 2024
Via arXiv