Picture for Wanli Ouyang

Wanli Ouyang

School of Electrical and Information Engineering, The University of Sydney, Australia

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Add code
Oct 17, 2024
Figure 1 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 2 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 3 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 4 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Viaarxiv icon

A CLIP-Powered Framework for Robust and Generalizable Data Selection

Add code
Oct 15, 2024
Figure 1 for A CLIP-Powered Framework for Robust and Generalizable Data Selection
Figure 2 for A CLIP-Powered Framework for Robust and Generalizable Data Selection
Figure 3 for A CLIP-Powered Framework for Robust and Generalizable Data Selection
Figure 4 for A CLIP-Powered Framework for Robust and Generalizable Data Selection
Viaarxiv icon

Depth Any Video with Scalable Synthetic Data

Add code
Oct 14, 2024
Figure 1 for Depth Any Video with Scalable Synthetic Data
Figure 2 for Depth Any Video with Scalable Synthetic Data
Figure 3 for Depth Any Video with Scalable Synthetic Data
Figure 4 for Depth Any Video with Scalable Synthetic Data
Viaarxiv icon

Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation

Add code
Oct 12, 2024
Figure 1 for Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation
Figure 2 for Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation
Figure 3 for Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation
Figure 4 for Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation
Viaarxiv icon

Diffusion Models Need Visual Priors for Image Generation

Add code
Oct 11, 2024
Figure 1 for Diffusion Models Need Visual Priors for Image Generation
Figure 2 for Diffusion Models Need Visual Priors for Image Generation
Figure 3 for Diffusion Models Need Visual Priors for Image Generation
Figure 4 for Diffusion Models Need Visual Priors for Image Generation
Viaarxiv icon

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Add code
Oct 09, 2024
Viaarxiv icon

HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction

Add code
Oct 08, 2024
Figure 1 for HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
Figure 2 for HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
Figure 3 for HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
Figure 4 for HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
Viaarxiv icon

PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling

Add code
Oct 08, 2024
Figure 1 for PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling
Figure 2 for PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling
Figure 3 for PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling
Figure 4 for PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling
Viaarxiv icon

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Add code
Oct 03, 2024
Viaarxiv icon

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

Add code
Sep 10, 2024
Viaarxiv icon