Picture for Wanli Ouyang

Wanli Ouyang

School of Electrical and Information Engineering, The University of Sydney, Australia

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

Add code
Sep 10, 2024
Viaarxiv icon

GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI

Add code
Sep 02, 2024
Figure 1 for GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI
Figure 2 for GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI
Figure 3 for GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI
Figure 4 for GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI
Viaarxiv icon

MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling

Add code
Aug 20, 2024
Figure 1 for MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling
Figure 2 for MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling
Figure 3 for MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling
Figure 4 for MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling
Viaarxiv icon

NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction

Add code
Aug 19, 2024
Viaarxiv icon

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

Add code
Aug 16, 2024
Figure 1 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 2 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 3 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 4 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Viaarxiv icon

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Add code
Aug 14, 2024
Figure 1 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 2 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 3 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 4 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Viaarxiv icon

Fast Information Streaming Handler (FisH): A Unified Seismic Neural Network for Single Station Real-Time Earthquake Early Warning

Add code
Aug 13, 2024
Figure 1 for Fast Information Streaming Handler (FisH): A Unified Seismic Neural Network for Single Station Real-Time Earthquake Early Warning
Figure 2 for Fast Information Streaming Handler (FisH): A Unified Seismic Neural Network for Single Station Real-Time Earthquake Early Warning
Figure 3 for Fast Information Streaming Handler (FisH): A Unified Seismic Neural Network for Single Station Real-Time Earthquake Early Warning
Figure 4 for Fast Information Streaming Handler (FisH): A Unified Seismic Neural Network for Single Station Real-Time Earthquake Early Warning
Viaarxiv icon

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation

Add code
Jul 21, 2024
Figure 1 for Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
Figure 2 for Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
Figure 3 for Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
Figure 4 for Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation
Viaarxiv icon

VegeDiff: Latent Diffusion Model for Geospatial Vegetation Forecasting

Add code
Jul 17, 2024
Viaarxiv icon

TCFormer: Visual Recognition via Token Clustering Transformer

Add code
Jul 16, 2024
Viaarxiv icon