Picture for Hang Xu

Hang Xu

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs

Add code
Feb 28, 2024
Viaarxiv icon

Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics

Add code
Feb 18, 2024
Viaarxiv icon

Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach

Add code
Feb 13, 2024
Figure 1 for Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach
Figure 2 for Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach
Figure 3 for Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach
Figure 4 for Translating Images to Road Network:A Non-Autoregressive Sequence-to-Sequence Approach
Viaarxiv icon

GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data

Add code
Feb 13, 2024
Figure 1 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Figure 2 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Figure 3 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Viaarxiv icon

Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

Add code
Feb 08, 2024
Figure 1 for Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Figure 2 for Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Figure 3 for Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Figure 4 for Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Viaarxiv icon

LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement

Add code
Jan 31, 2024
Viaarxiv icon

Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models

Add code
Jan 02, 2024
Figure 1 for Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Figure 2 for Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Figure 3 for Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Figure 4 for Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Viaarxiv icon

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

Add code
Dec 29, 2023
Figure 1 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 2 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 3 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 4 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Viaarxiv icon

Rotational Augmented Noise2Inverse for Low-dose Computed Tomography Reconstruction

Add code
Dec 19, 2023
Viaarxiv icon

Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning

Add code
Dec 19, 2023
Figure 1 for Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning
Figure 2 for Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning
Figure 3 for Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning
Figure 4 for Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning
Viaarxiv icon