Picture for Tsun-Hsuan Wang

Tsun-Hsuan Wang

Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting

Add code
Nov 14, 2024
Figure 1 for Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting
Figure 2 for Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting
Figure 3 for Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting
Figure 4 for Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting
Viaarxiv icon

Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models

Add code
Oct 16, 2024
Figure 1 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 2 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 3 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 4 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Viaarxiv icon

Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference

Add code
Sep 16, 2024
Figure 1 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 2 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 3 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 4 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Viaarxiv icon

ABNet: Attention BarrierNet for Safe and Scalable Robot Learning

Add code
Jun 18, 2024
Figure 1 for ABNet: Attention BarrierNet for Safe and Scalable Robot Learning
Figure 2 for ABNet: Attention BarrierNet for Safe and Scalable Robot Learning
Figure 3 for ABNet: Attention BarrierNet for Safe and Scalable Robot Learning
Figure 4 for ABNet: Attention BarrierNet for Safe and Scalable Robot Learning
Viaarxiv icon

Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models

Add code
Jun 06, 2024
Figure 1 for Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Figure 2 for Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Figure 3 for Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Figure 4 for Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Viaarxiv icon

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Add code
May 16, 2024
Figure 1 for LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
Figure 2 for LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
Figure 3 for LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
Figure 4 for LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
Viaarxiv icon

Probing Multimodal LLMs as World Models for Driving

Add code
May 09, 2024
Figure 1 for Probing Multimodal LLMs as World Models for Driving
Figure 2 for Probing Multimodal LLMs as World Models for Driving
Figure 3 for Probing Multimodal LLMs as World Models for Driving
Figure 4 for Probing Multimodal LLMs as World Models for Driving
Viaarxiv icon

Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications

Add code
Apr 02, 2024
Figure 1 for Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications
Figure 2 for Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications
Figure 3 for Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications
Figure 4 for Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications
Viaarxiv icon

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Add code
Mar 25, 2024
Figure 1 for Grounding Language Plans in Demonstrations Through Counterfactual Perturbations
Figure 2 for Grounding Language Plans in Demonstrations Through Counterfactual Perturbations
Figure 3 for Grounding Language Plans in Demonstrations Through Counterfactual Perturbations
Figure 4 for Grounding Language Plans in Demonstrations Through Counterfactual Perturbations
Viaarxiv icon

Curiosity-driven Red-teaming for Large Language Models

Add code
Feb 29, 2024
Figure 1 for Curiosity-driven Red-teaming for Large Language Models
Figure 2 for Curiosity-driven Red-teaming for Large Language Models
Figure 3 for Curiosity-driven Red-teaming for Large Language Models
Figure 4 for Curiosity-driven Red-teaming for Large Language Models
Viaarxiv icon