Picture for Chuang Gan

Chuang Gan

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

Add code
Jul 16, 2024
Viaarxiv icon

ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs

Add code
Jun 12, 2024
Viaarxiv icon

CoNav: A Benchmark for Human-Centered Collaborative Navigation

Add code
Jun 04, 2024
Viaarxiv icon

Physically Compatible 3D Object Modeling from a Single Image

Add code
Jun 03, 2024
Figure 1 for Physically Compatible 3D Object Modeling from a Single Image
Figure 2 for Physically Compatible 3D Object Modeling from a Single Image
Figure 3 for Physically Compatible 3D Object Modeling from a Single Image
Figure 4 for Physically Compatible 3D Object Modeling from a Single Image
Viaarxiv icon

RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text

Add code
May 30, 2024
Viaarxiv icon

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

Add code
May 17, 2024
Figure 1 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 2 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 3 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 4 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Viaarxiv icon

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Add code
May 16, 2024
Viaarxiv icon

STAR: A Benchmark for Situated Reasoning in Real-World Videos

Add code
May 15, 2024
Figure 1 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 2 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 3 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 4 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Viaarxiv icon

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Add code
May 07, 2024
Figure 1 for QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Figure 2 for QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Figure 3 for QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Figure 4 for QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Viaarxiv icon

RoboDreamer: Learning Compositional World Models for Robot Imagination

Add code
Apr 18, 2024
Viaarxiv icon