Picture for Jing Gu

Jing Gu

Self-Evolving 3D Scene Generation from a Single Image

Add code
Dec 09, 2025
Viaarxiv icon

WALT: Web Agents that Learn Tools

Add code
Oct 01, 2025
Viaarxiv icon

SCUBA: Salesforce Computer Use Benchmark

Add code
Sep 30, 2025
Figure 1 for SCUBA: Salesforce Computer Use Benchmark
Figure 2 for SCUBA: Salesforce Computer Use Benchmark
Figure 3 for SCUBA: Salesforce Computer Use Benchmark
Figure 4 for SCUBA: Salesforce Computer Use Benchmark
Viaarxiv icon

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Add code
Jul 17, 2025
Viaarxiv icon

Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models

Add code
Jun 04, 2025
Viaarxiv icon

Constructing a 3D Town from a Single Image

Add code
May 21, 2025
Viaarxiv icon

Reconfigurable legged metamachines that run on autonomous modular legs

Add code
May 01, 2025
Figure 1 for Reconfigurable legged metamachines that run on autonomous modular legs
Figure 2 for Reconfigurable legged metamachines that run on autonomous modular legs
Figure 3 for Reconfigurable legged metamachines that run on autonomous modular legs
Figure 4 for Reconfigurable legged metamachines that run on autonomous modular legs
Viaarxiv icon

Klein Model for Hyperbolic Neural Networks

Add code
Oct 22, 2024
Viaarxiv icon

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Add code
Oct 15, 2024
Figure 1 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 2 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 3 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 4 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Viaarxiv icon

EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Add code
Oct 03, 2024
Figure 1 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 2 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 3 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 4 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Viaarxiv icon