Jing Gu

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Jul 17, 2025
Via arXiv

Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models

Jun 04, 2025
Via arXiv

Constructing a 3D Town from a Single Image

May 21, 2025
Via arXiv

Reconfigurable legged metamachines that run on autonomous modular legs

May 01, 2025
Via arXiv

Klein Model for Hyperbolic Neural Networks

Oct 22, 2024
Via arXiv

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Oct 15, 2024
Via arXiv

EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Oct 03, 2024
Via arXiv

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Jun 25, 2024
Via arXiv

VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing

Jun 18, 2024
Via arXiv

The AI Collaborator: Bridging Human-AI Interaction in Educational and Professional Settings

May 16, 2024
Via arXiv