Picture for Jing Gu

Jing Gu

Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models

Add code
Jun 04, 2025
Viaarxiv icon

Constructing a 3D Town from a Single Image

Add code
May 21, 2025
Viaarxiv icon

Reconfigurable legged metamachines that run on autonomous modular legs

Add code
May 01, 2025
Viaarxiv icon

Klein Model for Hyperbolic Neural Networks

Add code
Oct 22, 2024
Viaarxiv icon

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Add code
Oct 15, 2024
Figure 1 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 2 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 3 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 4 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Viaarxiv icon

EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Add code
Oct 03, 2024
Figure 1 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 2 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 3 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Figure 4 for EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing
Viaarxiv icon

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Add code
Jun 25, 2024
Figure 1 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 2 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 3 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 4 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Viaarxiv icon

VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing

Add code
Jun 18, 2024
Figure 1 for VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Figure 2 for VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Figure 3 for VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Figure 4 for VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Viaarxiv icon

The AI Collaborator: Bridging Human-AI Interaction in Educational and Professional Settings

Add code
May 16, 2024
Viaarxiv icon

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

Add code
Apr 08, 2024
Figure 1 for SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Figure 2 for SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Figure 3 for SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Figure 4 for SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Viaarxiv icon