Jun Zhu

Tsinghua University

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Nov 14, 2024

MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue

Nov 06, 2024

ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation

Nov 04, 2024

Consistency Diffusion Bridge Models

Oct 31, 2024

Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images

Oct 31, 2024

Decentralized Hybrid Precoding for Massive MU-MIMO ISAC

Oct 21, 2024

FrameBridge: Improving Image-to-Video Generation with Bridge Models

Oct 20, 2024

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Oct 17, 2024

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Oct 12, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Oct 10, 2024