Wenjun Zeng

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

Dec 09, 2024
Via arXiv

UniScene: Unified Occupancy-centric Driving Scene Generation

Dec 06, 2024
Via arXiv

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations

Oct 17, 2024
Via arXiv

Open-World Reinforcement Learning over Long Short-Term Imagination

Oct 04, 2024
Via arXiv

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation

Oct 01, 2024
Via arXiv

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

Aug 16, 2024
Via arXiv

ShieldGemma: Generative AI Content Moderation Based on Gemma

Jul 31, 2024
Via arXiv

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models

Jul 26, 2024
Via arXiv

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Jul 17, 2024
Via arXiv

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Jul 16, 2024
Via arXiv