Zhen Li

LMO, CELESTE, HEC Paris

Empowering Large Language Models with 3D Situation Awareness

Mar 29, 2025

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Mar 27, 2025

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Mar 27, 2025

AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

Mar 17, 2025

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Mar 14, 2025

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Mar 13, 2025

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

Mar 06, 2025

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Mar 05, 2025

A General Framework to Enhance Fine-tuning-based LLM Unlearning

Feb 25, 2025

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs

Feb 25, 2025