Picture for Lizhuang Ma

Lizhuang Ma

DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

Add code
Jul 11, 2024
Viaarxiv icon

Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image

Add code
Jun 24, 2024
Viaarxiv icon

PIG: Prompt Images Guidance for Night-Time Scene Parsing

Add code
Jun 15, 2024
Figure 1 for PIG: Prompt Images Guidance for Night-Time Scene Parsing
Figure 2 for PIG: Prompt Images Guidance for Night-Time Scene Parsing
Figure 3 for PIG: Prompt Images Guidance for Night-Time Scene Parsing
Figure 4 for PIG: Prompt Images Guidance for Night-Time Scene Parsing
Viaarxiv icon

FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping

Add code
Jun 04, 2024
Viaarxiv icon

M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising

Add code
Jun 04, 2024
Figure 1 for M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Figure 2 for M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Figure 3 for M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Figure 4 for M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Viaarxiv icon

Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models

Add code
May 27, 2024
Figure 1 for Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models
Figure 2 for Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models
Figure 3 for Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models
Figure 4 for Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models
Viaarxiv icon

FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis

Add code
May 24, 2024
Viaarxiv icon

GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision

Add code
May 17, 2024
Figure 1 for GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Figure 2 for GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Figure 3 for GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Figure 4 for GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Figure 1 for Efficient Multimodal Large Language Models: A Survey
Figure 2 for Efficient Multimodal Large Language Models: A Survey
Figure 3 for Efficient Multimodal Large Language Models: A Survey
Figure 4 for Efficient Multimodal Large Language Models: A Survey
Viaarxiv icon

MotionMaster: Training-free Camera Motion Transfer For Video Generation

Add code
May 01, 2024
Figure 1 for MotionMaster: Training-free Camera Motion Transfer For Video Generation
Figure 2 for MotionMaster: Training-free Camera Motion Transfer For Video Generation
Figure 3 for MotionMaster: Training-free Camera Motion Transfer For Video Generation
Figure 4 for MotionMaster: Training-free Camera Motion Transfer For Video Generation
Viaarxiv icon