Picture for Chengjie Wang

Chengjie Wang

Decision Boundary-aware Knowledge Consolidation Generates Better Instance-Incremental Learner

Add code
Jun 05, 2024
Viaarxiv icon

M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising

Add code
Jun 04, 2024
Viaarxiv icon

NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

Add code
May 31, 2024
Figure 1 for NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models
Figure 2 for NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models
Figure 3 for NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models
Figure 4 for NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models
Viaarxiv icon

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation

Add code
May 28, 2024
Figure 1 for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Figure 2 for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Figure 3 for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Figure 4 for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Viaarxiv icon

AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval

Add code
May 28, 2024
Figure 1 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 2 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 3 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Figure 4 for AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Viaarxiv icon

FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis

Add code
May 24, 2024
Figure 1 for FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Figure 2 for FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Figure 3 for FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Figure 4 for FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
Viaarxiv icon

StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models

Add code
May 24, 2024
Figure 1 for StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Figure 2 for StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Figure 3 for StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Figure 4 for StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models
Viaarxiv icon

PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning

Add code
May 24, 2024
Viaarxiv icon

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Add code
May 21, 2024
Figure 1 for Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Figure 2 for Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Figure 3 for Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Figure 4 for Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Figure 1 for Efficient Multimodal Large Language Models: A Survey
Figure 2 for Efficient Multimodal Large Language Models: A Survey
Figure 3 for Efficient Multimodal Large Language Models: A Survey
Figure 4 for Efficient Multimodal Large Language Models: A Survey
Viaarxiv icon