Zhenyang Li

Enhanced Velocity Field Modeling for Gaussian Video Reconstruction

Jul 31, 2025
Via arXiv

Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning

May 21, 2025
Via arXiv

Evaluating Model Robustness Using Adaptive Sparse L0 Regularization

Aug 28, 2024
Via arXiv

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Jul 09, 2024
Via arXiv

Training-free CryoET Tomogram Segmentation

Jul 08, 2024
Via arXiv

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

May 27, 2024
Via arXiv

Point Resampling and Ray Transformation Aid to Editable NeRF Models

May 12, 2024
Via arXiv

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation

Dec 22, 2023
Via arXiv

Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers

Mar 29, 2023
Via arXiv

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

Feb 19, 2023
Via arXiv