Picture for Zhenyang Li

Zhenyang Li

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Add code
Jul 09, 2024
Viaarxiv icon

Training-free CryoET Tomogram Segmentation

Add code
Jul 08, 2024
Viaarxiv icon

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

Add code
May 27, 2024
Figure 1 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 2 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 3 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 4 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Viaarxiv icon

Point Resampling and Ray Transformation Aid to Editable NeRF Models

Add code
May 12, 2024
Figure 1 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 2 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 3 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 4 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Viaarxiv icon

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation

Add code
Dec 22, 2023
Viaarxiv icon

Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers

Add code
Mar 29, 2023
Figure 1 for Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers
Figure 2 for Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers
Figure 3 for Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers
Figure 4 for Unsupervised Anomaly Detection with Local-Sensitive VQVAE and Global-Sensitive Transformers
Viaarxiv icon

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

Add code
Feb 19, 2023
Figure 1 for Learning to Agree on Vision Attention for Visual Commonsense Reasoning
Figure 2 for Learning to Agree on Vision Attention for Visual Commonsense Reasoning
Figure 3 for Learning to Agree on Vision Attention for Visual Commonsense Reasoning
Figure 4 for Learning to Agree on Vision Attention for Visual Commonsense Reasoning
Viaarxiv icon

Alignment-guided Temporal Attention for Video Action Recognition

Add code
Sep 30, 2022
Figure 1 for Alignment-guided Temporal Attention for Video Action Recognition
Figure 2 for Alignment-guided Temporal Attention for Video Action Recognition
Figure 3 for Alignment-guided Temporal Attention for Video Action Recognition
Figure 4 for Alignment-guided Temporal Attention for Video Action Recognition
Viaarxiv icon

Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation

Add code
Jul 14, 2022
Figure 1 for Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation
Figure 2 for Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation
Figure 3 for Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation
Figure 4 for Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation
Viaarxiv icon

Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss

Add code
Jun 21, 2022
Figure 1 for Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss
Figure 2 for Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss
Figure 3 for Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss
Figure 4 for Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss
Viaarxiv icon