Alert button

"Image": models, code, and papers
Alert button

OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

Add code
Bookmark button
Alert button
Feb 29, 2024
Xiaozheng Zheng, Chao Wen, Zhuo Su, Zeran Xu, Zhaohu Li, Yang Zhao, Zhou Xue

Viaarxiv icon

NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors

Mar 05, 2024
Yannan He, Garvita Tiwari, Tolga Birdal, Jan Eric Lenssen, Gerard Pons-Moll

Figure 1 for NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors
Figure 2 for NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors
Figure 3 for NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors
Figure 4 for NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors
Viaarxiv icon

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Mar 05, 2024
Gen Luo, Yiyi Zhou, Yuxin Zhang, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji

Figure 1 for Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
Figure 2 for Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
Figure 3 for Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
Figure 4 for Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
Viaarxiv icon

Matrix Completion with Convex Optimization and Column Subset Selection

Add code
Bookmark button
Alert button
Mar 05, 2024
Antonina Krajewska, Ewa Niewiadomska-Szynkiewicz

Figure 1 for Matrix Completion with Convex Optimization and Column Subset Selection
Figure 2 for Matrix Completion with Convex Optimization and Column Subset Selection
Figure 3 for Matrix Completion with Convex Optimization and Column Subset Selection
Figure 4 for Matrix Completion with Convex Optimization and Column Subset Selection
Viaarxiv icon

Behavior Generation with Latent Actions

Add code
Bookmark button
Alert button
Mar 05, 2024
Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

Figure 1 for Behavior Generation with Latent Actions
Figure 2 for Behavior Generation with Latent Actions
Figure 3 for Behavior Generation with Latent Actions
Figure 4 for Behavior Generation with Latent Actions
Viaarxiv icon

Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation

Mar 05, 2024
Gang Liu, Hongyang Li, Zerui He, Shenjun Zhong

Figure 1 for Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
Figure 2 for Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
Figure 3 for Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
Figure 4 for Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
Viaarxiv icon

Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation

Add code
Bookmark button
Alert button
Mar 05, 2024
Zhekai Du, Xinyao Li, Fengling Li, Ke Lu, Lei Zhu, Jingjing Li

Figure 1 for Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Figure 2 for Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Figure 3 for Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Figure 4 for Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Viaarxiv icon

Learning semantic image quality for fetal ultrasound from noisy ranking annotation

Feb 13, 2024
Manxi Lin, Jakob Ambsdorf, Emilie Pi Fogtmann Sejer, Zahra Bashir, Chun Kit Wong, Paraskevas Pegios, Alberto Raheli, Morten Bo Søndergaard Svendsen, Mads Nielsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

Viaarxiv icon

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Add code
Bookmark button
Alert button
Mar 01, 2024
Xiangxiang Chu, Jianlin Su, Bo Zhang, Chunhua Shen

Figure 1 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 2 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 3 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 4 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Viaarxiv icon

TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding

Add code
Bookmark button
Alert button
Feb 28, 2024
Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang

Viaarxiv icon