"Image": models, code, and papers

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

Mar 28, 2024
Wei Dong, Xing Zhang, Bihui Chen, Dawei Yan, Zhijun Lin, Qingsen Yan, Peng Wang, Yang Yang

OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion

Mar 28, 2024
Xinyu Zhan, Lixin Yang, Yifei Zhao, Kangrui Mao, Hanlin Xu, Zenan Lin, Kailin Li, Cewu Lu

BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image

Mar 13, 2024
Minje Kim, Tae-Kyun Kim

Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching

Mar 27, 2024
Jannis Chemseddine, Paul Hagemann, Christian Wald, Gabriele Steidl

Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Mar 27, 2024
Zhiming Mao, Haoli Bai, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu, Kam-Fai Wong

Semi-Supervised Learning for Deep Causal Generative Models

Mar 27, 2024
Yasin Ibrahim, Hermione Warr, Konstantinos Kamnitsas

A Recommender System for NFT Collectibles with Item Feature

Mar 27, 2024
Minjoo Choi, Seonmi Kim, Yejin Kim, Youngbin Lee, Joohwan Hong, Yongjae Lee

Illicit object detection in X-ray images using Vision Transformers

Mar 27, 2024
Jorgen Cani, Ioannis Mademlis, Adamantia Anna Rebolledo Chrysochoou, Georgios Th. Papadopoulos

A Survey on Large Language Models from Concept to Implementation

Mar 27, 2024
Chen Wang, Jin Zhao, Jiaqi Gong

Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression

Mar 11, 2024
Cao Zhi, Bao Youneng, Meng Fanyang, Li Chao, Tan Wen, Wang Genhong, Liang Yongsheng
