"Image": models, code, and papers

SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers

Sep 26, 2023
Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari

Via arXiv

1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report: A Concise Pipeline for Egocentric Hand Pose Reconstruction

Oct 10, 2023
Zhishan Zhou, Zhi Lv, Shihao Zhou, Minqiang Zou, Tong Wu, Mochen Yu, Yao Tang, Jiajun Liang

Via arXiv

Pixel State Value Network for Combined Prediction and Planning in Interactive Environments

Oct 11, 2023
Sascha Rosbach, Stefan M. Leupold, Simon Großjohann, Stefan Roth

Via arXiv

Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Oct 11, 2023
Jungtaek Kim, Jeongbeen Yoon, Minsu Cho

Via arXiv

Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input

Oct 11, 2023
Donglai Xiang, Fabian Prada, Zhe Cao, Kaiwen Guo, Chenglei Wu, Jessica Hodgins, Timur Bagautdinov

Via arXiv

Semantic Scene Difference Detection in Daily Life Patroling by Mobile Robots using Pre-Trained Large-Scale Vision-Language Model

Sep 28, 2023
Yoshiki Obinata, Kento Kawaharazuka, Naoaki Kanazawa, Naoya Yamaguchi, Naoto Tsukamoto, Iori Yanokura, Shingo Kitagawa, Koki Shinjo, Kei Okada, Masayuki Inaba

Via arXiv

LOVECon: Text-driven Training-Free Long Video Editing with ControlNet

Oct 15, 2023
Zhenyi Liao, Zhijie Deng

Via arXiv

FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

Oct 15, 2023
Zhihua Zhong, Jingsen Zhu, Yuxin Dai, Chuankun Zheng, Yuchi Huo, Guanlin Chen, Hujun Bao, Rui Wang

Via arXiv

Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning

Oct 06, 2023
Yinda Chen, Wei Huang, Shenglong Zhou, Qi Chen, Zhiwei Xiong

Via arXiv

ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer

Oct 06, 2023
Yifan Xu, Pourya Shamsolmoali, Jie Yang

Via arXiv