Alert button

"Image": models, code, and papers
Alert button

Cross-view and Cross-pose Completion for 3D Human Understanding

Nov 15, 2023
Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

Viaarxiv icon

USLR: an open-source tool for unbiased and smooth longitudinal registration of brain MR

Nov 14, 2023
Adrià Casamitjana, Roser Sala-Llonch, Karim Lekadir, Juan Eugenio Iglesias

Viaarxiv icon

Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale

Nov 14, 2023
Robert Harb, Thomas Pock, Heimo Müller

Viaarxiv icon

Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision

Nov 14, 2023
Seongyun Lee, Sue Hyun Park, Yongrae Jo, Minjoon Seo

Viaarxiv icon

Diffusion Model Alignment Using Direct Preference Optimization

Nov 21, 2023
Bram Wallace, Meihua Dang, Rafael Rafailov, Linqi Zhou, Aaron Lou, Senthil Purushwalkam, Stefano Ermon, Caiming Xiong, Shafiq Joty, Nikhil Naik

Viaarxiv icon

SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints

Nov 19, 2023
Aditya Nalgunda Ganesh

Figure 1 for SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Figure 2 for SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Figure 3 for SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Figure 4 for SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Viaarxiv icon

Tiny-VBF: Resource-Efficient Vision Transformer based Lightweight Beamformer for Ultrasound Single-Angle Plane Wave Imaging

Nov 20, 2023
Abdul Rahoof, Vivek Chaturvedi, Mahesh Raveendranatha Panicker, Muhammad Shafique

Viaarxiv icon

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding

Nov 20, 2023
Hao Li, Dingwen Zhang, Yalun Dai, Nian Liu, Lechao Cheng, Jingfeng Li, Jingdong Wang, Junwei Han

Figure 1 for GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Figure 2 for GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Figure 3 for GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Figure 4 for GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Viaarxiv icon

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale

Oct 18, 2023
Qichao Wang, Tian Bian, Yian Yin, Tingyang Xu, Hong Cheng, Helen M. Meng, Zibin Zheng, Liang Chen, Bingzhe Wu

Figure 1 for Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Figure 2 for Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Figure 3 for Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Figure 4 for Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Viaarxiv icon

Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)

Nov 22, 2023
Ludovic Pirard

Figure 1 for Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)
Figure 2 for Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)
Figure 3 for Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)
Figure 4 for Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)
Viaarxiv icon