Alert button

"Image": models, code, and papers
Alert button

Pedestrian Tracking with Monocular Camera using Unconstrained 3D Motion Model

Mar 18, 2024
Jan Krejčí, Oliver Kost, Ondřej Straka, Jindřich Duník

Viaarxiv icon

Differentially Private Representation Learning via Image Captioning

Mar 04, 2024
Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika Chaudhuri, Chuan Guo

Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mar 14, 2024
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Viaarxiv icon

ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction

Mar 06, 2024
Luke Lozenski, Refik Mert Cam, Mark A. Anastasio, Umberto Villa

Figure 1 for ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction
Figure 2 for ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction
Figure 3 for ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction
Figure 4 for ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction
Viaarxiv icon

SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Mar 17, 2024
Guohao Sun, Can Qin, Jiamian Wang, Zeyuan Chen, Ran Xu, Zhiqiang Tao

Viaarxiv icon

Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction

Mar 14, 2024
Hanyu Chen, Zhixiu Hao, Lin Guo, Liying Xiao

Viaarxiv icon

PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering

Mar 08, 2024
Yibin Wang, Weizhong Zhang, Jianwei Zheng, Cheng Jin

Figure 1 for PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Figure 2 for PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Figure 3 for PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Figure 4 for PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Viaarxiv icon

A Simple Framework Uniting Visual In-context Learning with Masked Image Modeling to Improve Ultrasound Segmentation

Mar 08, 2024
Yuyue Zhou, Banafshe Felfeliyan, Shrimanti Ghosh, Jessica Knight, Fatima Alves-Pereira, Christopher Keen, Jessica Küpper, Abhilash Rakkunedeth Hareendranathan, Jacob L. Jaremko

Viaarxiv icon

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

Mar 19, 2024
M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuhene, Horst Possegger

Viaarxiv icon

Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs

Mar 19, 2024
Raphael Norman-Tenazas, David Kleinberg, Erik C. Johnson, Daniel P. Lathrop, Matthew J. Roos

Viaarxiv icon