Alert button

"Image": models, code, and papers
Alert button

Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

Feb 07, 2024
Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang

Viaarxiv icon

Robust Inverse Graphics via Probabilistic Inference

Feb 02, 2024
Tuan Anh Le, Pavel Sountsov, Matthew D. Hoffman, Ben Lee, Brian Patton, Rif A. Saurous

Viaarxiv icon

PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation

Add code
Bookmark button
Alert button
Jan 15, 2024
Jiahui Zhong, Wenhong Tian, Yuanlun Xie, Zhijia Liu, Jie Ou, Taoran Tian, Lei Zhang

Viaarxiv icon

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Jan 11, 2024
Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang

Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Jan 30, 2024
Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

High-Fidelity Diffusion-based Image Editing

Jan 04, 2024
Chen Hou, Guoqiang Wei, Zhibo Chen

Viaarxiv icon

LATENTPATCH: A Non-Parametric Approach for Face Generation and Editing

Jan 30, 2024
Benjamin Samuth, Julien Rabin, David Tschumperlé, Frédéric Jurie

Viaarxiv icon

AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error

Jan 31, 2024
Jonas Ricker, Denis Lukovnikov, Asja Fischer

Viaarxiv icon

Pixel-Wise Color Constancy via Smoothness Techniques in Multi-Illuminant Scenes

Feb 05, 2024
Umut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj

Viaarxiv icon

Vision-Language Models Provide Promptable Representations for Reinforcement Learning

Feb 05, 2024
William Chen, Oier Mees, Aviral Kumar, Sergey Levine

Viaarxiv icon