Alert button

"Image": models, code, and papers
Alert button

Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering

Jan 28, 2024
Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge

Viaarxiv icon

Lumiere: A Space-Time Diffusion Model for Video Generation

Add code
Bookmark button
Alert button
Jan 23, 2024
Omer Bar-Tal, Hila Chefer, Omer Tov, Charles Herrmann, Roni Paiss, Shiran Zada, Ariel Ephrat, Junhwa Hur, Yuanzhen Li, Tomer Michaeli, Oliver Wang, Deqing Sun, Tali Dekel, Inbar Mosseri

Viaarxiv icon

Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes

Add code
Bookmark button
Alert button
Jan 29, 2024
Raphael Lafargue, Yassir Bendou, Bastien Pasdeloup, Jean-Philippe Diguet, Ian Reid, Vincent Gripon, Jack Valmadre

Viaarxiv icon

Democratizing the Creation of Animatable Facial Avatars

Jan 29, 2024
Yilin Zhu, Dalton Omens, Haodi He, Ron Fedkiw

Viaarxiv icon

Single-View 3D Human Digitalization with Large Reconstruction Models

Jan 22, 2024
Zhenzhen Weng, Jingyuan Liu, Hao Tan, Zhan Xu, Yang Zhou, Serena Yeung-Levy, Jimei Yang

Viaarxiv icon

Integrating 3D Slicer with a Dynamic Simulator for Situational Aware Robotic Interventions

Jan 22, 2024
Manish Sahu, Hisashi Ishida, Laura Connolly, Hongyi Fan, Anton Deguet, Peter Kazanzides, Francis X. Creighton, Russell H. Taylor, Adnan Munawar

Viaarxiv icon

CLASS-M: Adaptive stain separation-based contrastive learning with pseudo-labeling for histopathological image classification

Jan 04, 2024
Bodong Zhang, Hamid Manoochehri, Man Minh Ho, Fahimeh Fooladgar, Yosep Chong, Beatrice S. Knudsen, Deepika Sirohi, Tolga Tasdizen

Viaarxiv icon

Beyond the Surface: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation

Jan 12, 2024
Akshita Jha, Vinodkumar Prabhakaran, Remi Denton, Sarah Laszlo, Shachi Dave, Rida Qadri, Chandan K. Reddy, Sunipa Dev

Viaarxiv icon

TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images

Jan 25, 2024
Jia Wan, Wanhua Li, Atmadeep Banerjee, Jason Ken Adhinarta, Evelina Sjostedt, Jingpeng Wu, Jeff Lichtman, Hanspeter Pfister, Donglai Wei

Viaarxiv icon

MapChange: Enhancing Semantic Change Detection with Temporal-Invariant Historical Maps Based on Deep Triplet Network

Jan 21, 2024
Yinhe Liu, Sunan Shi, Zhuo Zheng, Jue Wang, Shiqi Tian, Yanfei Zhong

Viaarxiv icon