Alert button

"Image": models, code, and papers
Alert button

GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video

Add code
Bookmark button
Alert button
Feb 26, 2024
Xinqi Liu, Chenming Wu, Xing Liu, Jialun Liu, Jinbo Wu, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang

Viaarxiv icon

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

Add code
Bookmark button
Alert button
Feb 29, 2024
Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai

Viaarxiv icon

Learning a Generalized Physical Face Model From Data

Feb 29, 2024
Lingchen Yang, Gaspard Zoss, Prashanth Chandran, Markus Gross, Barbara Solenthaler, Eftychios Sifakis, Derek Bradley

Viaarxiv icon

A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation

Add code
Bookmark button
Alert button
Feb 29, 2024
Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wang

Viaarxiv icon

VideoMAC: Video Masked Autoencoders Meet ConvNets

Feb 29, 2024
Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, Yazhou Yao

Viaarxiv icon

MOSAIC: A Modular System for Assistive and Interactive Cooking

Add code
Bookmark button
Alert button
Feb 29, 2024
Huaxiaoyue Wang, Kushal Kedia, Juntao Ren, Rahma Abdullah, Atiksh Bhardwaj, Angela Chao, Kelly Y Chen, Nathaniel Chin, Prithwish Dan, Xinyi Fan, Gonzalo Gonzalez-Pumariega, Aditya Kompella, Maximus Adrian Pace, Yash Sharma, Xiangwan Sun, Neha Sunkara, Sanjiban Choudhury

Viaarxiv icon

MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery

Feb 29, 2024
Feihong Lu, Weiqi Wang, Yangyifei Luo, Ziqin Zhu, Qingyun Sun, Baixuan Xu, Haochen Shi, Shiqi Gao, Qian Li, Yangqiu Song, Jianxin Li

Viaarxiv icon

Fine-tuning CLIP Text Encoders with Two-step Paraphrasing

Feb 23, 2024
Hyunjae Kim, Seunghyun Yoon, Trung Bui, Handong Zhao, Quan Tran, Franck Dernoncourt, Jaewoo Kang

Viaarxiv icon

On normalization-equivariance properties of supervised and unsupervised denoising methods: a survey

Feb 23, 2024
Sébastien Herbreteau, Charles Kervrann

Viaarxiv icon

CommVQA: Situating Visual Question Answering in Communicative Contexts

Add code
Bookmark button
Alert button
Feb 22, 2024
Nandita Shankar Naik, Christopher Potts, Elisa Kreiss

Viaarxiv icon