Alert button

"Image": models, code, and papers
Alert button

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

Add code
Bookmark button
Alert button
Jan 04, 2024
Longtian Qiu, Shan Ning, Xuming He

Viaarxiv icon

LLMRA: Multi-modal Large Language Model based Restoration Assistant

Jan 21, 2024
Xiaoyu Jin, Yuan Shi, Bin Xia, Wenming Yang

Viaarxiv icon

GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis

Add code
Bookmark button
Alert button
Jan 27, 2024
Jing Hao, Moyun Liu, Kuo Feng Hung

Viaarxiv icon

SonicVisionLM: Playing Sound with Vision Language Models

Add code
Bookmark button
Alert button
Jan 27, 2024
Zhifeng Xie, Shengye Yu, Qile He, Mengtian Li

Viaarxiv icon

Towards Model Predictive Control for Acrobatic Quadrotor Flights

Jan 30, 2024
Saransh Jain, Yash Shethwala, Jnaneshwar Das

Viaarxiv icon

Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

Jan 30, 2024
Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu

Viaarxiv icon

Static and Dynamic Synthesis of Bengali and Devanagari Signatures

Jan 30, 2024
Miguel A. Ferrer, Sukalpa Chanda, Moises Diaz, Chayan Kr. Banerjee, Anirban Majumdar, Cristina Carmona-Duarte, Parikshit Acharya, Umapada Pal

Viaarxiv icon

Improving Image Restoration through Removing Degradations in Textual Representations

Add code
Bookmark button
Alert button
Dec 28, 2023
Jingbo Lin, Zhilu Zhang, Yuxiang Wei, Dongwei Ren, Dongsheng Jiang, Wangmeng Zuo

Viaarxiv icon

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 01, 2024
Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine

Viaarxiv icon

MACE CT Reconstruction for Modular Material Decomposition from Energy Resolving Photon-Counting Data

Feb 01, 2024
Natalie M. Jadue, Madhuri Nagare, Jonathan S. Maltz, Gregery T. Buzzard, Charles A. Bouman

Viaarxiv icon