Alert button

"Image": models, code, and papers
Alert button

LMT: Longitudinal Mixing Training, a Framework to Predict Disease Progression from a Single Image

Oct 16, 2023
Rachid Zeghlache, Pierre-Henri Conze, Mostafa El Habib Daho, Yihao Li, Hugo Le boite, Ramin Tadayoni, Pascal Massin, Béatrice Cochener, Ikram Brahim, Gwenolé Quellec, Mathieu Lamard

Viaarxiv icon

Contextual Emotion Estimation from Image Captions

Add code
Bookmark button
Alert button
Sep 22, 2023
Vera Yang, Archita Srivastava, Yasaman Etesam, Chuxuan Zhang, Angelica Lim

Figure 1 for Contextual Emotion Estimation from Image Captions
Figure 2 for Contextual Emotion Estimation from Image Captions
Figure 3 for Contextual Emotion Estimation from Image Captions
Figure 4 for Contextual Emotion Estimation from Image Captions
Viaarxiv icon

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

Add code
Bookmark button
Alert button
Nov 13, 2023
Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao

Viaarxiv icon

GPT-4V(ision) as A Social Media Analysis Engine

Nov 13, 2023
Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

Viaarxiv icon

The Development of LLMs for Embodied Navigation

Add code
Bookmark button
Alert button
Nov 10, 2023
Jinzhou Lin, Han Gao, Rongtao Xu, Changwei Wang, Li Guo, Shibiao Xu

Viaarxiv icon

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Nov 10, 2023
Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi

Figure 1 for Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Figure 2 for Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Figure 3 for Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Figure 4 for Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Viaarxiv icon

TacIPC: Intersection- and Inversion-free FEM-based Elastomer Simulation For Optical Tactile Sensors

Nov 10, 2023
Wenxin Du, Wenqiang Xu, Jieji Ren, Zhenjun Yu, Cewu Lu

Viaarxiv icon

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

Add code
Bookmark button
Alert button
Nov 10, 2023
Weiyang Liu, Zeju Qiu, Yao Feng, Yuliang Xiu, Yuxuan Xue, Longhui Yu, Haiwen Feng, Zhen Liu, Juyeon Heo, Songyou Peng, Yandong Wen, Michael J. Black, Adrian Weller, Bernhard Schölkopf

Figure 1 for Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Figure 2 for Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Figure 3 for Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Figure 4 for Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Viaarxiv icon

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Nov 10, 2023
Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan

Figure 1 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 2 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 3 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Figure 4 for Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Viaarxiv icon

GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration

Add code
Bookmark button
Alert button
Nov 08, 2023
Xuhao Shan, Xinyang Li, Ruiquan Ge, Shibin Wu, Ahmed Elazab, Jichao Zhu, Lingyan Zhang, Gangyong Jia, Qingying Xiao, Xiang Wan, Changmiao Wang

Viaarxiv icon