Alert button

"Image": models, code, and papers
Alert button

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

Mar 13, 2024
Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang

Figure 1 for Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Figure 2 for Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Figure 3 for Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Figure 4 for Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Viaarxiv icon

TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification

Mar 10, 2024
Tong Zheng, Shusaku Sone, Yoshitaka Ushiku, Yuki Oba, Jiaxin Ma

Figure 1 for TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
Figure 2 for TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
Figure 3 for TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
Figure 4 for TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
Viaarxiv icon

BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM

Add code
Bookmark button
Alert button
Mar 11, 2024
Li Zhang, Youwei Liang, Ruiyi Zhang, Amirhosein Javadi, Pengtao Xie

Viaarxiv icon

A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge

Add code
Bookmark button
Alert button
Mar 11, 2024
Hasanul Mahmud, Peng Kang, Kevin Desai, Palden Lama, Sushil Prasad

Figure 1 for A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Figure 2 for A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Figure 3 for A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Figure 4 for A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Viaarxiv icon

A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Xiujie Song, Mengyue Wu, Kenny Q. Zhu, Chunhao Zhang, Yanyi Chen

Viaarxiv icon

FedLPPA: Learning Personalized Prompt and Aggregation for Federated Weakly-supervised Medical Image Segmentation

Add code
Bookmark button
Alert button
Feb 27, 2024
Li Lin, Yixiang Liu, Jiewei Wu, Pujin Cheng, Zhiyuan Cai, Kenneth K. Y. Wong, Xiaoying Tang

Viaarxiv icon

SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams

Add code
Bookmark button
Alert button
Mar 14, 2024
Kang Chen, Shiyan Chen, Jiyuan Zhang, Baoyue Zhang, Yajing Zheng, Tiejun Huang, Zhaofei Yu

Figure 1 for SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams
Figure 2 for SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams
Figure 3 for SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams
Figure 4 for SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams
Viaarxiv icon

Intention-driven Ego-to-Exo Video Generation

Mar 14, 2024
Hongchen Luo, Kai Zhu, Wei Zhai, Yang Cao

Figure 1 for Intention-driven Ego-to-Exo Video Generation
Figure 2 for Intention-driven Ego-to-Exo Video Generation
Figure 3 for Intention-driven Ego-to-Exo Video Generation
Figure 4 for Intention-driven Ego-to-Exo Video Generation
Viaarxiv icon

FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images

Add code
Bookmark button
Alert button
Mar 14, 2024
Yiqing Shen, Jingxing Li, Xinyuan Shao, Blanca Inigo Romillo, Ankush Jindal, David Dreizin, Mathias Unberath

Figure 1 for FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images
Figure 2 for FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images
Figure 3 for FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images
Figure 4 for FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images
Viaarxiv icon

DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction

Mar 08, 2024
Ziqi Gao, Yue Zhang, Xinwen Liu, Kaiyan Li, S. Kevin Zhou

Figure 1 for DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Figure 2 for DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Figure 3 for DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Figure 4 for DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstruction
Viaarxiv icon