"Information": models, code, and papers

FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues

Mar 29, 2024
Shuang Li, Jiahua Wang, Lijie Wen

A Signature Based Approach Towards Global Channel Charting with Ultra Low Complexity

Mar 29, 2024
Longhai Zhao, Yunchuan Yang, Qi Xiong, He Wang, Bin Yu, Feifei Sun, Chengjun Sun

Inclusive Design Insights from a Preliminary Image-Based Conversational Search Systems Evaluation

Mar 29, 2024
Yue Zheng, Lei Yu, Junmian Chen, Tianyu Xia, Yuanyuan Yin, Shan Wang, Haiming Liu

LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction

Apr 01, 2024
Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao

Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey

Mar 31, 2024
Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong

Privacy-preserving Optics for Enhancing Protection in Face De-identification

Mar 31, 2024
Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem

Recover: A Neuro-Symbolic Framework for Failure Detection and Recovery

Mar 31, 2024
Cristina Cornelio, Mohammed Diab

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Mar 31, 2024
Fan Bai, Yuxin Du, Tiejun Huang, Max Q.-H. Meng, Bo Zhao

KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction

Mar 14, 2024
Zixuan Li, Yutao Zeng, Yuxin Zuo, Weicheng Ren, Wenxuan Liu, Miao Su, Yucan Guo, Yantao Liu, Xiang Li, Zhilei Hu, Long Bai, Wei Li, Yidan Liu, Pan Yang, Xiaolong Jin, Jiafeng Guo, Xueqi Cheng

GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds

Mar 28, 2024
Shengjun Zhang, Xin Fei, Yueqi Duan
