Alert button

"Information": models, code, and papers
Alert button

Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex

Jan 08, 2024
Shuxiao Ma, Linyuan Wang, Senbao Hou, Bin Yan

Viaarxiv icon

Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling

Jan 08, 2024
Shi-Xue Zhang, Chun Yang, Xiaobin Zhu, Hongyang Zhou, Hongfa Wang, Xu-Cheng Yin

Viaarxiv icon

Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs

Jan 08, 2024
Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su

Viaarxiv icon

Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

Dec 04, 2023
Cong-Duy Nguyen, The-Anh Vu-Le, Thong Nguyen, Tho Quan, Luu Anh Tuan

Figure 1 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 2 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 3 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 4 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Viaarxiv icon

Phase-shifted remote photoplethysmography for estimating heart rate and blood pressure from facial video

Jan 09, 2024
Gyutae Hwang, Sang Jun Lee

Viaarxiv icon

Tensor Networks for Explainable Machine Learning in Cybersecurity

Jan 05, 2024
Borja Aizpurua, Roman Orus

Viaarxiv icon

Perceptual Image Compression with Cooperative Cross-Modal Side Information

Nov 28, 2023
Shiyu Qin, Bin Chen, Yujun Huang, Baoyi An, Tao Dai, Shu-Tao Xia

Viaarxiv icon

Uncertainty Resolution in Misinformation Detection

Jan 02, 2024
Yury Orlovskiy, Camille Thibault, Anne Imouza, Jean-François Godbout, Reihaneh Rabbany, Kellin Pelrine

Viaarxiv icon

Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering

Jan 03, 2024
Pierre Erbacher, Louis Falissar, Vincent Guigue, Laure Soulier

Viaarxiv icon

STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion

Add code
Bookmark button
Alert button
Jan 03, 2024
Wei Yao, Hongwen Zhang, Yunlian Sun, Jinhui Tang

Viaarxiv icon