Alert button

"Information": models, code, and papers
Alert button

Predicting performance difficulty from piano sheet music images

Add code
Bookmark button
Alert button
Sep 28, 2023
Pedro Ramoneda, Jose J. Valero-Mas, Dasaem Jeong, Xavier Serra

Viaarxiv icon

Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer

Sep 28, 2023
Zhihao Zhang, Yiwei Chen, Weizhan Zhang, Caixia Yan, Qinghua Zheng, Qi Wang, Wangdu Chen

Figure 1 for Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Figure 2 for Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Figure 3 for Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Figure 4 for Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Viaarxiv icon

AgriSORT: A Simple Online Real-time Tracking-by-Detection framework for robotics in precision agriculture

Sep 28, 2023
Leonardo Saraceni, Ionut M. Motoi, Daniele Nardi, Thomas A. Ciarfuglia

Viaarxiv icon

Reliable Majority Vote Computation with Complementary Sequences for UAV Waypoint Flight Control

Sep 26, 2023
Alphan Sahin, Xiaofeng Wang

Viaarxiv icon

Generating Visual Scenes from Touch

Add code
Bookmark button
Alert button
Sep 26, 2023
Fengyu Yang, Jiacheng Zhang, Andrew Owens

Viaarxiv icon

Demystifying Visual Features of Movie Posters for Multi-Label Genre Identification

Sep 21, 2023
Utsav Kumar Nareti, Chandranath Adak, Soumi Chattopadhyay

Viaarxiv icon

Navigation with shadow prices to optimize multi-commodity flow rates

Sep 25, 2023
Ignacio Boero, Igor Spasojevic, Mariana del Castillo, George Pappas, Vijay Kumar, Alejandro Ribeiro

Figure 1 for Navigation with shadow prices to optimize multi-commodity flow rates
Figure 2 for Navigation with shadow prices to optimize multi-commodity flow rates
Figure 3 for Navigation with shadow prices to optimize multi-commodity flow rates
Figure 4 for Navigation with shadow prices to optimize multi-commodity flow rates
Viaarxiv icon

Teaching Text-to-Image Models to Communicate

Add code
Bookmark button
Alert button
Sep 27, 2023
Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao

Viaarxiv icon

Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

Add code
Bookmark button
Alert button
Sep 27, 2023
Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

Figure 1 for Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization
Figure 2 for Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization
Figure 3 for Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization
Figure 4 for Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization
Viaarxiv icon

NLPBench: Evaluating Large Language Models on Solving NLP Problems

Add code
Bookmark button
Alert button
Sep 27, 2023
Linxin Song, Jieyu Zhang, Lechao Cheng, Pengyuan Zhou, Tianyi Zhou, Irene Li

Viaarxiv icon