Alert button

"Information": models, code, and papers
Alert button

The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models

Add code
Bookmark button
Alert button
Jan 22, 2024
Kian Ahrabian, Zhivar Sourati, Kexuan Sun, Jiarui Zhang, Yifan Jiang, Fred Morstatter, Jay Pujara

Viaarxiv icon

Spatial-Contextual Discrepancy Information Compensation for GAN Inversion

Dec 12, 2023
Ziqiang Zhang, Yan Yan, Jing-Hao Xue, Hanzi Wang

Figure 1 for Spatial-Contextual Discrepancy Information Compensation for GAN Inversion
Figure 2 for Spatial-Contextual Discrepancy Information Compensation for GAN Inversion
Figure 3 for Spatial-Contextual Discrepancy Information Compensation for GAN Inversion
Figure 4 for Spatial-Contextual Discrepancy Information Compensation for GAN Inversion
Viaarxiv icon

On the Audio Hallucinations in Large Audio-Video Language Models

Jan 18, 2024
Taichi Nishimura, Shota Nakada, Masayoshi Kondo

Viaarxiv icon

Fine-grained Contract NER using instruction based model

Jan 24, 2024
Hiranmai Sri Adibhatla, Pavan Baswani, Manish Shrivastava

Viaarxiv icon

Question answering systems for health professionals at the point of care -- a systematic review

Jan 24, 2024
Gregory Kell, Angus Roberts, Serge Umansky, Linglong Qian, Davide Ferrari, Frank Soboczenski, Byron Wallace, Nikhil Patel, Iain J Marshall

Viaarxiv icon

Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units

Jan 18, 2024
Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro

Viaarxiv icon

Truck Parking Usage Prediction with Decomposed Graph Neural Networks

Jan 23, 2024
Rei Tamaru, Yang Cheng, Steven Parker, Ernie Perry, Bin Ran, Soyoung Ahn

Viaarxiv icon

Control-Aware Trajectory Predictions for Communication-Efficient Drone Swarm Coordination in Cluttered Environments

Jan 23, 2024
Longhao Yan, Jingyuan Zhou, Kaidi Yang

Viaarxiv icon

TD^2-Net: Toward Denoising and Debiasing for Dynamic Scene Graph Generation

Jan 23, 2024
Xin Lin, Chong Shi, Yibing Zhan, Zuopeng Yang, Yaqi Wu, Dacheng Tao

Viaarxiv icon

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Jan 23, 2024
Younglo Lee, Shukjae Choi, Byeong-Yeol Kim, Zhong-Qiu Wang, Shinji Watanabe

Viaarxiv icon