"Text": models, code, and papers

Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Nov 11, 2023
Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai

Lasagna: Layered Score Distillation for Disentangled Object Relighting

Nov 30, 2023
Dina Bashkirova, Arijit Ray, Rupayan Mallick, Sarah Adel Bargal, Jianming Zhang, Ranjay Krishna, Kate Saenko

SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

Oct 27, 2023
Adrián Bazaga, Pietro Liò, Gos Micklem

Disentangled Representation Learning with Large Language Models for Text-Attributed Graphs

Oct 27, 2023
Yijian Qin, Xin Wang, Ziwei Zhang, Wenwu Zhu

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

Nov 27, 2023
Yifei Chen, Dapeng Chen, Ruijin Liu, Sai Zhou, Wenyuan Xue, Wei Peng

Italian Crossword Generator: Enhancing Education through Interactive Word Puzzles

Nov 27, 2023
Kamyar Zeinalipour, Tommaso Iaquinta, Asya Zanollo, Giovanni Angelini, Leonardo Rigutini, Marco Maggini, Marco Gori

SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata

Dec 01, 2023
Mark Díaz, Sunipa Dev, Emily Reif, Emily Denton, Vinodkumar Prabhakaran

Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion

Dec 02, 2023
Anand Kamble, Aniket Tathe, Suyash Kumbharkar, Atharva Bhandare, Anirban C. Mitra

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Dec 06, 2023
Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure

Dec 06, 2023
Alex Kim, Sangwon Yoon
