"Information": models, code, and papers

QwenGrasp: A Usage of Large Vision-Language Model for Target-Oriented Grasping

Oct 08, 2023
Xinyu Chen, Jian Yang, Zonghan He, Haobin Yang, Qi Zhao, Yuhui Shi

Via arXiv

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

Oct 08, 2023
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang

Via arXiv

Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving

Oct 08, 2023
Ye Li, Hanjiang Hu, Zuxin Liu, Ding Zhao

Via arXiv

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Oct 08, 2023
Xi Victoria Lin, Xilun Chen, Mingda Chen, Weijia Shi, Maria Lomeli, Rich James, Pedro Rodriguez, Jacob Kahn, Gergely Szilvasy, Mike Lewis, Luke Zettlemoyer, Scott Yih

Via arXiv

Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration

Oct 10, 2023
Piyush Singh Pasi, Karthikeya Battepati, Preethi Jyothi, Ganesh Ramakrishnan, Tanmay Mahapatra, Manoj Singh

Via arXiv

SC2GAN: Rethinking Entanglement by Self-correcting Correlated GAN Space

Oct 10, 2023
Zikun Chen, Han Zhao, Parham Aarabi, Ruowei Jiang

Via arXiv

EmoTwiCS: A Corpus for Modelling Emotion Trajectories in Dutch Customer Service Dialogues on Twitter

Oct 10, 2023
Sofie Labat, Thomas Demeester, Véronique Hoste

Via arXiv

Automated clinical coding using off-the-shelf large language models

Oct 10, 2023
Joseph S. Boyle, Antanas Kascenas, Pat Lok, Maria Liakata, Alison Q. O'Neil

Via arXiv

Quantum computer-enabled receivers for optical communication

Sep 27, 2023
John Crossman, Spencer Dimitroff, Lukasz Cincio, Mohan Sarovar

Via arXiv

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding

Sep 25, 2023
Renqiu Xia, Bo Zhang, Haoyang Peng, Ning Liao, Peng Ye, Botian Shi, Junchi Yan, Yu Qiao

Via arXiv