Alert button

"Text": models, code, and papers
Alert button

VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework

Mar 14, 2024
Chris Kelly, Luhui Hu, Bang Yang, Yu Tian, Deshun Yang, Cindy Yang, Zaoshan Huang, Zihao Li, Jiayin Hu, Yuexian Zou

Viaarxiv icon

XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model

Mar 14, 2024
Anees Ur Rehman Hashmi, Ibrahim Almakky, Mohammad Areeb Qazi, Santosh Sanjeev, Vijay Ram Papineni, Dwarikanath Mahapatra, Mohammad Yaqub

Viaarxiv icon

ORPO: Monolithic Preference Optimization without Reference Model

Mar 14, 2024
Jiwoo Hong, Noah Lee, James Thorne

Viaarxiv icon

RAD-PHI2: Instruction Tuning PHI-2 for Radiology

Mar 12, 2024
Mercy Ranjit, Gopinath Ganapathy, Shaury Srivastav, Tanuja Ganu, Srujana Oruganti

Viaarxiv icon

Measuring Technological Convergence in Encryption Technologies with Proximity Indices: A Text Mining and Bibliometric Analysis using OpenAlex

Mar 03, 2024
Alessandro Tavazzi, Dimitri Percia David, Julian Jang-Jaccard, Alain Mermoud

Figure 1 for Measuring Technological Convergence in Encryption Technologies with Proximity Indices: A Text Mining and Bibliometric Analysis using OpenAlex
Figure 2 for Measuring Technological Convergence in Encryption Technologies with Proximity Indices: A Text Mining and Bibliometric Analysis using OpenAlex
Figure 3 for Measuring Technological Convergence in Encryption Technologies with Proximity Indices: A Text Mining and Bibliometric Analysis using OpenAlex
Figure 4 for Measuring Technological Convergence in Encryption Technologies with Proximity Indices: A Text Mining and Bibliometric Analysis using OpenAlex
Viaarxiv icon

Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks

Feb 24, 2024
Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li

Viaarxiv icon

Ar-Spider: Text-to-SQL in Arabic

Feb 22, 2024
Saleh Almohaimeed, Saad Almohaimeed, Mansour Al Ghanim, Liqiang Wang

Viaarxiv icon

Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese

Mar 01, 2024
Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari

Figure 1 for Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese
Figure 2 for Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese
Figure 3 for Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese
Figure 4 for Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese
Viaarxiv icon

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Mar 14, 2024
Yu Wang, Xiaogeng Liu, Yu Li, Muhao Chen, Chaowei Xiao

Viaarxiv icon

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Feb 29, 2024
Alexander Black, Jing Shi, Yifei Fai, Tu Bui, John Collomosse

Viaarxiv icon