Alert button

"Text": models, code, and papers
Alert button

VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding

Mar 14, 2024
Chris Kelly, Luhui Hu, Jiayin Hu, Yu Tian, Deshun Yang, Bang Yang, Cindy Yang, Zihao Li, Zaoshan Huang, Yuexian Zou

Viaarxiv icon

Information Extraction: An application to the domain of hyper-local financial data on developing countries

Mar 14, 2024
Abuzar Royesh, Olamide Oladeji

Viaarxiv icon

Efficiently Leveraging Linguistic Priors for Scene Text Spotting

Feb 27, 2024
Nguyen Nguyen, Yapeng Tian, Chenliang Xu

Viaarxiv icon

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Mar 14, 2024
Eric Zelikman, Georges Harik, Yijia Shao, Varuna Jayasiri, Nick Haber, Noah D. Goodman

Viaarxiv icon

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

Mar 04, 2024
Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

Figure 1 for 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
Figure 2 for 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
Figure 3 for 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
Figure 4 for 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
Viaarxiv icon

What Text Design Characterizes Book Genres?

Feb 26, 2024
Daichi Haraguchi, Brian Kenji Iwana, Seiichi Uchida

Viaarxiv icon

The Impact of Quantization on the Robustness of Transformer-based Text Classifiers

Mar 08, 2024
Seyed Parsa Neshaei, Yasaman Boreshban, Gholamreza Ghassem-Sani, Seyed Abolghasem Mirroshandel

Figure 1 for The Impact of Quantization on the Robustness of Transformer-based Text Classifiers
Figure 2 for The Impact of Quantization on the Robustness of Transformer-based Text Classifiers
Figure 3 for The Impact of Quantization on the Robustness of Transformer-based Text Classifiers
Viaarxiv icon

Distilling Text Style Transfer With Self-Explanation From LLMs

Mar 02, 2024
Chiyu Zhang, Honglong Cai, Yuezhang, Li, Yuexin Wu, Le Hou, Muhammad Abdul-Mageed

Figure 1 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 2 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 3 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 4 for Distilling Text Style Transfer With Self-Explanation From LLMs
Viaarxiv icon

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Mar 14, 2024
Wonjun Kang, Kevin Galim, Hyung Il Koo

Viaarxiv icon

RORA: Robust Free-Text Rationale Evaluation

Feb 28, 2024
Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu

Viaarxiv icon