Alert button

"Text": models, code, and papers
Alert button

Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

Feb 28, 2024
Shanu Vashishtha, Abhinav Prakash, Lalitesh Morishetti, Kaushiki Nag, Yokila Arora, Sushant Kumar, Kannan Achan

Figure 1 for Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Figure 2 for Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Figure 3 for Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Figure 4 for Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Viaarxiv icon

BjTT: A Large-scale Multimodal Dataset for Traffic Prediction

Mar 08, 2024
Chengyang Zhang, Yong Zhang, Qitan Shao, Bo Li, Yisheng Lv, Xinglin Piao, Baocai Yin

Figure 1 for BjTT: A Large-scale Multimodal Dataset for Traffic Prediction
Figure 2 for BjTT: A Large-scale Multimodal Dataset for Traffic Prediction
Figure 3 for BjTT: A Large-scale Multimodal Dataset for Traffic Prediction
Figure 4 for BjTT: A Large-scale Multimodal Dataset for Traffic Prediction
Viaarxiv icon

Renovating Names in Open-Vocabulary Segmentation Benchmarks

Mar 14, 2024
Haiwen Huang, Songyou Peng, Dan Zhang, Andreas Geiger

Viaarxiv icon

Hyper-CL: Conditioning Sentence Representations with Hypernetworks

Mar 14, 2024
Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim

Viaarxiv icon

Annotation Free Semantic Segmentation with Vision Foundation Models

Mar 14, 2024
Soroush Seifi, Daniel Olmeda Reino, Fabien Despinoy, Rahaf Aljundi

Viaarxiv icon

Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries

Feb 28, 2024
Zijun Long, Xuri Ge, Richard Mccreadie, Joemon Jose

Viaarxiv icon

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Mar 08, 2024
Xiwei Hu, Rui Wang, Yixiao Fang, Bin Fu, Pei Cheng, Gang Yu

Figure 1 for ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Figure 2 for ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Figure 3 for ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Figure 4 for ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, Charline Le Lan, Christopher A. Choquette-Choo, Clément Crepy, Daniel Cer, Daphne Ippolito, David Reid, Elena Buchatskaya, Eric Ni, Eric Noland, Geng Yan, George Tucker, George-Christian Muraru, Grigory Rozhdestvenskiy, Henryk Michalewski, Ian Tenney, Ivan Grishchenko, Jacob Austin, James Keeling, Jane Labanowski, Jean-Baptiste Lespiau, Jeff Stanway, Jenny Brennan, Jeremy Chen, Johan Ferret, Justin Chiu, Justin Mao-Jones, Katherine Lee, Kathy Yu, Katie Millican, Lars Lowe Sjoesund, Lisa Lee, Lucas Dixon, Machel Reid, Maciej Mikuła, Mateo Wirth, Michael Sharman, Nikolai Chinaev, Nithum Thain, Olivier Bachem, Oscar Chang, Oscar Wahltinez, Paige Bailey, Paul Michel, Petko Yotov, Pier Giuseppe Sessa, Rahma Chaabouni, Ramona Comanescu, Reena Jana, Rohan Anil, Ross McIlroy, Ruibo Liu, Ryan Mullins, Samuel L Smith, Sebastian Borgeaud, Sertan Girgin, Sholto Douglas, Shree Pandya, Siamak Shakeri, Soham De, Ted Klimenko, Tom Hennigan, Vlad Feinberg, Wojciech Stokowiec, Yu-hui Chen, Zafarali Ahmed, Zhitao Gong, Tris Warkentin, Ludovic Peran, Minh Giang, Clément Farabet, Oriol Vinyals, Jeff Dean, Koray Kavukcuoglu, Demis Hassabis, Zoubin Ghahramani, Douglas Eck, Joelle Barral, Fernando Pereira, Eli Collins, Armand Joulin, Noah Fiedel, Evan Senter, Alek Andreev, Kathleen Kenealy

Viaarxiv icon

Knowledge Condensation and Reasoning for Knowledge-based VQA

Mar 15, 2024
Dongze Hao, Jian Jia, Longteng Guo, Qunbo Wang, Te Yang, Yan Li, Yanhua Cheng, Bo Wang, Quan Chen, Han Li, Jing Liu

Viaarxiv icon

Generating Clarification Questions for Disambiguating Contracts

Mar 12, 2024
Anmol Singhal, Chirag Jain, Preethu Rose Anish, Arkajyoti Chakraborty, Smita Ghaisas

Viaarxiv icon