Alert button

"Text": models, code, and papers
Alert button

Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA

Feb 24, 2024
Wentao Mo, Yang Liu

Viaarxiv icon

Named Entity Recognition for Address Extraction in Speech-to-Text Transcriptions Using Synthetic Data

Feb 08, 2024
Bibiána Lajčinová, Patrik Valábek, Michal Spišiak

Viaarxiv icon

From Keywords to Structured Summaries: Streamlining Scholarly Knowledge Access

Feb 22, 2024
Mahsa Shamsabadi, Jennifer D'Souza

Viaarxiv icon

Cleaner Pretraining Corpus Curation with Neural Web Scraping

Feb 22, 2024
Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zhiyuan Liu, Chenyan Xiong, Ge Yu

Viaarxiv icon

Tree-Planted Transformers: Large Language Models with Implicit Syntactic Supervision

Feb 20, 2024
Ryo Yoshida, Taiga Someya, Yohei Oseki

Viaarxiv icon

More Discriminative Sentence Embeddings via Semantic Graph Smoothing

Feb 20, 2024
Chakib Fettal, Lazhar Labiod, Mohamed Nadif

Viaarxiv icon

CLoVe: Encoding Compositional Language in Contrastive Vision-Language Models

Feb 22, 2024
Santiago Castro, Amir Ziai, Avneesh Saluja, Zhuoning Yuan, Rada Mihalcea

Viaarxiv icon

Sparse and Structured Hopfield Networks

Feb 21, 2024
Saul Santos, Vlad Niculae, Daniel McNamee, Andre F. T. Martins

Viaarxiv icon

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Feb 23, 2024
Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John Canny, Ian Fischer

Viaarxiv icon

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

Feb 19, 2024
Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

Viaarxiv icon