Alert button

"Text": models, code, and papers
Alert button

One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model

Feb 13, 2024
Thomas Pöllabauer, Julius Kühn, Jiayi Li, Arjan Kuijper

Viaarxiv icon

Detecting the Clinical Features of Difficult-to-Treat Depression using Synthetic Data from Large Language Models

Feb 12, 2024
Isabelle Lorge, Dan W. Joyce, Niall Taylor, Alejo Nevado-Holgado, Andrea Cipriani, Andrey Kormilitzin

Viaarxiv icon

SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

Feb 10, 2024
Hsuan-Fu Wang, Yi-Jen Shih, Heng-Jui Chang, Layne Berry, Puyuan Peng, Hung-yi Lee, Hsin-Min Wang, David Harwath

Viaarxiv icon

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency

Feb 14, 2024
Yannis Kalantidis, Mert Bülent Sarıyıldız, Rafael S. Rezende, Philippe Weinzaepfel, Diane Larlus, Gabriela Csurka

Viaarxiv icon

Multi-Fidelity Methods for Optimization: A Survey

Feb 15, 2024
Ke Li, Fan Li

Viaarxiv icon

Towards Reducing Diagnostic Errors with Interpretable Risk Prediction

Feb 15, 2024
Denis Jered McInerney, William Dickinson, Lucy Flynn, Andrea Young, Geoffrey Young, Jan-Willem van de Meent, Byron C. Wallace

Viaarxiv icon

Quantized Embedding Vectors for Controllable Diffusion Language Models

Feb 15, 2024
Cheng Kang, Xinye Chen, Yong Hu, Daniel Novak

Viaarxiv icon

SoftEDA: Rethinking Rule-Based Data Augmentation with Soft Labels

Feb 08, 2024
Juhwan Choi, Kyohoon Jin, Junho Lee, Sangmin Song, Youngbin Kim

Viaarxiv icon

Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy

Feb 11, 2024
Simon Ging, María A. Bravo, Thomas Brox

Viaarxiv icon

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Jan 25, 2024
Xiangshuo Qiao, Xianxin Li, Xiaozhe Qu, Jie Zhang, Yang Liu, Yu Luo, Cihang Jin, Jin Ma

Viaarxiv icon