Leonid Karlinsky

Towards Multimodal In-Context Learning for Vision & Language Models

Mar 19, 2024
Sivan Doveh, Shaked Perek, M. Jehanzeb Mirza, Amit Alfassy, Assaf Arbelle, Shimon Ullman, Leonid Karlinsky

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

Mar 19, 2024
M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuehne, Horst Possegger

CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory

Feb 21, 2024
Zexue He, Leonid Karlinsky, Donghyun Kim, Julian McAuley, Dmitry Krotov, Rogerio Feris

3VL: using Trees to teach Vision & Language models compositional concepts

Dec 28, 2023
Nir Yellinek, Leonid Karlinsky, Raja Giryes

Learning Human Action Recognition Representations Without Real Humans

Nov 10, 2023
Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogerio Feris

GeRA: Label-Efficient Geometrically Regularized Alignment

Oct 07, 2023
Dustin Klebe, Tal Shnitzer, Mikhail Yurochkin, Leonid Karlinsky, Justin Solomon

Joint Audio and Speech Understanding

Oct 02, 2023
Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

Self-Specialization: Uncovering Latent Expertise within Large Language Models

Sep 29, 2023
Junmo Kang, Hongyin Luo, Yada Zhu, James Glass, David Cox, Alan Ritter, Rogerio Feris, Leonid Karlinsky

TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification

Sep 13, 2023
M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogerio Feris, Horst Bischof

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Jul 06, 2023
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James Glass
