Picture for Luyu Wang

Luyu Wang

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Vision-Language Models as a Source of Rewards

Add code
Dec 14, 2023
Figure 1 for Vision-Language Models as a Source of Rewards
Figure 2 for Vision-Language Models as a Source of Rewards
Figure 3 for Vision-Language Models as a Source of Rewards
Figure 4 for Vision-Language Models as a Source of Rewards
Viaarxiv icon

Zorro: the masked multimodal transformer

Add code
Jan 23, 2023
Figure 1 for Zorro: the masked multimodal transformer
Figure 2 for Zorro: the masked multimodal transformer
Figure 3 for Zorro: the masked multimodal transformer
Figure 4 for Zorro: the masked multimodal transformer
Viaarxiv icon

In-context Reinforcement Learning with Algorithm Distillation

Add code
Oct 25, 2022
Figure 1 for In-context Reinforcement Learning with Algorithm Distillation
Figure 2 for In-context Reinforcement Learning with Algorithm Distillation
Figure 3 for In-context Reinforcement Learning with Algorithm Distillation
Figure 4 for In-context Reinforcement Learning with Algorithm Distillation
Viaarxiv icon

Towards Learning Universal Audio Representations

Add code
Dec 01, 2021
Figure 1 for Towards Learning Universal Audio Representations
Figure 2 for Towards Learning Universal Audio Representations
Figure 3 for Towards Learning Universal Audio Representations
Figure 4 for Towards Learning Universal Audio Representations
Viaarxiv icon

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

Add code
Jul 20, 2021
Figure 1 for WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset
Figure 2 for WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset
Figure 3 for WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset
Figure 4 for WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset
Viaarxiv icon

Multimodal Self-Supervised Learning of General Audio Representations

Add code
Apr 28, 2021
Figure 1 for Multimodal Self-Supervised Learning of General Audio Representations
Figure 2 for Multimodal Self-Supervised Learning of General Audio Representations
Figure 3 for Multimodal Self-Supervised Learning of General Audio Representations
Figure 4 for Multimodal Self-Supervised Learning of General Audio Representations
Viaarxiv icon

Broaden Your Views for Self-Supervised Video Learning

Add code
Mar 30, 2021
Figure 1 for Broaden Your Views for Self-Supervised Video Learning
Figure 2 for Broaden Your Views for Self-Supervised Video Learning
Figure 3 for Broaden Your Views for Self-Supervised Video Learning
Figure 4 for Broaden Your Views for Self-Supervised Video Learning
Viaarxiv icon

Multi-Format Contrastive Learning of Audio Representations

Add code
Mar 24, 2021
Figure 1 for Multi-Format Contrastive Learning of Audio Representations
Figure 2 for Multi-Format Contrastive Learning of Audio Representations
Figure 3 for Multi-Format Contrastive Learning of Audio Representations
Figure 4 for Multi-Format Contrastive Learning of Audio Representations
Viaarxiv icon

Learning Robust and Multilingual Speech Representations

Add code
Jan 29, 2020
Figure 1 for Learning Robust and Multilingual Speech Representations
Figure 2 for Learning Robust and Multilingual Speech Representations
Figure 3 for Learning Robust and Multilingual Speech Representations
Figure 4 for Learning Robust and Multilingual Speech Representations
Viaarxiv icon