Dani Yogatama

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Apr 18, 2024
Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu, Zhihui Xie

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Apr 02, 2024
Deqing Fu, Ghazal Khalighinejad, Ollie Liu, Bhuwan Dhingra, Dani Yogatama, Robin Jia, Willie Neiswanger

Understanding In-Context Learning with a Pelican Soup Framework

Feb 16, 2024
Ting-Rui Chiang, Dani Yogatama

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Feb 04, 2024
Ollie Liu, Deqing Fu, Dani Yogatama, Willie Neiswanger

On Retrieval Augmentation and the Limitations of Language Model Training

Nov 16, 2023
Ting-Rui Chiang, Xinyan Velocity Yu, Joshua Robinson, Ollie Liu, Isabelle Lee, Dani Yogatama

The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining

Oct 25, 2023
Ting-Rui Chiang, Dani Yogatama

Interpretable Diffusion via Information Decomposition

Oct 12, 2023
Xianghao Kong, Ollie Liu, Han Li, Dani Yogatama, Greg Ver Steeg

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Jul 21, 2022
Yi Tay, Mostafa Dehghani, Samira Abnar, Hyung Won Chung, William Fedus, Jinfeng Rao, Sharan Narang, Vinh Q. Tran, Dani Yogatama, Donald Metzler

Questions Are All You Need to Train a Dense Passage Retriever

Jun 21, 2022
Devendra Singh Sachan, Mike Lewis, Dani Yogatama, Luke Zettlemoyer, Joelle Pineau, Manzil Zaheer

Emergent Abilities of Large Language Models

Jun 15, 2022
Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus
