Alert button
Picture for Marco Tulio Ribeiro

Marco Tulio Ribeiro

Alert button

Targeted Data Generation: Finding and Fixing Model Weaknesses

Add code
Bookmark button
Alert button
May 28, 2023
Zexue He, Marco Tulio Ribeiro, Fereshte Khani

Figure 1 for Targeted Data Generation: Finding and Fixing Model Weaknesses
Figure 2 for Targeted Data Generation: Finding and Fixing Model Weaknesses
Figure 3 for Targeted Data Generation: Finding and Fixing Model Weaknesses
Figure 4 for Targeted Data Generation: Finding and Fixing Model Weaknesses
Viaarxiv icon

Collaborative Development of NLP models

Add code
Bookmark button
Alert button
May 24, 2023
Fereshte Khani, Marco Tulio Ribeiro

Figure 1 for Collaborative Development of NLP models
Figure 2 for Collaborative Development of NLP models
Figure 3 for Collaborative Development of NLP models
Figure 4 for Collaborative Development of NLP models
Viaarxiv icon

Supporting Human-AI Collaboration in Auditing LLMs with LLMs

Add code
Bookmark button
Alert button
Apr 19, 2023
Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Saleema Amershi

Figure 1 for Supporting Human-AI Collaboration in Auditing LLMs with LLMs
Figure 2 for Supporting Human-AI Collaboration in Auditing LLMs with LLMs
Figure 3 for Supporting Human-AI Collaboration in Auditing LLMs with LLMs
Figure 4 for Supporting Human-AI Collaboration in Auditing LLMs with LLMs
Viaarxiv icon

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Add code
Bookmark button
Alert button
Mar 27, 2023
Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

Figure 1 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 2 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 3 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Figure 4 for Sparks of Artificial General Intelligence: Early experiments with GPT-4
Viaarxiv icon

ART: Automatic multi-step reasoning and tool-use for large language models

Add code
Bookmark button
Alert button
Mar 16, 2023
Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro

Figure 1 for ART: Automatic multi-step reasoning and tool-use for large language models
Figure 2 for ART: Automatic multi-step reasoning and tool-use for large language models
Figure 3 for ART: Automatic multi-step reasoning and tool-use for large language models
Figure 4 for ART: Automatic multi-step reasoning and tool-use for large language models
Viaarxiv icon

ScatterShot: Interactive In-context Example Curation for Text Transformation

Add code
Bookmark button
Alert button
Feb 14, 2023
Tongshuang Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro

Figure 1 for ScatterShot: Interactive In-context Example Curation for Text Transformation
Figure 2 for ScatterShot: Interactive In-context Example Curation for Text Transformation
Figure 3 for ScatterShot: Interactive In-context Example Curation for Text Transformation
Figure 4 for ScatterShot: Interactive In-context Example Curation for Text Transformation
Viaarxiv icon

Editing Models with Task Arithmetic

Add code
Bookmark button
Alert button
Dec 08, 2022
Gabriel Ilharco, Marco Tulio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi

Figure 1 for Editing Models with Task Arithmetic
Figure 2 for Editing Models with Task Arithmetic
Figure 3 for Editing Models with Task Arithmetic
Figure 4 for Editing Models with Task Arithmetic
Viaarxiv icon

Adaptive Testing of Computer Vision Models

Add code
Bookmark button
Alert button
Dec 06, 2022
Irena Gao, Gabriel Ilharco, Scott Lundberg, Marco Tulio Ribeiro

Figure 1 for Adaptive Testing of Computer Vision Models
Figure 2 for Adaptive Testing of Computer Vision Models
Figure 3 for Adaptive Testing of Computer Vision Models
Figure 4 for Adaptive Testing of Computer Vision Models
Viaarxiv icon