Picture for Xudong Shen

Xudong Shen

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Apr 18, 2024
Figure 1 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 2 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 3 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 4 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Viaarxiv icon

Finetuning Text-to-Image Diffusion Models for Fairness

Add code
Nov 11, 2023
Viaarxiv icon

Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation

Add code
Oct 04, 2023
Viaarxiv icon

Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities

Add code
Jun 22, 2023
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Add code
Jun 15, 2023
Figure 1 for Inverse Scaling: When Bigger Isn't Better
Figure 2 for Inverse Scaling: When Bigger Isn't Better
Figure 3 for Inverse Scaling: When Bigger Isn't Better
Figure 4 for Inverse Scaling: When Bigger Isn't Better
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

Add code
Apr 27, 2022
Figure 1 for Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering
Figure 2 for Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering
Figure 3 for Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering
Figure 4 for Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering
Viaarxiv icon

Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks

Add code
Apr 16, 2022
Figure 1 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 2 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 3 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 4 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Viaarxiv icon

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Add code
Dec 06, 2021
Figure 1 for NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Figure 2 for NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Figure 3 for NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Figure 4 for NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Viaarxiv icon

RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System

Add code
Oct 18, 2021
Figure 1 for RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System
Figure 2 for RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System
Figure 3 for RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System
Figure 4 for RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System
Viaarxiv icon