Picture for Thomas Wang

Thomas Wang

DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity

Add code
Feb 12, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

Ministral 3

Add code
Jan 13, 2026
Viaarxiv icon

Monitoring Deployed AI Systems in Health Care

Add code
Dec 09, 2025
Figure 1 for Monitoring Deployed AI Systems in Health Care
Figure 2 for Monitoring Deployed AI Systems in Health Care
Figure 3 for Monitoring Deployed AI Systems in Health Care
Viaarxiv icon

OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform

Add code
Oct 22, 2025
Figure 1 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 2 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 3 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 4 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Viaarxiv icon

Voxtral

Add code
Jul 17, 2025
Viaarxiv icon

Magistral

Add code
Jun 12, 2025
Figure 1 for Magistral
Figure 2 for Magistral
Figure 3 for Magistral
Figure 4 for Magistral
Viaarxiv icon

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks

Add code
May 26, 2025
Figure 1 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 2 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 3 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 4 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Viaarxiv icon

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

Mixtral of Experts

Add code
Jan 08, 2024
Figure 1 for Mixtral of Experts
Figure 2 for Mixtral of Experts
Figure 3 for Mixtral of Experts
Figure 4 for Mixtral of Experts
Viaarxiv icon