Daniel Li

Jack

Perception Encoder: The best visual embeddings are not at the output of the network

Apr 17, 2025

Data Foundations for Large Scale Multimodal Clinical Foundation Models

Mar 09, 2025

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment

Dec 20, 2024

Creating a Cooperative AI Policymaking Platform through Open Source Collaboration

Dec 09, 2024

Incorporating Human Explanations for Robust Hate Speech Detection

Nov 09, 2024

The Llama 3 Herd of Models

Jul 31, 2024

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Mar 25, 2024

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Mar 12, 2024

HumanEval on Latest GPT Models -- 2024

Feb 20, 2024

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Aug 23, 2023