Dahua Lin

Eric

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Jun 13, 2024
Via arXiv

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 12, 2024
Via arXiv

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

Jun 07, 2024
Via arXiv

Uncertainty Aware Learning for Language Model Alignment

Jun 07, 2024
Via arXiv

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Jun 06, 2024
Via arXiv

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

May 31, 2024
Via arXiv

ANAH: Analytical Annotation of Hallucinations in Large Language Models

May 30, 2024
Via arXiv

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

May 29, 2024
Via arXiv

DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data

May 28, 2024
Via arXiv

Streaming Long Video Understanding with Large Language Models

May 25, 2024
Via arXiv