Picture for Wei Li

Wei Li

Tsinghua University, Beijing, China

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Add code
Jun 13, 2024
Figure 1 for OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 2 for OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 3 for OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 4 for OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Viaarxiv icon

Can Large Language Models Understand Spatial Audio?

Add code
Jun 12, 2024
Viaarxiv icon

OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Add code
Jun 12, 2024
Figure 1 for OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 2 for OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 3 for OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Figure 4 for OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Viaarxiv icon

Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

Add code
Jun 12, 2024
Viaarxiv icon

HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation

Add code
Jun 11, 2024
Figure 1 for HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Figure 2 for HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Figure 3 for HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Figure 4 for HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Viaarxiv icon

PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection

Add code
Jun 09, 2024
Viaarxiv icon

F-LMM: Grounding Frozen Large Multimodal Models

Add code
Jun 09, 2024
Figure 1 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 2 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 3 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 4 for F-LMM: Grounding Frozen Large Multimodal Models
Viaarxiv icon

On the Effects of Data Scale on Computer Control Agents

Add code
Jun 06, 2024
Figure 1 for On the Effects of Data Scale on Computer Control Agents
Figure 2 for On the Effects of Data Scale on Computer Control Agents
Figure 3 for On the Effects of Data Scale on Computer Control Agents
Figure 4 for On the Effects of Data Scale on Computer Control Agents
Viaarxiv icon

Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation

Add code
May 29, 2024
Figure 1 for Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation
Figure 2 for Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation
Figure 3 for Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation
Figure 4 for Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation
Viaarxiv icon

Lifelong Learning and Selective Forgetting via Contrastive Strategy

Add code
May 28, 2024
Figure 1 for Lifelong Learning and Selective Forgetting via Contrastive Strategy
Figure 2 for Lifelong Learning and Selective Forgetting via Contrastive Strategy
Figure 3 for Lifelong Learning and Selective Forgetting via Contrastive Strategy
Figure 4 for Lifelong Learning and Selective Forgetting via Contrastive Strategy
Viaarxiv icon