Hao Tian

Sichuan University

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 13, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 12, 2024

Deep Hierarchical Graph Alignment Kernels

May 09, 2024

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Apr 29, 2024

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

Apr 02, 2024

JumpCoder: Go Beyond Autoregressive Coder via Online Modification

Jan 15, 2024

Efficient Asynchronous Federated Learning with Sparsification and Quantization

Jan 06, 2024

COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems

Jan 01, 2024

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Dec 25, 2023

Retrieving Conditions from Reference Images for Diffusion Models

Dec 05, 2023