Hao Tian

Sichuan University

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Apr 29, 2024

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

Apr 02, 2024

JumpCoder: Go Beyond Autoregressive Coder via Online Modification

Jan 15, 2024

Efficient Asynchronous Federated Learning with Sparsification and Quantization

Jan 06, 2024

COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems

Jan 01, 2024

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Dec 25, 2023

Retrieving Conditions from Reference Images for Diffusion Models

Dec 05, 2023

InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

Nov 30, 2023

Tool-Augmented Reward Modeling

Oct 02, 2023

DiffGAN-F2S: Symmetric and Efficient Denoising Diffusion GANs for Structural Connectivity Prediction from Brain fMRI

Sep 28, 2023