Picture for Xiaodan Liang

Xiaodan Liang

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Add code
Jun 28, 2024
Figure 1 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 2 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 3 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 4 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Viaarxiv icon

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

Add code
Jun 20, 2024
Figure 1 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 2 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 3 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Figure 4 for FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Viaarxiv icon

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

Add code
Jun 05, 2024
Viaarxiv icon

UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking

Add code
Jun 04, 2024
Figure 1 for UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Figure 2 for UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Figure 3 for UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Figure 4 for UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Viaarxiv icon

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Add code
Jun 03, 2024
Viaarxiv icon

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

Add code
May 29, 2024
Viaarxiv icon

Correctable Landmark Discovery via Large Models for Vision-Language Navigation

Add code
May 29, 2024
Figure 1 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 2 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 3 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 4 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Viaarxiv icon

VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers

Add code
May 28, 2024
Figure 1 for VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Figure 2 for VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Figure 3 for VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Figure 4 for VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Viaarxiv icon

The SkatingVerse Workshop & Challenge: Methods and Results

Add code
May 27, 2024
Figure 1 for The SkatingVerse Workshop & Challenge: Methods and Results
Figure 2 for The SkatingVerse Workshop & Challenge: Methods and Results
Viaarxiv icon

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Add code
May 23, 2024
Figure 1 for DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Figure 2 for DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Figure 3 for DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Figure 4 for DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Viaarxiv icon