Picture for Le Zhuo

Le Zhuo

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Add code
May 09, 2024
Viaarxiv icon

ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training

Add code
Feb 28, 2024
Figure 1 for ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Figure 2 for ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Figure 3 for ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Figure 4 for ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Viaarxiv icon

LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions

Add code
Nov 20, 2023
Figure 1 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 2 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 3 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 4 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Viaarxiv icon

GraphText: Graph Reasoning in Text Space

Add code
Oct 02, 2023
Figure 1 for GraphText: Graph Reasoning in Text Space
Figure 2 for GraphText: Graph Reasoning in Text Space
Figure 3 for GraphText: Graph Reasoning in Text Space
Figure 4 for GraphText: Graph Reasoning in Text Space
Viaarxiv icon

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

Add code
Aug 05, 2023
Viaarxiv icon

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

Add code
Jul 12, 2023
Figure 1 for MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Figure 2 for MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Figure 3 for MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Figure 4 for MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Viaarxiv icon

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

Add code
Jul 07, 2023
Figure 1 for LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Figure 2 for LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Figure 3 for LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Figure 4 for LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Viaarxiv icon

Video Background Music Generation: Dataset, Method and Evaluation

Add code
Nov 21, 2022
Figure 1 for Video Background Music Generation: Dataset, Method and Evaluation
Figure 2 for Video Background Music Generation: Dataset, Method and Evaluation
Figure 3 for Video Background Music Generation: Dataset, Method and Evaluation
Figure 4 for Video Background Music Generation: Dataset, Method and Evaluation
Viaarxiv icon