Picture for Yao Lu

Yao Lu

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Add code
Sep 06, 2024
Viaarxiv icon

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Add code
Aug 21, 2024
Viaarxiv icon

Enhancing Robustness in Large Language Models: Prompting for Mitigating the Impact of Irrelevant Information

Add code
Aug 20, 2024
Viaarxiv icon

MDM: Advancing Multi-Domain Distribution Matching for Automatic Modulation Recognition Dataset Synthesis

Add code
Aug 05, 2024
Viaarxiv icon

Wolf: Captioning Everything with a World Summarization Framework

Add code
Jul 26, 2024
Figure 1 for Wolf: Captioning Everything with a World Summarization Framework
Figure 2 for Wolf: Captioning Everything with a World Summarization Framework
Figure 3 for Wolf: Captioning Everything with a World Summarization Framework
Figure 4 for Wolf: Captioning Everything with a World Summarization Framework
Viaarxiv icon

$VILA^2$: VILA Augmented VILA

Add code
Jul 24, 2024
Viaarxiv icon

YZS-model: A Predictive Model for Organic Drug Solubility Based on Graph Convolutional Networks and Transformer-Attention

Add code
Jun 27, 2024
Viaarxiv icon

UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis

Add code
Jun 21, 2024
Viaarxiv icon

LFMamba: Light Field Image Super-Resolution with State Space Model

Add code
Jun 18, 2024
Viaarxiv icon

Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation

Add code
Jun 12, 2024
Viaarxiv icon