Picture for Yuan Gao

Yuan Gao

Department of Information Technology, Uppsala University, Uppsala, Sweden

Valley2: Exploring Multimodal Models with Scalable Vision-Language Design

Add code
Jan 13, 2025
Figure 1 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 2 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 3 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 4 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Viaarxiv icon

Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity

Add code
Jan 08, 2025
Figure 1 for Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity
Figure 2 for Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity
Figure 3 for Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity
Figure 4 for Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity
Viaarxiv icon

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Add code
Dec 24, 2024
Viaarxiv icon

Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach

Add code
Dec 20, 2024
Figure 1 for Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach
Figure 2 for Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach
Figure 3 for Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach
Figure 4 for Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Figure 1 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 2 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 3 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 4 for Owl-1: Omni World Model for Consistent Long Video Generation
Viaarxiv icon

Self-test loss functions for learning weak-form operators and gradient flows

Add code
Dec 04, 2024
Figure 1 for Self-test loss functions for learning weak-form operators and gradient flows
Figure 2 for Self-test loss functions for learning weak-form operators and gradient flows
Figure 3 for Self-test loss functions for learning weak-form operators and gradient flows
Figure 4 for Self-test loss functions for learning weak-form operators and gradient flows
Viaarxiv icon

GrokFormer: Graph Fourier Kolmogorov-Arnold Transformers

Add code
Nov 26, 2024
Viaarxiv icon

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Add code
Nov 21, 2024
Figure 1 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 2 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 3 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 4 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Viaarxiv icon

Intensity Field Decomposition for Tissue-Guided Neural Tomography

Add code
Nov 01, 2024
Viaarxiv icon

Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning

Add code
Oct 29, 2024
Figure 1 for Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning
Figure 2 for Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning
Figure 3 for Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning
Figure 4 for Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning
Viaarxiv icon