Picture for Yu Wang

Yu Wang

University of Oregon

Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"

Add code
Dec 21, 2024
Figure 1 for Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"
Figure 2 for Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"
Figure 3 for Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"
Figure 4 for Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"
Viaarxiv icon

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Add code
Dec 19, 2024
Figure 1 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 2 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 3 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 4 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Viaarxiv icon

GUI Agents: A Survey

Add code
Dec 18, 2024
Figure 1 for GUI Agents: A Survey
Figure 2 for GUI Agents: A Survey
Figure 3 for GUI Agents: A Survey
Figure 4 for GUI Agents: A Survey
Viaarxiv icon

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Add code
Dec 17, 2024
Viaarxiv icon

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Add code
Dec 15, 2024
Figure 1 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 2 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 3 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 4 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Viaarxiv icon

A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation

Add code
Dec 08, 2024
Figure 1 for A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation
Figure 2 for A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation
Figure 3 for A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation
Figure 4 for A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation
Viaarxiv icon

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Add code
Dec 05, 2024
Figure 1 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 2 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 3 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 4 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Viaarxiv icon

ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics

Add code
Dec 04, 2024
Figure 1 for ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics
Figure 2 for ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics
Figure 3 for ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics
Figure 4 for ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics
Viaarxiv icon

Jailbreak Large Vision-Language Models Through Multi-Modal Linkage

Add code
Dec 03, 2024
Figure 1 for Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Figure 2 for Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Figure 3 for Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Figure 4 for Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Viaarxiv icon

Personalized Multimodal Large Language Models: A Survey

Add code
Dec 03, 2024
Viaarxiv icon