
Yan Wang

Fudan University

STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes

Dec 31, 2024

P3S-Diffusion: A Selective Subject-driven Generation Framework via Point Supervision

Dec 27, 2024

Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models

Dec 21, 2024

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering

Dec 19, 2024

Extrapolated Urban View Synthesis Benchmark

Dec 10, 2024

Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch

Dec 04, 2024

DIVD: Deblurring with Improved Video Diffusion Model

Dec 01, 2024

A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation

Dec 01, 2024

CDEMapper: Enhancing NIH Common Data Element Normalization using Large Language Models

Nov 30, 2024

Semantic Data Augmentation for Long-tailed Facial Expression Recognition

Nov 26, 2024