Picture for Yu Lin

Yu Lin

Enhancing Open-Vocabulary Object Detection through Multi-Level Fine-Grained Visual-Language Alignment

Add code
Jan 31, 2026
Viaarxiv icon

Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers

Add code
Jan 15, 2026
Viaarxiv icon

Traceable Drug Recommendation over Medical Knowledge Graphs

Add code
Oct 31, 2025
Figure 1 for Traceable Drug Recommendation over Medical Knowledge Graphs
Figure 2 for Traceable Drug Recommendation over Medical Knowledge Graphs
Figure 3 for Traceable Drug Recommendation over Medical Knowledge Graphs
Figure 4 for Traceable Drug Recommendation over Medical Knowledge Graphs
Viaarxiv icon

S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection

Add code
Apr 15, 2025
Figure 1 for S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection
Figure 2 for S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection
Figure 3 for S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection
Figure 4 for S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection
Viaarxiv icon

RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering

Add code
Jan 19, 2025
Viaarxiv icon

A Robust Anchor-based Method for Multi-Camera Pedestrian Localization

Add code
Oct 25, 2024
Figure 1 for A Robust Anchor-based Method for Multi-Camera Pedestrian Localization
Figure 2 for A Robust Anchor-based Method for Multi-Camera Pedestrian Localization
Figure 3 for A Robust Anchor-based Method for Multi-Camera Pedestrian Localization
Figure 4 for A Robust Anchor-based Method for Multi-Camera Pedestrian Localization
Viaarxiv icon

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant

Add code
Mar 07, 2024
Figure 1 for MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
Figure 2 for MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
Figure 3 for MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
Figure 4 for MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
Viaarxiv icon

SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking

Add code
Feb 26, 2024
Figure 1 for SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Figure 2 for SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Figure 3 for SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Figure 4 for SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Viaarxiv icon

UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts

Add code
Dec 18, 2023
Figure 1 for UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Figure 2 for UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Figure 3 for UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Figure 4 for UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Viaarxiv icon

Double-Flow-based Steganography without Embedding for Image-to-Image Hiding

Add code
Nov 25, 2023
Viaarxiv icon