Picture for Tao Jin

Tao Jin

Andrew

WorldEdit: Towards Open-World Image Editing with a Knowledge-Informed Benchmark

Add code
Feb 06, 2026
Viaarxiv icon

HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval

Add code
Jan 22, 2026
Viaarxiv icon

Delving Deeper: Hierarchical Visual Perception for Robust Video-Text Retrieval

Add code
Jan 19, 2026
Viaarxiv icon

Hybrid LLM and Higher-Order Quantum Approximate Optimization for CSA Collateral Management

Add code
Oct 30, 2025
Viaarxiv icon

Chat-Driven Text Generation and Interaction for Person Retrieval

Add code
Sep 16, 2025
Viaarxiv icon

SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

Add code
Sep 04, 2025
Figure 1 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 2 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 3 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 4 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Viaarxiv icon

TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal

Add code
Aug 11, 2025
Viaarxiv icon

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval

Add code
Jun 17, 2025
Viaarxiv icon

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

Add code
May 30, 2025
Viaarxiv icon

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

Add code
May 30, 2025
Viaarxiv icon