Tao Jin

Andrew

Delving Deeper: Hierarchical Visual Perception for Robust Video-Text Retrieval

Jan 19, 2026
Via arXiv

Hybrid LLM and Higher-Order Quantum Approximate Optimization for CSA Collateral Management

Oct 30, 2025

Chat-Driven Text Generation and Interaction for Person Retrieval

Sep 16, 2025

SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

Sep 04, 2025

TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal

Aug 11, 2025

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval

Jun 17, 2025

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

May 30, 2025

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

May 30, 2025

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

May 20, 2025

Observe-R1: Unlocking Reasoning Abilities of MLLMs with Dynamic Progressive Reinforcement Learning

May 18, 2025