Picture for Chen Zhang

Chen Zhang

SenseTime Research

Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task

Add code
Oct 31, 2024
Figure 1 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 2 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 3 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 4 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Viaarxiv icon

FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space

Add code
Oct 28, 2024
Viaarxiv icon

Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification

Add code
Oct 22, 2024
Figure 1 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 2 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 3 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 4 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Viaarxiv icon

VoiceBench: Benchmarking LLM-Based Voice Assistants

Add code
Oct 22, 2024
Viaarxiv icon

MoDification: Mixture of Depths Made Easy

Add code
Oct 18, 2024
Viaarxiv icon

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes

Add code
Oct 09, 2024
Figure 1 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 2 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 3 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 4 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Viaarxiv icon

Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning

Add code
Oct 07, 2024
Viaarxiv icon

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Add code
Sep 27, 2024
Figure 1 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 2 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 3 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 4 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Viaarxiv icon

EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis

Add code
Sep 27, 2024
Viaarxiv icon

Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification

Add code
Sep 24, 2024
Figure 1 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 2 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 3 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 4 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Viaarxiv icon