Di Yang

SoccerNet 2025 Challenges Results

Aug 26, 2025

LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination

Aug 26, 2025

LIA-X: Interpretable Latent Portrait Animator

Aug 13, 2025

AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling

Jan 16, 2025

Small Language Model as Data Prospector for Large Language Model

Dec 13, 2024

Are Visual-Language Models Effective in Action Recognition? A Comparative Study

Oct 22, 2024

RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data

Aug 22, 2024

Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake

May 28, 2024

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

May 26, 2024

CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations

May 16, 2024