Zihan Li

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Jun 24, 2025
Via arXiv

FIC-TSC: Learning Time Series Classification with Fisher Information Constraint

May 09, 2025
Via arXiv

STPNet: Scale-aware Text Prompt Network for Medical Image Segmentation

Apr 02, 2025
Via arXiv

Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images

Mar 04, 2025
Via arXiv

Power Domain Sparse Dimensional Constellation Multiple Access (PD-SDCMA): A Novel PD-NOMA for More Access Users

Feb 22, 2025
Via arXiv

An Intra- and Cross-frame Topological Consistency Scheme for Semi-supervised Atherosclerotic Coronary Plaque Segmentation

Jan 14, 2025
Via arXiv

LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts

Dec 16, 2024
Via arXiv

VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge

Aug 05, 2024
Via arXiv

Are Large Language Models Possible to Conduct Cognitive Behavioral Therapy?

Jul 25, 2024
Via arXiv

Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases

Jun 19, 2024
Via arXiv