Kun Liu

SurfAAV: Design and Implementation of a Novel Multimodal Surfing Aquatic-Aerial Vehicle

Jun 18, 2025
Via arXiv

From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning

May 30, 2025
Via arXiv

SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

May 19, 2025
Via arXiv

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation

Mar 31, 2025
Via arXiv

Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment

Mar 14, 2025
Via arXiv

KACDP: A Highly Interpretable Credit Default Prediction Model

Nov 26, 2024
Via arXiv

FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications

Sep 05, 2024
Via arXiv

Early Risk Assessment Model for ICA Timing Strategy in Unstable Angina Patients Using Multi-Modal Machine Learning

Aug 08, 2024
Via arXiv

BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision

Feb 07, 2024
Via arXiv

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Human Action Segmentation

Nov 29, 2023
Via arXiv