Picture for Jianzong Wang

Jianzong Wang

From Knowing to Doing Precisely: A General Self-Correction and Termination Framework for VLA models

Add code
Feb 02, 2026
Viaarxiv icon

Attention-weighted Centered Kernel Alignment for Knowledge Distillation in Large Audio-Language Models Applied to Speech Emotion Recognition

Add code
Feb 02, 2026
Viaarxiv icon

CARE: Multi-Task Pretraining for Latent Continuous Action Representation in Robot Control

Add code
Jan 30, 2026
Viaarxiv icon

Triage: Hierarchical Visual Budgeting for Efficient Video Reasoning in Vision-Language Models

Add code
Jan 30, 2026
Viaarxiv icon

MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control

Add code
Jan 30, 2026
Viaarxiv icon

MiTa: A Hierarchical Multi-Agent Collaboration Framework with Memory-integrated and Task Allocation

Add code
Jan 30, 2026
Viaarxiv icon

EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition

Add code
Sep 19, 2025
Viaarxiv icon

MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts

Add code
Jun 09, 2025
Viaarxiv icon

Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning

Add code
Jun 05, 2025
Figure 1 for Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning
Figure 2 for Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning
Figure 3 for Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning
Figure 4 for Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning
Viaarxiv icon

Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy

Add code
Apr 15, 2025
Figure 1 for Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy
Figure 2 for Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy
Figure 3 for Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy
Figure 4 for Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy
Viaarxiv icon