Picture for Haifeng Liu

Haifeng Liu

One Token Is Enough: Improving Diffusion Language Models with a Sink Token

Add code
Jan 27, 2026
Viaarxiv icon

Vclip: Face-based Speaker Generation by Face-voice Association Learning

Add code
Jan 06, 2026
Viaarxiv icon

CommonVoice-SpeechRE and RPG-MoGe: Advancing Speech Relation Extraction with a New Dataset and Multi-Order Generative Framework

Add code
Sep 10, 2025
Viaarxiv icon

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Add code
Jan 21, 2025
Viaarxiv icon

Depth Any Video with Scalable Synthetic Data

Add code
Oct 14, 2024
Figure 1 for Depth Any Video with Scalable Synthetic Data
Figure 2 for Depth Any Video with Scalable Synthetic Data
Figure 3 for Depth Any Video with Scalable Synthetic Data
Figure 4 for Depth Any Video with Scalable Synthetic Data
Viaarxiv icon

A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection

Add code
Sep 05, 2024
Figure 1 for A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection
Figure 2 for A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection
Figure 3 for A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection
Figure 4 for A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection
Viaarxiv icon

Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation

Add code
Jul 19, 2024
Figure 1 for Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
Figure 2 for Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
Figure 3 for Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
Figure 4 for Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
Viaarxiv icon

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Add code
Jul 13, 2024
Figure 1 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 2 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 3 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 4 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Viaarxiv icon

Boosting Few-Shot Learning via Attentive Feature Regularization

Add code
Mar 23, 2024
Figure 1 for Boosting Few-Shot Learning via Attentive Feature Regularization
Figure 2 for Boosting Few-Shot Learning via Attentive Feature Regularization
Figure 3 for Boosting Few-Shot Learning via Attentive Feature Regularization
Figure 4 for Boosting Few-Shot Learning via Attentive Feature Regularization
Viaarxiv icon

A Knowledge-Injected Curriculum Pretraining Framework for Question Answering

Add code
Mar 11, 2024
Figure 1 for A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
Figure 2 for A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
Figure 3 for A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
Figure 4 for A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
Viaarxiv icon